Deploy a Production Ready Kubernetes Cluster

Home server setup

This home server setup utilizes the following:

3 nodes
kube-vip (ARP mode): LoadBalancer
Rook Ceph: persistent storage
Istio: service mesh
Argo CD: GitOps
kubeseal: encrypt Secrets
Gitea: git service
getlocalcert: ACME + DNS management

# ansible
VENVDIR=kubespray-venv
python3 -m venv $VENVDIR
source $VENVDIR/bin/activate
pip install -U -r requirements.txt
pip install ruamel.yaml

cp -rfp inventory/sample inventory/mycluster
declare -a IPS=(192.168.1.191 192.168.1.192 192.168.1.193)
CONFIG_FILE=inventory/mycluster/hosts.yaml python3 contrib/inventory_builder/inventory.py ${IPS[@]}

# if using longhorn, edit k8s_cluster/addons.yaml
registry_storage_class: longhorn

ansible-playbook -i inventory/mycluster/hosts.yaml  --become --become-user=root cluster.yml --extra-vars "ansible_sudo_pass=<password>"

# kubeconfig
mkdir ~/.kube
sudo cp /etc/kubernetes/admin.conf ~/.kube/config
sudo chown -R $(whoami):$(whoami) ~/.kube/config

##########

# argocd
kubectl get secret -n argocd argocd-initial-admin-secret -o jsonpath="{.data.password}" | base64 -d
kubectl port-forward -n argocd svc/argocd-server 8080:443

# persistent storage, choose one: ceph or longhorn
## ceph
### logical volumes for ceph
sudo fdisk /dev/sda # create a partition, 'n' to create, 'w' to write
sudo vgextend vg0 /dev/sda4
sudo lvcreate -n ceph_vl -l 100%FREE vg0
sudo dmsetup info /dev/dm-1 # this should match vg0-ceph_vl
### https://rook.io/docs/rook/v1.15/Helm-Charts/operator-chart/
argocd app create -f argocd-init-apps/rook-ceph-operator.yaml
kubectl get pods -n rook-ceph -l "app=rook-ceph-operator"
argocd app create -f argocd-init-apps/rook-ceph-cluster.yaml
kubectl get cephcluster -n rook-ceph
kubectl krew install rook-ceph
## longhorn
argocd app create -f argocd-init-apps/longhorn.yaml

# gitea
# https://docs.gitea.com/installation/install-on-kubernetes
argocd app create -f argocd-init-apps/gitea.yaml
kubectl port-forward -n gitea svc/gitea-http 8081:3000 # kube-vip LoadBalancer isn't ready yet

# kubeseal
argocd app create -f argocd-init-apps/sealed-secrets.yaml
kubeseal --fetch-cert > kubeseal-cert.pem

# add gitea credentials to argocd
export gitea_username=$(echo -n "<username>")
export gitea_password=$(echo -n "<password>")
yq e '.stringData.username = strenv(gitea_username) | .stringData.password = strenv(gitea_password)' argocd-init-apps/gitea-credentials.tpl > argocd-init-apps/gitea-credentials.unsealed.yaml
kubeseal --cert kubeseal-cert.pem --format yaml < argocd-init-apps/gitea-credentials.unsealed.yaml > argocd-init-apps/gitea-credentials.sealed.yaml
kubectl create -f argocd-init-apps/gitea-credentials.sealed.yaml

# app of apps
kubectl create -f argocd-init-apps/app-of-apps.yaml
## lightweight version
kubectl create -f argocd-init-apps/app-of-apps-lw.yaml

##########

# kube-vip
# https://kube-vip.io/docs/usage/cloud-provider/
kubectl get gtw -n homeserver homeserver-gateway # LoadBalancer should be operational now

# gitea runner
# https://gitea.com/gitea/act_runner/src/tag/v0.2.11/examples/kubernetes
export gitea_token=$(echo -n "<token>")
yq e '.stringData.token = strenv(gitea_token)' argocd-apps/gitea-token.tpl > argocd-apps/gitea-token.unsealed.yaml
kubeseal --cert kubeseal-cert.pem --format yaml < argocd-apps/gitea-token.unsealed.yaml > argocd-apps/gitea-token.sealed.yaml
git add argocd-apps/gitea-token.sealed.yaml

# use acmedns
# https://docs.getlocalcert.net/acme-clients/cert-manager/
export acmedns=`cat localcert-acmedns.json`
yq e '.stringData."acmedns.json" = strenv(acmedns)' argocd-apps/localcert-acmedns.tpl > argocd-apps/localcert-acmedns.unsealed.yaml
kubeseal --cert kubeseal-cert.pem --format yaml < argocd-apps/localcert-acmedns.unsealed.yaml > argocd-apps/localcert-acmedns.sealed.yaml
git add argocd-apps/localcert-acmedns.sealed.yaml

Deploy a Production Ready Kubernetes Cluster

If you have questions, check the documentation at kubespray.io and join us on the kubernetes slack, channel #kubespray. You can get your invite here

Can be deployed on AWS, GCE, Azure, OpenStack, vSphere, Equinix Metal (bare metal), Oracle Cloud Infrastructure (Experimental), or Baremetal
Highly available cluster
Composable (Choice of the network plugin for instance)
Supports most popular Linux distributions
Continuous integration tests

Quick Start

Below are several ways to use Kubespray to deploy a Kubernetes cluster.

Ansible

Usage

Install Ansible according to Ansible installation guide then run the following steps:

# Copy ``inventory/sample`` as ``inventory/mycluster``
cp -rfp inventory/sample inventory/mycluster

# Update Ansible inventory file with inventory builder
declare -a IPS=(10.10.1.3 10.10.1.4 10.10.1.5)
CONFIG_FILE=inventory/mycluster/hosts.yaml python3 contrib/inventory_builder/inventory.py ${IPS[@]}

# Review and change parameters under ``inventory/mycluster/group_vars``
cat inventory/mycluster/group_vars/all/all.yml
cat inventory/mycluster/group_vars/k8s_cluster/k8s-cluster.yml

# Clean up old Kubernetes cluster with Ansible Playbook - run the playbook as root
# The option `--become` is required, as for example cleaning up SSL keys in /etc/,
# uninstalling old packages and interacting with various systemd daemons.
# Without --become the playbook will fail to run!
# And be mind it will remove the current kubernetes cluster (if it's running)!
ansible-playbook -i inventory/mycluster/hosts.yaml  --become --become-user=root reset.yml

# Deploy Kubespray with Ansible Playbook - run the playbook as root
# The option `--become` is required, as for example writing SSL keys in /etc/,
# installing packages and interacting with various systemd daemons.
# Without --become the playbook will fail to run!
ansible-playbook -i inventory/mycluster/hosts.yaml  --become --become-user=root cluster.yml

Note: When Ansible is already installed via system packages on the control node, Python packages installed via sudo pip install -r requirements.txt will go to a different directory tree (e.g. /usr/local/lib/python2.7/dist-packages on Ubuntu) from Ansible's (e.g. /usr/lib/python2.7/dist-packages/ansible still on Ubuntu). As a consequence, the ansible-playbook command will fail with:

ERROR! no action detected in task. This often indicates a misspelled module name, or incorrect module path.

This likely indicates that a task depends on a module present in requirements.txt.

One way of addressing this is to uninstall the system Ansible package then reinstall Ansible via pip, but this not always possible and one must take care regarding package versions. A workaround consists of setting the ANSIBLE_LIBRARY and ANSIBLE_MODULE_UTILS environment variables respectively to the ansible/modules and ansible/module_utils subdirectories of the pip installation location, which is the Location shown by running pip show [package] before executing ansible-playbook.

A simple way to ensure you get all the correct version of Ansible is to use the pre-built docker image from Quay. You will then need to use bind mounts to access the inventory and SSH key in the container, like this:

git checkout v2.25.0
docker pull quay.io/kubespray/kubespray:v2.25.0
docker run --rm -it --mount type=bind,source="$(pwd)"/inventory/sample,dst=/inventory \
  --mount type=bind,source="${HOME}"/.ssh/id_rsa,dst=/root/.ssh/id_rsa \
  quay.io/kubespray/kubespray:v2.25.0 bash
# Inside the container you may now run the kubespray playbooks:
ansible-playbook -i /inventory/inventory.ini --private-key /root/.ssh/id_rsa cluster.yml

Collection

See here if you wish to use this repository as an Ansible collection

Vagrant

For Vagrant we need to install Python dependencies for provisioning tasks. Check that Python and pip are installed:

python -V && pip -V

If this returns the version of the software, you're good to go. If not, download and install Python from here https://www.python.org/downloads/source/

Install Ansible according to Ansible installation guide then run the following step:

vagrant up

Documents

Supported Linux Distributions

Flatcar Container Linux by Kinvolk
Debian Bookworm, Bullseye
Ubuntu 20.04, 22.04, 24.04
CentOS/RHEL 8, 9
Fedora 37, 38
Fedora CoreOS (see fcos Note)
openSUSE Leap 15.x/Tumbleweed
Oracle Linux 8, 9
Alma Linux 8, 9
Rocky Linux 8, 9
Kylin Linux Advanced Server V10 (experimental: see kylin linux notes)
Amazon Linux 2 (experimental: see amazon linux notes)
UOS Linux (experimental: see uos linux notes)
openEuler (experimental: see openEuler notes)

Note: Upstart/SysV init based OS types are not supported.

Supported Components

Core
- kubernetes v1.30.4
- etcd v3.5.12
- docker v26.1
- containerd v1.7.21
- cri-o v1.30.3 (experimental: see CRI-O Note. Only on fedora, ubuntu and centos based OS)
Network Plugin
- cni-plugins v1.2.0
- calico v3.28.1
- cilium v1.15.4
- flannel v0.22.0
- kube-ovn v1.12.21
- kube-router v2.0.0
- multus v3.8
- weave v2.8.7
- kube-vip v0.8.0
Application
- cert-manager v1.14.7
- coredns v1.11.1
- ingress-nginx v1.11.2
- krew v0.4.4
- argocd v2.11.0
- helm v3.15.4
- metallb v0.13.9
- registry v2.8.1
Storage Plugin
- cephfs-provisioner v2.1.0-k8s1.11
- rbd-provisioner v2.1.1-k8s1.11
- aws-ebs-csi-plugin v0.5.0
- azure-csi-plugin v1.10.0
- cinder-csi-plugin v1.30.0
- gcp-pd-csi-plugin v1.9.2
- local-path-provisioner v0.0.24
- local-volume-provisioner v2.5.0
- node-feature-discovery v0.16.4

Container Runtime Notes

The cri-o version should be aligned with the respective kubernetes version (i.e. kube_version=1.20.x, crio_version=1.20)

Requirements

Minimum required version of Kubernetes is v1.28
Ansible v2.14+, Jinja 2.11+ and python-netaddr is installed on the machine that will run Ansible commands
The target servers must have access to the Internet in order to pull docker images. Otherwise, additional configuration is required (See Offline Environment)
The target servers are configured to allow IPv4 forwarding.
If using IPv6 for pods and services, the target servers are configured to allow IPv6 forwarding.
The firewalls are not managed, you'll need to implement your own rules the way you used to. in order to avoid any issue during deployment you should disable your firewall.
If kubespray is run from non-root user account, correct privilege escalation method should be configured in the target servers. Then the ansible_become flag or command parameters --become or -b should be specified.

Hardware: These limits are safeguarded by Kubespray. Actual requirements for your workload can differ. For a sizing guide go to the Building Large Clusters guide.

Master
- Memory: 1500 MB
Node
- Memory: 1024 MB

Network Plugins

You can choose among ten network plugins. (default: calico, except Vagrant uses flannel)

flannel: gre/vxlan (layer 2) networking.
Calico is a networking and network policy provider. Calico supports a flexible set of networking options designed to give you the most efficient networking across a range of situations, including non-overlay and overlay networks, with or without BGP. Calico uses the same engine to enforce network policy for hosts, pods, and (if using Istio and Envoy) applications at the service mesh layer.
cilium: layer 3/4 networking (as well as layer 7 to protect and secure application protocols), supports dynamic insertion of BPF bytecode into the Linux kernel to implement security services, networking and visibility logic.
weave: Weave is a lightweight container overlay network that doesn't require an external K/V database cluster. (Please refer to weave troubleshooting documentation).
kube-ovn: Kube-OVN integrates the OVN-based Network Virtualization with Kubernetes. It offers an advanced Container Network Fabric for Enterprises.
kube-router: Kube-router is a L3 CNI for Kubernetes networking aiming to provide operational simplicity and high performance: it uses IPVS to provide Kube Services Proxy (if setup to replace kube-proxy), iptables for network policies, and BGP for ods L3 networking (with optionally BGP peering with out-of-cluster BGP peers). It can also optionally advertise routes to Kubernetes cluster Pods CIDRs, ClusterIPs, ExternalIPs and LoadBalancerIPs.
macvlan: Macvlan is a Linux network driver. Pods have their own unique Mac and Ip address, connected directly the physical (layer 2) network.
multus: Multus is a meta CNI plugin that provides multiple network interface support to pods. For each interface Multus delegates CNI calls to secondary CNI plugins such as Calico, macvlan, etc.
custom_cni : You can specify some manifests that will be applied to the clusters to bring you own CNI and use non-supported ones by Kubespray. See tests/files/custom_cni/README.md and tests/files/custom_cni/values.yamlfor an example with a CNI provided by a Helm Chart.

The network plugin to use is defined by the variable kube_network_plugin. There is also an option to leverage built-in cloud provider networking instead. See also Network checker.

Ingress Plugins

nginx: the NGINX Ingress Controller.
metallb: the MetalLB bare-metal service LoadBalancer provider.

Community docs and resources

Tools and projects on top of Kubespray

CI Tests

CI/end-to-end tests sponsored by: CNCF, Equinix Metal, OVHcloud, ELASTX.

See the test matrix for details.

Name		Name	Last commit message	Last commit date
Latest commit History 7,805 Commits
.github		.github
.gitlab-ci		.gitlab-ci
argocd-apps-lw		argocd-apps-lw
argocd-apps		argocd-apps
argocd-init-apps		argocd-init-apps
contrib		contrib
docs		docs
extra_playbooks		extra_playbooks
inventory		inventory
library		library
logo		logo
meta		meta
playbooks		playbooks
plugins/modules		plugins/modules
roles		roles
scripts		scripts
test-infra		test-infra
tests		tests
.ansible-lint		.ansible-lint
.ansible-lint-ignore		.ansible-lint-ignore
.editorconfig		.editorconfig
.gitattributes		.gitattributes
.gitignore		.gitignore
.gitlab-ci.yml		.gitlab-ci.yml
.gitmodules		.gitmodules
.md_style.rb		.md_style.rb
.mdlrc		.mdlrc
.nojekyll		.nojekyll
.pre-commit-config.yaml		.pre-commit-config.yaml
.yamllint		.yamllint
CHANGELOG.md		CHANGELOG.md
CNAME		CNAME
CONTRIBUTING.md		CONTRIBUTING.md
Dockerfile		Dockerfile
LICENSE		LICENSE
Makefile		Makefile
OWNERS		OWNERS
OWNERS_ALIASES		OWNERS_ALIASES
README.md		README.md
RELEASE.md		RELEASE.md
SECURITY_CONTACTS		SECURITY_CONTACTS
Vagrantfile		Vagrantfile
_config.yml		_config.yml
ansible.cfg		ansible.cfg
cluster.yml		cluster.yml
code-of-conduct.md		code-of-conduct.md
galaxy.yml		galaxy.yml
index.html		index.html
localcert-acmedns.json.example		localcert-acmedns.json.example
pipeline.Dockerfile		pipeline.Dockerfile
recover-control-plane.yml		recover-control-plane.yml
remove-node.yml		remove-node.yml
requirements.txt		requirements.txt
reset.yml		reset.yml
run.rc		run.rc
scale.yml		scale.yml
setup.cfg		setup.cfg
setup.py		setup.py
upgrade-cluster.yml		upgrade-cluster.yml

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Home server setup

Deploy a Production Ready Kubernetes Cluster

Quick Start

Ansible

Usage

Collection

Vagrant

Documents

Supported Linux Distributions

Supported Components

Container Runtime Notes

Requirements

Network Plugins

Ingress Plugins

Community docs and resources

Tools and projects on top of Kubespray

CI Tests

About

Releases

Packages

Languages

License

nguyenminh-phuc/k8s-homeserver

Folders and files

Latest commit

History

Repository files navigation

Home server setup

Deploy a Production Ready Kubernetes Cluster

Quick Start

Ansible

Usage

Collection

Vagrant

Documents

Supported Linux Distributions

Supported Components

Container Runtime Notes

Requirements

Network Plugins

Ingress Plugins

Community docs and resources

Tools and projects on top of Kubespray

CI Tests

About

Resources

License

Code of conduct

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages