Launching OVN-Kubernetes with kubeadm

The following is a walkthrough for an installation in an environment with 4 virtual machines, and a cluster deployed with kubeadm. This shall serve as a guide for people who are curious enough to deploy OVN-Kubernetes on a manually created cluster and to play around with the components.

Note that the resulting environment might be highly unstable.

If your goal is to set up an environment quickly or to set up a development environment, see the kind installation documentation instead.

Environment setup¶

Overview¶

The environment consists of 4 libvirt/qemu virtual machines, all deployed with Rocky Linux 8 or CentOS 8. node1 will serve as the sole master node and nodes node2 and node3 as the worker nodes. gw1 will be the default gateway for the cluster via the Isolated Network. It will also host an HTTP registry to store the OVN-Kubernetes images.

       to hypervisor         to hypervisor         to hypervisor
             │                     │                     │
             │                     │                     │
           ┌─┴─┐                 ┌─┴─┐                 ┌─┴─┐
           │if1│                 │if1│                 │if1│
     ┌─────┴───┴─────┐     ┌─────┴───┴─────┐     ┌─────┴───┴─────┐
     │               │     │               │     │               │
     │               │     │               │     │               │
     │     node1     │     │     node2     │     │     node3     │
     │               │     │               │     │               │
     │               │     │               │     │               │
     └─────┬───┬─────┘     └─────┬───┬─────┘     └─────┬───┬─────┘
           │if2│                 │if2│                 │if2│
           └─┬─┘                 └─┬─┘                 └─┬─┘
             │                     │                     │
             │                     │                     │
             │                    xxxxxxxx               │
             │                 xxx       xxx             │
             │                xx           xx            │
             │               x   Isolated   x            │
             └──────────────x     Network   x────────────┘
                            xxx            x
                              xxxxxx  xxxxx
                                   xxxx
                                   │
                                 ┌─┴─┐
                                 │if2│
                           ┌─────┴───┴─────┐
                           │               │
                           │               │
                           │      gw1      │
                           │               │
                           │               │
                           └─────┬───┬─────┘
                                 │if1│
                                 └─┬─┘
                                   │
                                   │
                              to hypervisor

Legend: * if1 - enp1s0 | 192.168.122.0/24 * if2 - enp7s0 | 192.168.123.0/24

to hypervisor is libvirt's default network with full DHCP. It will be used as management access to all nodes as well as on gw1 as the interface for outside connectivity:

$ sudo virsh net-dumpxml default
<network connections='2'>
  <name>default</name>
  <uuid>76b7e8c1-7c2c-456b-ac10-09c98c6275a5</uuid>
  <forward mode='nat'>
    <nat>
      <port start='1024' end='65535'/>
    </nat>
  </forward>
  <bridge name='virbr0' stp='on' delay='0'/>
  <mac address='52:54:00:4b:4d:f8'/>
  <ip address='192.168.122.1' netmask='255.255.255.0'>
    <dhcp>
      <range start='192.168.122.2' end='192.168.122.254'/>
    </dhcp>
  </ip>
</network>

And Isolated Network is an isolated network. gw1 will be the default gateway for this network, and node1 through node3 will have their default route go through this network:

$ sudo virsh net-dumpxml ovn
<network connections='2'>
  <name>ovn</name>
  <uuid>fecea98b-8b92-438e-a759-f6cfb366614c</uuid>
  <bridge name='virbr2' stp='on' delay='0'/>
  <mac address='52:54:00:d4:f2:cc'/>
  <domain name='ovn'/>
</network>

Gateway setup (gw1)¶

Deploy the gateway virtual machine first. Set it up as a simple gateway which will NAT everything that comes in on interface enp7s0:

IF1=enp1s0
IF2=enp7s0
hostnamectl set-hostname gw1
nmcli conn mod ${IF1} connection.autoconnect yes
nmcli conn mod ${IF2} ipv4.address 192.168.123.254/24
nmcli conn mod ${IF2} ipv4.method static
nmcli conn mod ${IF2} connection.autoconnect yes
nmcli conn reload
systemctl stop firewalld
cat /proc/sys/net/ipv4/ip_forward
sysctl -a | grep ip_forward
echo "net.ipv4.ip_forward=1" >> /etc/sysctl.d/99-sysctl.conf
sysctl --system
yum install iptables-services -y
yum remove firewalld -y
systemctl enable --now iptables
iptables-save
iptables -t nat -I POSTROUTING --src 192.168.123.0/24  -j MASQUERADE
iptables -I FORWARD --j ACCEPT
iptables -I INPUT -p tcp --dport 5000 -j ACCEPT
iptables-save > /etc/sysconfig/iptables

Also set up an HTTP registry (optional — only needed if you plan to mirror the OVN-Kubernetes image to gw1 instead of pulling from a public registry):

yum install podman -y
mkdir -p /opt/registry/data
podman run --name mirror-registry \
  -p 5000:5000 -v /opt/registry/data:/var/lib/registry:z      \
  -d docker.io/library/registry:2
podman generate systemd --name mirror-registry > /etc/systemd/system/mirror-registry-container.service
systemctl daemon-reload
systemctl enable --now mirror-registry-container

Now, reboot the gateway:

reboot

node1 through node3 base setup¶

You must install Open vSwitch on node1 through node3. You will then connect enp7s0 to an OVS bridge called br-ex. This bridge will be used later by OVN-Kubernetes. Furthermore, you must assign IP addresses to br-ex and point the nodes' default route via br-ex to gw1.

Set hostnames¶

Set the hostnames manually, even if they are set correctly by DHCP. Set them manually to:

hostnamectl set-hostname node<x>

Disable swap¶

Make sure to disable swap. Kubelet will not run otherwise:

sed -i '/ swap /d' /etc/fstab
reboot

Remove firewalld¶

Make sure to uninstall firewalld. Otherwise, it will block the kubernetes management ports (that can easily be fixed by configuration) and it will also preempt and block the OVN-Kubernetes installed NAT and FORWARD rules (this is more difficult to remediate). The easiest fix is hence not to use firewalld at all:

systemctl disable --now firewalld
yum remove -y firewalld

For more details, see https://gitmemory.com/issue/firewalld/firewalld/767/790687269; this is about Calico, but it highlights the same issue.

Install Open vSwitch¶

Install Open vSwitch from https://wiki.centos.org/SpecialInterestGroup/NFV

On CentOS:¶

yum install centos-release-nfv-openvswitch -y
yum install openvswitch2.13 --nobest -y
yum install NetworkManager-ovs.x86_64 -y
systemctl enable --now openvswitch

On Rocky Linux¶

Rocky doesn't have access to CentOS's repositories. However, you can still use the CentOS NFV repositories:

rpm -ivh http://mirror.centos.org/centos/8-stream/extras/x86_64/os/Packages/centos-release-nfv-common-1-3.el8.noarch.rpm --nodeps
rpm -ivh http://mirror.centos.org/centos/8-stream/extras/x86_64/os/Packages/centos-release-nfv-openvswitch-1-3.el8.noarch.rpm
yum install openvswitch2.13 --nobest -y
yum install NetworkManager-ovs.x86_64 -y
systemctl enable --now openvswitch

Alternatively, on Rocky Linux, you can also build your own RPMs directly from the SRPMs (optional — only needed if the NFV SIG packages above are unavailable or you need a specific OVS build), e.g.:

yum install '@Development Tools'
yum install desktop-file-utils libcap-ng-devel libmnl-devel numactl-devel openssl-devel python3-devel python3-pyOpenSSL python3-setuptools python3-sphinx rdma-core-devel unbound-devel -y
rpmbuild --rebuild  http://ftp.redhat.com/pub/redhat/linux/enterprise/8Base/en/Fast-Datapath/SRPMS/openvswitch2.13-2.13.0-79.el8fdp.src.rpm
yum install selinux-policy-devel -y
rpmbuild --rebuild http://ftp.redhat.com/pub/redhat/linux/enterprise/8Base/en/Fast-Datapath/SRPMS/openvswitch-selinux-extra-policy-1.0-28.el8fdp.src.rpm
yum localinstall /root/rpmbuild/RPMS/noarch/openvswitch-selinux-extra-policy-1.0-28.el8.noarch.rpm /root/rpmbuild/RPMS/x86_64/openvswitch2.13-2.13.0-79.el8.x86_64.rpm -y
yum install NetworkManager-ovs.x86_64 -y
systemctl enable --now openvswitch

Configure networking¶

Set up networking:

BRIDGE_NAME=br-ex
IF1=enp1s0
IF2=enp7s0
IP_ADDRESS="192.168.123.$(hostname | sed 's/node//')/24"

Verify the IP_ADDRESS - it should be unique for every node and the last octet should be the same as the node's numeric identifier:

echo $IP_ADDRESS

Then, continue:

nmcli c add type ovs-bridge conn.interface ${BRIDGE_NAME} con-name ${BRIDGE_NAME}
nmcli c add type ovs-port conn.interface ${BRIDGE_NAME} master ${BRIDGE_NAME} con-name ovs-port-${BRIDGE_NAME}
nmcli c add type ovs-interface slave-type ovs-port conn.interface ${BRIDGE_NAME} master ovs-port-${BRIDGE_NAME}  con-name ovs-if-${BRIDGE_NAME}
nmcli c add type ovs-port conn.interface ${IF2} master ${BRIDGE_NAME} con-name ovs-port-${IF2}
nmcli c add type ethernet conn.interface ${IF2} master ovs-port-${IF2} con-name ovs-if-${IF2}
nmcli conn delete ${IF2}
nmcli conn mod ${BRIDGE_NAME} connection.autoconnect yes
nmcli conn mod ovs-if-${BRIDGE_NAME} connection.autoconnect yes
nmcli conn mod ovs-if-${IF2} connection.autoconnect yes
nmcli conn mod ovs-port-${IF2} connection.autoconnect yes
nmcli conn mod ovs-port-${BRIDGE_NAME} connection.autoconnect yes
nmcli conn mod ovs-if-${BRIDGE_NAME} ipv4.address ${IP_ADDRESS}
nmcli conn mod ovs-if-${BRIDGE_NAME} ipv4.method static
nmcli conn mod ovs-if-${BRIDGE_NAME} ipv4.route-metric 50

# move the default route to br-ex
BRIDGE_NAME=br-ex
nmcli conn mod ovs-if-${BRIDGE_NAME} ipv4.gateway "192.168.123.254"
nmcli conn mod ${IF1} ipv4.never-default yes
# Change DNS to 8.8.8.8
nmcli conn mod ${IF1} ipv4.ignore-auto-dns yes
nmcli conn mod ovs-if-${BRIDGE_NAME} ipv4.dns "8.8.8.8"

Now, reboot the node:

reboot

After the reboot, you should see something like this, for example on node1:

[root@node1 ~]# cat /etc/resolv.conf 
# Generated by NetworkManager
nameserver 8.8.8.8
[root@node1 ~]# ovs-vsctl show
c1aee179-b425-4b48-8648-dd8746f59add
    Bridge br-ex
        Port enp7s0
            Interface enp7s0
                type: system
        Port br-ex
            Interface br-ex
                type: internal
    ovs_version: "2.13.4"
[root@node1 ~]# ip r
default via 192.168.123.254 dev br-ex proto static metric 800 
192.168.122.0/24 dev enp1s0 proto kernel scope link src 192.168.122.205 metric 100 
192.168.123.0/24 dev br-ex proto kernel scope link src 192.168.123.1 metric 800 
[root@node1 ~]# ip a ls dev br-ex
6: br-ex: <BROADCAST,MULTICAST,UP,LOWER_UP> mtu 1500 qdisc noqueue state UNKNOWN group default qlen 1000
    link/ether 26:98:69:4a:d7:43 brd ff:ff:ff:ff:ff:ff
    inet 192.168.123.1/24 brd 192.168.123.255 scope global noprefixroute br-ex
       valid_lft forever preferred_lft forever
    inet6 fe80::4a1d:4d35:7c28:1ff2/64 scope link noprefixroute 
       valid_lft forever preferred_lft forever
[root@node1 ~]# nmcli conn
NAME             UUID                                  TYPE           DEVICE 
ovs-if-br-ex     d434980e-ea23-4ab4-8414-289b7af44c50  ovs-interface  br-ex  
enp1s0           52060cdd-913e-4df8-9e9e-776f31647323  ethernet       enp1s0 
br-ex            950f405f-cd5c-4d51-b2ab-3d8e1e938c8b  ovs-bridge     br-ex  
ovs-if-enp7s0    0279d1c9-212c-4be8-8dfe-88a7b0b6d623  ethernet       enp7s0 
ovs-port-br-ex   3b47e5ae-a27a-4522-bea5-1fbf9c8c08eb  ovs-port       br-ex  
ovs-port-enp7s0  1baea5a3-09ee-4972-8f6b-bb8195ae46c4  ovs-port       enp7s0

And you should be able to ping outside of the cluster:

[root@node1 ~]# ping -c1 -W1 8.8.8.8
PING 8.8.8.8 (8.8.8.8) 56(84) bytes of data.
64 bytes from 8.8.8.8: icmp_seq=1 ttl=112 time=18.5 ms

--- 8.8.8.8 ping statistics ---
1 packets transmitted, 1 received, 0% packet loss, time 0ms
rtt min/avg/max/mdev = 18.506/18.506/18.506/0.000 ms

Install container runtime engine and kubeadm (node1, node2, node3)¶

The following will be a brief walkthrough of what's requried to install the container runtime and kubernetes. For further details, follow the kubeadm documentation: * https://kubernetes.io/docs/setup/production-environment/tools/kubeadm/create-cluster-kubeadm/ * https://kubernetes.io/docs/setup/production-environment/tools/kubeadm/install-kubeadm/

Install the container runtime¶

See https://kubernetes.io/docs/setup/production-environment/container-runtimes/ for further details.

Set up iptables:

# Create the .conf file to load the modules at bootup
cat <<EOF | sudo tee /etc/modules-load.d/crio.conf
overlay
br_netfilter
EOF

sudo modprobe overlay
sudo modprobe br_netfilter

# Set up required sysctl params, these persist across reboots.
cat <<EOF | sudo tee /etc/sysctl.d/99-kubernetes-cri.conf
net.bridge.bridge-nf-call-iptables  = 1
net.ipv4.ip_forward                 = 1
net.bridge.bridge-nf-call-ip6tables = 1
EOF

sudo sysctl --system

Then, install cri-o. The kubic OBS repository older docs reference has been retired; modern cri-o ships from pkgs.k8s.io. Pin to the latest cri-o stable stream that's actually published (cri-o stable streams typically trail upstream Kubernetes by one or two minor versions, so they can be older than the kubeadm you install below):

CRIO_VERSION=v1.32
cat <<EOF > /etc/yum.repos.d/cri-o.repo
[cri-o]
name=CRI-O
baseurl=https://pkgs.k8s.io/addons:/cri-o:/stable:/${CRIO_VERSION}/rpm/
enabled=1
gpgcheck=1
gpgkey=https://pkgs.k8s.io/addons:/cri-o:/stable:/${CRIO_VERSION}/rpm/repodata/repomd.xml.key
EOF
yum install cri-o -y

Make sure to set 192.168.123.254 (gw1) as an insecure registry:

cat <<'EOF' | tee /etc/containers/registries.conf.d/999-insecure.conf
[[registry]]
location = "192.168.123.254:5000"
insecure = true
EOF

Also, make sure to remove /etc/cni/net.d/100-crio-bridge.conf as we do not want to fall back to crio's default networking:

mv /etc/cni/net.d/100-crio-bridge.conf /root/.

Note: If you forget to move or delete this file, your CoreDNS pods will come up with an IP address in the 10.0.0.0/8 range.

Finally, start crio:

systemctl daemon-reload
systemctl enable crio --now

Install kubelet, kubectl, kubeadm¶

See https://kubernetes.io/docs/setup/production-environment/tools/kubeadm/install-kubeadm/#installing-kubeadm-kubelet-and-kubectl for further details.

KUBE_VERSION=v1.35
cat <<EOF | sudo tee /etc/yum.repos.d/kubernetes.repo
[kubernetes]
name=Kubernetes
baseurl=https://pkgs.k8s.io/core:/stable:/${KUBE_VERSION}/rpm/
enabled=1
gpgcheck=1
gpgkey=https://pkgs.k8s.io/core:/stable:/${KUBE_VERSION}/rpm/repodata/repomd.xml.key
exclude=kubelet kubeadm kubectl cri-tools kubernetes-cni
EOF

# Set SELinux in permissive mode (effectively disabling it)
sudo setenforce 0
sudo sed -i 's/^SELINUX=enforcing$/SELINUX=permissive/' /etc/selinux/config

sudo yum install -y kubelet kubeadm kubectl conntrack-tools --disableexcludes=kubernetes

sudo systemctl enable --now kubelet

Deploying a cluster with OVN-Kubernetes¶

Execute the following instructions only on the master node, node1.

Install instructions for kubeadm¶

Deploy on the master node node1. Use CIDRs that match the OVN-Kubernetes Helm chart defaults, and skip the kube-proxy addon (OVN-Kubernetes provides its own service implementation):

kubeadm init \
    --pod-network-cidr=10.244.0.0/16 \
    --service-cidr=10.96.0.0/16 \
    --apiserver-advertise-address=192.168.123.1 \
    --skip-phases=addon/kube-proxy
mkdir -p $HOME/.kube
sudo cp -i /etc/kubernetes/admin.conf $HOME/.kube/config
sudo chown $(id -u):$(id -g) $HOME/.kube/config

Write down the join command for worker nodes - you will need it later.

You will now have a one node cluster without a CNI plugin and as such the CoreDNS pods will not start:

[root@node1 ~]# kubectl get pods -o wide -A
NAMESPACE     NAME                            READY   STATUS              RESTARTS   AGE   IP                NODE    NOMINATED NODE   READINESS GATES
kube-system   coredns-78fcd69978-dvpjg        0/1     ContainerCreating   0          21s   <none>            node1   <none>           <none>
kube-system   coredns-78fcd69978-mzpzr        0/1     ContainerCreating   0          21s   <none>            node1   <none>           <none>
kube-system   etcd-node1                      1/1     Running             2          33s   192.168.122.205   node1   <none>           <none>
kube-system   kube-apiserver-node1            1/1     Running             2          33s   192.168.122.205   node1   <none>           <none>
kube-system   kube-controller-manager-node1   1/1     Running             3          33s   192.168.122.205   node1   <none>           <none>
kube-system   kube-scheduler-node1            1/1     Running             3          28s   192.168.122.205   node1   <none>           <none>

Now, deploy OVN-Kubernetes - see below.

Deploying OVN-Kubernetes on node1¶

Several of the next sub-steps — install build dependencies, install Go, build the OVN-Kubernetes image, and push it to the local registry — are optional: they're only needed if you want to build a custom OVN-Kubernetes image and serve it from gw1's registry. The clone step in between is required either way, because helm install runs from the chart directory inside the clone. If you're happy using the public ghcr.io/ovn-kubernetes/ovn-kubernetes/ovn-kube-ubuntu:master image, run only the clone step below, then jump to the helm install step further down and adjust --set global.image.repository / --set global.image.tag accordingly.

Install build dependencies and create a softlink for pip to pip3:

yum install git python3-pip make podman buildah -y
ln -s $(which pip3) /usr/local/bin/pip

Install golang, for further details see https://golang.org/doc/install:

curl -L -O https://golang.org/dl/go1.17.linux-amd64.tar.gz
rm -rf /usr/local/go && tar -C /usr/local -xzf go1.17.linux-amd64.tar.gz
echo "export PATH=$PATH:/usr/local/go/bin" >> ~/.bashrc
source ~/.bashrc
go version

Now, clone the OVN-Kubernetes repository:

yum install -y git
mkdir -p $HOME/work/src/github.com/ovn-kubernetes
cd $HOME/work/src/github.com/ovn-kubernetes
git clone https://github.com/ovn-kubernetes/ovn-kubernetes
cd $HOME/work/src/github.com/ovn-kubernetes/ovn-kubernetes/dist/images

Build the latest ovn-daemonset image and push it to the registry. Prepare the binaries:

# Build ovn docker image
pushd ../../go-controller
make
popd

# Build ovn kube image
# Find all built executables, but ignore the 'windows' directory if it exists
find ../../go-controller/_output/go/bin/ -maxdepth 1 -type f -exec cp -f {} . \;
echo "ref: $(git rev-parse  --symbolic-full-name HEAD)  commit: $(git rev-parse  HEAD)" > git_info

Now, build and push the image with:

OVN_IMAGE=192.168.123.254:5000/ovn-daemonset-fedora:latest
buildah bud -t $OVN_IMAGE -f Dockerfile.fedora .
podman push $OVN_IMAGE

Before starting OVN-Kubernetes, work around an issue where br-int is added by OVN but the necessary files in /var/run/openvswitch are not created until Open vSwitch is restarted (see Issues / workarounds). This only matters on the master, so pre-create br-int there:

ovs-vsctl add-br br-int

Single-node-zone interconnect requires every node to be labeled with its zone name before the Helm install:

for n in node1 node2 node3; do kubectl label node $n k8s.ovn.org/zone-name=$n --overwrite; done

Next, install OVN-Kubernetes with the Helm chart, pointing global.image.repository at the image you just pushed (or use the public ghcr.io/ovn-kubernetes/ovn-kubernetes/ovn-kube-ubuntu image to skip the build steps above). If helm isn't already on the master, install it with curl -fsSL https://raw.githubusercontent.com/helm/helm/main/scripts/get-helm-3 | bash.

cd $HOME/work/src/github.com/ovn-kubernetes/ovn-kubernetes/helm/ovn-kubernetes
helm install ovn-kubernetes . \
    -f values-single-node-zone.yaml \
    --set k8sAPIServer="https://192.168.123.1:6443" \
    --set global.image.repository=192.168.123.254:5000/ovn-daemonset-fedora \
    --set global.image.tag=latest

CEL CRD note: Several OVN-Kubernetes CRDs use CEL x-kubernetes-validations rules that don't install cleanly on older Kubernetes versions — the per-CRD cost budget rejects four of them on Kubernetes 1.31, and the indexed all(i, v, …) macro used by vteps and routeadvertisements is rejected on 1.32. Use Kubernetes 1.33 or newer (the example above pins to v1.35) and the chart's CRDs install cleanly via helm install.

The Helm chart does not remove kube-proxy. Since the kubeadm init command above installs kube-proxy by default, delete the kube-proxy DaemonSet:

kubectl delete ds -n kube-system kube-proxy

The Helm chart deploys the namespace, the cluster manager (ovnkube-control-plane), and the per-node pods (ovnkube-single-node-zone + ovnkube-node + ovs-node) in one step. Watch them come up:

kubectl get pods -n ovn-kubernetes -o wide -w

Once all OVN related pods are up, you should see that the CoreDNS pods have started as well and they should be in the correct network.

[root@node1 images]# kubectl get pods -A -o wide | grep coredns
kube-system      coredns-78fcd69978-ms969         1/1     Running   0          29s     172.16.0.6        node1   <none>           <none>
kube-system      coredns-78fcd69978-w6k2z         1/1     Running   0          36s     172.16.0.5        node1   <none>           <none>

You should now see the following when listing all pods (kube-proxy is absent because it was disabled at kubeadm init time with --skip-phases=addon/kube-proxy):

[root@node1 ~]# kubectl get pods -A -o wide
NAMESPACE        NAME                             READY   STATUS    RESTARTS   AGE     IP                NODE    NOMINATED NODE   READINESS GATES
kube-system      coredns-78fcd69978-rhjgh         1/1     Running   0          10s     172.16.0.4        node1   <none>           <none>
kube-system      coredns-78fcd69978-xcxnx         1/1     Running   0          17s     172.16.0.3        node1   <none>           <none>
kube-system      etcd-node1                       1/1     Running   1          74m     192.168.122.205   node1   <none>           <none>
kube-system      kube-apiserver-node1             1/1     Running   1          74m     192.168.122.205   node1   <none>           <none>
kube-system      kube-controller-manager-node1    1/1     Running   1          74m     192.168.122.205   node1   <none>           <none>
kube-system      kube-scheduler-node1             1/1     Running   1          74m     192.168.122.205   node1   <none>           <none>
ovn-kubernetes   ovnkube-control-plane-6f44d456df-bv2x8  1/1   Running   0          11m     192.168.122.205   node1   <none>           <none>
ovn-kubernetes   ovnkube-node-xmgrj                      3/3   Running   0          8m49s   192.168.122.205   node1   <none>           <none>
ovn-kubernetes   ovs-node-jl7lh                          1/1   Running   0          8m49s   192.168.122.205   node1   <none>           <none>

Verifying the deployment¶

Create a test deployment to make sure that everything works as expected:

cd ~
cat <<'EOF' > fedora.yaml
apiVersion: apps/v1
kind: Deployment
metadata:
  name: fedora-deployment
  labels:
    app: fedora-deployment
spec:
  replicas: 2
  selector:
    matchLabels:
      app: fedora-pod
  template:
    metadata:
      labels:
        app: fedora-pod
    spec:
      tolerations:
      - key: "node-role.kubernetes.io/control-plane"
        operator: "Exists"
      containers:
      - name: fedora
        image: fedora
        command:
          - sleep
          - infinity
        imagePullPolicy: IfNotPresent
        securityContext:
          runAsUser: 0
          capabilities:
            add:
              - "SETFCAP"
              - "CAP_NET_RAW"
              - "CAP_NET_ADMIN"
EOF
kubectl apply -f fedora.yaml

Make sure that the pods have a correct IP address and that they can reach the outside world, e.g. by installing some software:

[root@node1 ~]# kubectl get pods -o wide
NAME                                 READY   STATUS    RESTARTS   AGE   IP           NODE    NOMINATED NODE   READINESS GATES
fedora-deployment-86f7647bd6-dllbs   1/1     Running   0          58s   172.16.0.5   node1   <none>           <none>
fedora-deployment-86f7647bd6-k42wm   1/1     Running   0          36s   172.16.0.6   node1   <none>           <none>
[root@node1 ~]# kubectl exec -it fedora-deployment-86f7647bd6-dllbs -- /bin/bash
[root@fedora-deployment-86f7647bd6-dllbs /]# yum install iputils -y
Fedora 34 - x86_64                                                                   4.2 MB/s |  74 MB     00:17    
Fedora 34 openh264 (From Cisco) - x86_64                                             1.7 kB/s | 2.5 kB     00:01    
Fedora Modular 34 - x86_64                                                           2.8 MB/s | 4.9 MB     00:01    
Fedora 34 - x86_64 - Updates                                                         3.7 MB/s |  25 MB     00:06    
Fedora Modular 34 - x86_64 - Updates                                                 2.0 MB/s | 4.6 MB     00:02    
Last metadata expiration check: 0:00:01 ago on Tue Aug 24 17:04:04 2021.
Dependencies resolved.
=====================================================================================================================
 Package                   Architecture             Version                           Repository                Size
=====================================================================================================================
Installing:
 iputils                   x86_64                   20210202-2.fc34                   fedora                   170 k

Transaction Summary
=====================================================================================================================
Install  1 Package

Total download size: 170 k
Installed size: 527 k
Downloading Packages:
iputils-20210202-2.fc34.x86_64.rpm                                                   1.2 MB/s | 170 kB     00:00    
---------------------------------------------------------------------------------------------------------------------
Total                                                                                265 kB/s | 170 kB     00:00     
Running transaction check
Transaction check succeeded.
Running transaction test
Transaction test succeeded.
Running transaction
  Preparing        :                                                                                             1/1 
  Installing       : iputils-20210202-2.fc34.x86_64                                                              1/1 
  Running scriptlet: iputils-20210202-2.fc34.x86_64                                                              1/1 
  Verifying        : iputils-20210202-2.fc34.x86_64                                                              1/1 

Installed:
  iputils-20210202-2.fc34.x86_64                                                                                     

Complete!

Once the worker nodes have joined (see Joining worker nodes), you can also confirm the geneve tunnel between zones works by running two pods on different nodes and verifying cross-node connectivity plus service-cluster DNS:

cat <<'EOF' | kubectl apply -f -
apiVersion: v1
kind: Pod
metadata: {name: test-n2}
spec:
  nodeSelector: {kubernetes.io/hostname: node2}
  containers:
  - name: c
    image: registry.k8s.io/e2e-test-images/agnhost:2.47
    command: ["/agnhost", "netexec", "--http-port=8080"]
    securityContext: {capabilities: {add: [NET_RAW]}}
---
apiVersion: v1
kind: Pod
metadata: {name: test-n3}
spec:
  nodeSelector: {kubernetes.io/hostname: node3}
  containers:
  - name: c
    image: registry.k8s.io/e2e-test-images/agnhost:2.47
    command: ["/agnhost", "netexec", "--http-port=8080"]
    securityContext: {capabilities: {add: [NET_RAW]}}
EOF

# Cross-node ICMP exercises the geneve tunnel
N3_IP=$(kubectl get pod test-n3 -o jsonpath='{.status.podIP}')
kubectl exec test-n2 -- ping -c 3 -W 2 $N3_IP

# Cross-node TCP via agnhost's HTTP listener
kubectl exec test-n2 -- /agnhost connect --timeout=3s $N3_IP:8080

# Service DNS resolution via the cluster DNS service IP
kubectl exec test-n2 -- nslookup kubernetes.default.svc.cluster.local 10.96.0.10

Uninstalling OVN-Kubernetes¶

If you deployed OVN-Kubernetes with Helm as described above, uninstall it with:

helm uninstall ovn-kubernetes

Issues / workarounds:¶

br-int might be added by OVN, but the files for it are not created in /var/run/openvswitch. ovs-ofctl dump-flows br-int fails, and one will see the following log messages among others:

2021-08-24T12:42:43.810Z|00025|rconn|WARN|unix:/var/run/openvswitch/br-int.mgmt: connection failed (No such file or directory)

The best workaroud is to pre-create br-int before the OVN-Kubernetes installation:

ovs-vsctl add-br br-int

br-ex disappears from the OVS database whenever the OVN-Kubernetes Helm release is uninstalled and reinstalled. The ovs-node container resets the OVS DB on startup, removing any user-managed bridges. NetworkManager keeps the br-ex connection profile but cannot recreate it because the OVS DB is now under container control. The simplest recovery is to reboot each affected node — NetworkManager re-establishes br-ex from its persisted profile during boot, and ovn-kubernetes reconciles the patch port back to br-int. Plan for a per-node reboot whenever the chart is reinstalled.

Joining worker nodes to the environment¶

Finally, join your worker nodes. Set them up using the base setup steps for the nodes and the CRI and kubeadm installation steps. Then, use the output from the kubeadm init command that you ran earlier to join the node to the cluster:

kubeadm join 192.168.123.10:6443 --token <...> \
    --discovery-token-ca-cert-hash <...>

kubeadm reset instructions¶

If you must reset your master and worker nodes, the following commands can be used to reset the lab environment. Run this on each node and then ideally reboot the node right after:

IF2=enp7s0
echo "y" | kubeadm reset
rm -f /etc/cni/net.d/10-*
rm -Rf ~/.kube
rm -f /etc/openvswitch/conf.db
nmcli conn del cni0
systemctl restart openvswitch
systemctl restart NetworkManager
nmcli conn up ovs-if-${IF2}