October 1, 2021 2 min read

Azure Kubernetes Service Upgrades: A Practical Guide

Keeping your Azure Kubernetes Service (AKS) clusters up to date is crucial for security, performance, and accessing new features. In this post, I’ll walk you through the practical aspects of upgrading AKS clusters.

Understanding AKS Version Support

Microsoft supports three minor GA versions of Kubernetes. When a new minor version is released, the oldest supported version is deprecated. You typically have 30 days after deprecation to upgrade before the version goes out of support.

Checking Available Upgrades

First, let’s check what upgrades are available for your cluster:

# Get the current cluster version
az aks show --resource-group myResourceGroup --name myAKSCluster --query kubernetesVersion -o tsv

# Check available upgrades
az aks get-upgrades --resource-group myResourceGroup --name myAKSCluster --output table

Planning Your Upgrade Strategy

Before upgrading, consider these factors:

Test in non-production first - Always validate upgrades in dev/staging environments
Review release notes - Check for breaking changes and deprecated APIs
Validate workloads - Ensure your applications are compatible with the target version

Performing the Upgrade

Control Plane Upgrade

You can upgrade just the control plane first:

az aks upgrade \
    --resource-group myResourceGroup \
    --name myAKSCluster \
    --control-plane-only \
    --kubernetes-version 1.22.2

Full Cluster Upgrade

To upgrade both control plane and node pools:

az aks upgrade \
    --resource-group myResourceGroup \
    --name myAKSCluster \
    --kubernetes-version 1.22.2

Upgrade Strategy with Node Pools

For production clusters, I recommend a staged approach:

# Step 1: Upgrade control plane only
az aks upgrade \
    --resource-group myResourceGroup \
    --name myAKSCluster \
    --control-plane-only \
    --kubernetes-version 1.22.2

# Step 2: Upgrade system node pool
az aks nodepool upgrade \
    --resource-group myResourceGroup \
    --cluster-name myAKSCluster \
    --name systempool \
    --kubernetes-version 1.22.2

# Step 3: Upgrade user node pools one at a time
az aks nodepool upgrade \
    --resource-group myResourceGroup \
    --cluster-name myAKSCluster \
    --name userpool1 \
    --kubernetes-version 1.22.2

Setting Max Surge for Faster Upgrades

By default, AKS upgrades nodes one at a time. You can speed this up with max surge:

az aks nodepool update \
    --resource-group myResourceGroup \
    --cluster-name myAKSCluster \
    --name userpool1 \
    --max-surge 33%

Monitoring the Upgrade

Watch the upgrade progress:

# Watch node status
kubectl get nodes -w

# Check pod status
kubectl get pods --all-namespaces -o wide

Handling Upgrade Failures

If an upgrade fails, you can check the activity log:

az monitor activity-log list \
    --resource-group myResourceGroup \
    --query "[?contains(operationName.value, 'Microsoft.ContainerService')]"

Automating Upgrades

For non-production environments, consider auto-upgrade channels:

az aks update \
    --resource-group myResourceGroup \
    --name myAKSCluster \
    --auto-upgrade-channel stable

Available channels:

none - No automatic upgrades
patch - Automatically upgrade to the latest patch version
stable - Automatically upgrade to the latest stable version
rapid - Automatically upgrade to the latest supported version
node-image - Automatically upgrade node images

Conclusion

Regular AKS upgrades are essential for maintaining a secure and well-supported cluster. By following a staged approach and testing thoroughly, you can minimize downtime and ensure smooth transitions between versions.

Tomorrow, we’ll dive deeper into AKS node pools and how to design them for different workload requirements.