🧠 mattdunn.info

Search

Kubernetes Engine (GKE)

Last updated 25 Mar 2024

google-cloud
gke
compute
kubernetes

Overview

Managed Kubernetes clusters
Two modes:
- Standard—total control
- Autopilot—fully managed
Auto-repair and upgrade
Autoscaling
Containerise legacy apps and migrate using Migrate for Anthos and GKE
GKE Multi-Tenancy

Clusters

At least one control plane and multiple worker nodes
- Zonal—one control plane node
- Regional—multiple control plane nodes
Can’t switch from regional to zonal (or vice versa) once cluster is created
Private clusters—VPC native, internal IPs only
For HA use multi-zonal node pools

Networking

Routing options:
- Routes-based
  - Custom static routes
- VPC-native
  - Alias IP ranges
  - Scale better
  - Needed for private clusters
  - Pods routable within cluster’s VPC, and other networks connected via VPC peering
    - Also generally routable from on-prem via VPN/Interconnect
  - Pod IP addresses reserved before Pods created in the cluster—prevents conflicts
  - Don’t consume custom routes quota
  - Can apply firewall rules to Pod IPs

GKE Dataplane V2

Optimized for Kubernetes networking
Consistent user experience—GKE and GKE Enterprise
Real-time visibility of network activity
Simpler architecture
Based on eBPF and Cilium—process network packets in-kernel

Advantages

Security—network policy always on, no need for add-ons e.g. Calico
Scalability—implemented without kube-proxy, doesn’t rely on iptables (eBPF maps instead)
Operations—network policy logging built in, audit pod communications

Architecture

eBPF
Kernel programs route/process packets—no need for kernel code change or kernel modules
No need for iptables—can process packets based on Kubernetes-specific metadata

Workload Identity

Links Kubernetes Service Accounts with Google Cloud Service Accounts—allows access to other Google Cloud resources from within the cluster

Operations

Integrated with Cloud Monitoring and Cloud Logging
Metrics:
- System metrics—low-level, e.g. CPU, memory
- Workload metrics—exposed by the workloads in the cluster

Workload Rightsizing

Allows applications running in Google Cloud to be optimally sized for CPU and memory
- Gives recommendations for resource request and limits
Use to reduce cost, and assess optimization opportunities
Combine with Autopilot—priced on Pod resource requests
- Changes directly reflected in bill
Suggestions based on observed usage patterns
Recommendations presented in Cloud Console—or option to generate Kubernetes YAML manifest
Metrics exposed in Cloud Monitoring—used to create custom reports, dashboards, alerts etc.
- Recommended per replica request cores—CPU
- Recommended per replica request bytes—memory
Memory recommendations not supported for JVM-based workloads
- Not possible due to JVM heap management

Autopilot

Price cutoff—if average node utilisation >~53%
- GKE standard cheaper

References

Google Cloud Decision Trees

Graph View

Backlinks

Cloud EKM
Cloud IAM
Cloud Profiler
Cloud Trace
EHR Healthcare Case Study
GKE Cluster Autoscaling
GKE Enterprise
GKE Multi-Tenancy
GKE Security Best Practices
Google Cloud Application Modernization
Google Cloud Compute Autoscaling
Google Cloud Data Lifecycle
Google Distributed Cloud Edge
Identity Aware Proxy
Migrate for Anthos and GKE
Mountkirk Games Case Study
Multi Cluster Ingress
Optimizing Kubernetes for Reliability and Cost-Efficiency
TerramEarth Case Study
Web Security Scanner
What's Next for Operations and Platform Builders - October 2023
Google Cloud Certified Professional Cloud Developer Certification
Google Cloud

Copyright © Matt Dunn 2024

GitHub
LinkedIn
Quartz v4.2.3