How does Consul Service Mesh Work on Kubernetes?
Consul service mesh automates service-to-service authorization and encryption across your Consul services. You can use service mesh in Kubernetes-orchestrated networks to secure communication between pods as well as communication between pods and external Kubernetes services.
Workflow
Consul service mesh is enabled by default when you install Consul on Kubernetes using the Consul Helm chart. Consul also automatically injects sidecars that run Envoy into the pods in your clusters. These sidecar proxies, called Consul dataplanes, are enabled when connectInject.default is set to false in the Helm chart. Refer to the following documentation for additional information about these concepts:
- Installation and Configuration in this topic
- Consul Helm chart reference
- Simplified Service Mesh with Consul Dataplane
If connectInject.default is set to false or you want to explicitly enable service mesh sidecar proxy injection for a specific deployment, add the consul.hashicorp.com/connect-inject annotation to the pod specification template and set it to true when connecting services to the mesh.
Service names
When the service is onboarded, the name registered in Consul is set to the name of the Kubernetes Service associated with the Pod. You can use the consul.hashicorp.com/connect-service annotation to specify a custom name for the service, but if ACLs are enabled, the name of the service registered in Consul must match the Pod's ServiceAccount name.
Transparent proxy mode
By default, the Consul service mesh runs in transparent proxy mode. This mode forces inbound and outbound traffic through the sidecar proxy even though the service binds to all interfaces. Transparent proxy infers the location of upstream services using Consul service intentions, and also allows you to use Kubernetes DNS as you normally would for your workloads.
When transparent proxy mode is enabled, all service-to-service traffic is required to use mTLS. When onboarding new services to service mesh, your network may have mixed mTLS and non-mTLS traffic, which can result in broken service-to-service communication. You can temporarily enable permissive mTLS mode during the onboarding process so that existing mesh services can accept traffic from services that are not yet fully onboarded. Permissive mTLS enables sidecar proxies to accept both mTLS and non-mTLS traffic. Refer to Onboard mesh services in transparent proxy mode for additional information.
Kubernetes service mesh workload scenarios
Note: A Kubernetes Service is required in order to register services on the Consul service mesh. Consul monitors the lifecycle of the Kubernetes Service and its service instances using the service object. In addition, the Kubernetes service is used to register and de-register the service from Consul's catalog.
The following configurations are examples for registering workloads on Kubernetes into Consul's service mesh in different scenarios. Each scenario provides an example Kubernetes manifest to demonstrate how to use Consul's service mesh with a specific Kubernetes workload type.
- Kubernetes Pods running as a deployment
- Connecting to mesh-enabled Services
- Kubernetes Jobs
- Kubernetes Pods with multiple ports
Kubernetes Pods running as a deployment
The following example shows a Kubernetes configuration that specifically enables service mesh connections for the static-server service. Consul starts and registers a sidecar proxy that listens on port 20000 by default and proxies valid inbound connections to port 8080.
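A minimal sketch of such a configuration, assuming the hashicorp/http-echo image as the workload and port 80 on the Service (both chosen for illustration, not taken from this page):

```yaml
apiVersion: v1
kind: Service
metadata:
  # This name becomes the service name registered in Consul.
  name: static-server
spec:
  selector:
    app: static-server
  ports:
    - protocol: TCP
      port: 80          # assumed Service port
      targetPort: 8080  # port the application listens on
---
apiVersion: v1
kind: ServiceAccount
metadata:
  # With ACLs enabled, this must match the Consul service name.
  name: static-server
---
apiVersion: apps/v1
kind: Deployment
metadata:
  name: static-server
spec:
  replicas: 1
  selector:
    matchLabels:
      app: static-server
  template:
    metadata:
      labels:
        app: static-server
      annotations:
        # Explicitly request sidecar injection for this pod.
        'consul.hashicorp.com/connect-inject': 'true'
    spec:
      serviceAccountName: static-server
      containers:
        - name: static-server
          image: hashicorp/http-echo:latest  # assumed example image
          args:
            - '-text=hello world'
            - '-listen=:8080'
          ports:
            - containerPort: 8080
              name: http
```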
To establish a connection to the upstream Pod using service mesh, a client must dial the upstream workload using a mesh proxy. The client mesh proxy will use Consul service discovery to find all available upstream proxies and their public ports.
Connecting to mesh-enabled Services
The example Deployment specification below configures a Deployment that is capable of establishing connections to our previous example "static-server" service. The connection to this static text service happens over an authorized and encrypted connection via service mesh.
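A minimal sketch of a client Deployment, assuming the curlimages/curl image and a sleep loop to keep the container alive (both are illustrative choices):

```yaml
apiVersion: v1
kind: Service
metadata:
  # A Service is required to register the workload on the mesh.
  name: static-client
spec:
  selector:
    app: static-client
  ports:
    - port: 80  # assumed port; the client does not serve traffic
---
apiVersion: v1
kind: ServiceAccount
metadata:
  name: static-client
---
apiVersion: apps/v1
kind: Deployment
metadata:
  name: static-client
spec:
  replicas: 1
  selector:
    matchLabels:
      app: static-client
  template:
    metadata:
      labels:
        app: static-client
      annotations:
        'consul.hashicorp.com/connect-inject': 'true'
    spec:
      serviceAccountName: static-client
      containers:
        - name: static-client
          image: curlimages/curl:latest  # assumed example image
          # Keep the container running so you can exec into it.
          command: ['/bin/sh', '-c', '--']
          args: ['while true; do sleep 30; done;']
```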
By default, when ACLs are disabled or when the ACL default policy is allow, Consul automatically configures proxies with all upstreams from the same datacenter. When ACLs are enabled with a default deny policy, you must supply an intention to tell Consul which upstream you need to talk to.
When upstreams are specified explicitly with the consul.hashicorp.com/connect-service-upstreams annotation, the injector will also set environment variables <NAME>_CONNECT_SERVICE_HOST and <NAME>_CONNECT_SERVICE_PORT in every container in the Pod for every defined upstream. These are analogous to the standard Kubernetes service environment variables, but point instead to the correct local proxy port to establish connections via service mesh.
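As a rough illustration, the pod template fragment below declares one explicit upstream and reads the injected variables. The local port 1234, the curlimages/curl image, and the assumption that <NAME> is the upstream name upper-cased with dashes replaced by underscores are all illustrative and not confirmed by this page:

```yaml
# Fragment of a Deployment's spec.template (hypothetical client workload).
template:
  metadata:
    annotations:
      'consul.hashicorp.com/connect-inject': 'true'
      # Expose static-server on local port 1234 (port chosen for illustration).
      'consul.hashicorp.com/connect-service-upstreams': 'static-server:1234'
  spec:
    containers:
      - name: static-client
        image: curlimages/curl:latest  # assumed example image
        command: ['/bin/sh', '-c', '--']
        # Assumed variable names: STATIC_SERVER_CONNECT_SERVICE_HOST and _PORT.
        args: ['while true; do curl -s "http://${STATIC_SERVER_CONNECT_SERVICE_HOST}:${STATIC_SERVER_CONNECT_SERVICE_PORT}"; sleep 30; done;']
```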
You cannot reference auto-generated environment variables when the upstream annotation contains a dot. This is because Consul also renders the environment variables to include a dot. For example, Consul renders the variables generated for static-server.svc:8080 as STATIC_SERVER.SVC_CONNECT_SERVICE_HOST and STATIC_SERVER.SVC_CONNECT_SERVICE_PORT, which makes the variables unusable.
You can verify access to the static text server using kubectl exec. Because transparent proxy is enabled by default, use Kubernetes DNS to connect to your desired upstream.
You can control access to the server using intentions. If you use the Consul UI or CLI to deny communication between "static-client" and "static-server", connections are immediately rejected without updating either of the running pods. You can then remove this intention to allow connections again.
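The text above describes using the Consul UI or CLI. As an alternative sketch, the same deny rule could be expressed with a ServiceIntentions custom resource, assuming the Consul CRDs are installed in the cluster. The resource name is hypothetical:

```yaml
apiVersion: consul.hashicorp.com/v1alpha1
kind: ServiceIntentions
metadata:
  name: static-client-to-static-server  # hypothetical resource name
spec:
  # The service receiving connections.
  destination:
    name: static-server
  # The services attempting to connect, with the action applied to each.
  sources:
    - name: static-client
      action: deny
```

Deleting this resource, or changing the action to allow, would permit connections again.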
Kubernetes Jobs
Kubernetes Jobs run pods that only make outbound requests to services on the mesh and successfully terminate when they are complete. In order to register a Kubernetes Job with the mesh, you must provide an integer value for the consul.hashicorp.com/sidecar-proxy-lifecycle-shutdown-grace-period-seconds annotation. Then, issue a request to the http://127.0.0.1:20600/graceful_shutdown API endpoint so that Kubernetes gracefully shuts down the consul-dataplane sidecar after the job is complete.
Below is an example Kubernetes manifest that deploys a job correctly.
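A minimal sketch of such a manifest follows. The name test-job, the curlimages/curl image, the 5-second grace period, the sleep durations, and the use of an HTTP POST for the shutdown request are assumptions made for illustration:

```yaml
apiVersion: v1
kind: ServiceAccount
metadata:
  name: test-job
---
apiVersion: v1
kind: Service
metadata:
  # A Service is required to register the Job's pod on the mesh.
  name: test-job
spec:
  selector:
    app: test-job
  ports:
    - port: 80  # assumed port; the Job does not serve traffic
---
apiVersion: batch/v1
kind: Job
metadata:
  name: test-job
  labels:
    app: test-job
spec:
  template:
    metadata:
      labels:
        app: test-job
      annotations:
        'consul.hashicorp.com/connect-inject': 'true'
        # Integer number of seconds, passed as a string annotation value.
        'consul.hashicorp.com/sidecar-proxy-lifecycle-shutdown-grace-period-seconds': '5'
    spec:
      serviceAccountName: test-job
      restartPolicy: Never
      containers:
        - name: test-job
          image: curlimages/curl:latest  # assumed example image
          command:
            - /bin/sh
            - -c
            - |
              echo "Started test job"
              sleep 10
              echo "Shutting down proxy"
              # Ask the consul-dataplane sidecar to shut down (POST is an assumption).
              curl --max-time 2 -s -f -X POST http://127.0.0.1:20600/graceful_shutdown
              sleep 10
              echo "Ended test job"
```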
Upon completing the job you should be able to verify that all containers are shut down within the pod.
In addition, based on the logs emitted by the pod you can verify that the proxy was shut down before the Job completed.
Kubernetes Pods with multiple ports
To configure a pod with multiple ports to be a part of the service mesh and receive and send service mesh traffic, you will need to add configuration so that a Consul service can be registered per port. This is because services in Consul currently support a single port per service instance.
In the following example, suppose we have a pod which exposes 2 ports, 8080 and 9090, both of which will need to receive service mesh traffic.
First, decide on the names for the two Consul services that will correspond to those ports. In this example, the user chooses the names web for 8080 and web-admin for 9090.
Create two service accounts for web and web-admin:
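A minimal sketch of the two ServiceAccount objects:

```yaml
apiVersion: v1
kind: ServiceAccount
metadata:
  name: web
---
apiVersion: v1
kind: ServiceAccount
metadata:
  name: web-admin
```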
Create two Service objects for web and web-admin:
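A minimal sketch of the two Service objects, assuming port 80 as the Service port for both (an illustrative choice):

```yaml
apiVersion: v1
kind: Service
metadata:
  name: web
spec:
  selector:
    app: web
  ports:
    - protocol: TCP
      port: 80          # assumed Service port
      targetPort: 8080  # first container port
---
apiVersion: v1
kind: Service
metadata:
  name: web-admin
spec:
  # Selects the same pods as the web Service.
  selector:
    app: web
  ports:
    - protocol: TCP
      port: 80          # assumed Service port
      targetPort: 9090  # second container port
```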
web will target containerPort 8080 and select pods labeled app: web. web-admin will target containerPort 9090 and will also select the same pods.
Kubernetes 1.24+ only: In Kubernetes 1.24+ you need to create a Kubernetes secret for each additional Consul service associated with the pod in order to expose the Kubernetes ServiceAccount token to the Consul dataplane container running under the pod serviceAccount. The Kubernetes secret name must match the ServiceAccount name:
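A sketch of such a secret for the additional web-admin service, using the standard Kubernetes service account token Secret type:

```yaml
apiVersion: v1
kind: Secret
metadata:
  # Must match the ServiceAccount name.
  name: web-admin
  annotations:
    # Tells Kubernetes which ServiceAccount to issue the token for.
    kubernetes.io/service-account.name: web-admin
type: kubernetes.io/service-account-token
```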
Create a Deployment with any chosen name, and use the following annotations:
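A sketch of the pod template annotations, with transparent proxy disabled because it is not supported for multi-port Pods:

```yaml
# Fragment of the Deployment's spec.template.metadata.
annotations:
  'consul.hashicorp.com/connect-inject': 'true'
  # Transparent proxy is not supported for multi-port Pods.
  'consul.hashicorp.com/transparent-proxy': 'false'
  # Comma-separated service names and their ports, in matching order.
  'consul.hashicorp.com/connect-service': 'web,web-admin'
  'consul.hashicorp.com/connect-service-port': '8080,9090'
```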
Note that the ports are listed in the same order as the service names, i.e. the first service name web corresponds to the first port, 8080, and the second service name web-admin corresponds to the second port, 9090.
The service account on the pod spec for the deployment should be set to the first service name web:
The following deployment example demonstrates the required annotations for the manifest. In addition, the previous YAML manifests can also be combined into a single manifest for easier deployment.
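A minimal sketch of such a Deployment, assuming two hashicorp/http-echo containers as the workload (an illustrative choice that produces the responses described later in this section):

```yaml
apiVersion: apps/v1
kind: Deployment
metadata:
  name: web
spec:
  replicas: 1
  selector:
    matchLabels:
      app: web
  template:
    metadata:
      labels:
        app: web
      annotations:
        'consul.hashicorp.com/connect-inject': 'true'
        'consul.hashicorp.com/transparent-proxy': 'false'
        'consul.hashicorp.com/connect-service': 'web,web-admin'
        'consul.hashicorp.com/connect-service-port': '8080,9090'
    spec:
      # Set to the first service name.
      serviceAccountName: web
      containers:
        - name: web
          image: hashicorp/http-echo:latest  # assumed example image
          args:
            - '-text=hello world'
            - '-listen=:8080'
          ports:
            - containerPort: 8080
              name: http
        - name: web-admin
          image: hashicorp/http-echo:latest  # assumed example image
          args:
            - '-text=hello world from 9090'
            - '-listen=:9090'
          ports:
            - containerPort: 9090
              name: http-admin
```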
After deploying the web application, you can test service mesh connections by deploying the static-client application with the configuration in the previous section and adding the consul.hashicorp.com/connect-service-upstreams: 'web:1234,web-admin:2234' annotation to the pod template on static-client.
If you exec into a static-client pod, you can then send a request to localhost:1234 to see the output hello world, and a request to localhost:2234 to see the output hello world from 9090.
This works because a Consul service instance is registered per port on the Pod, so there are two Consul services in this case. An additional Envoy sidecar proxy and connect-init init container are also deployed per port in the Pod. The upstream configuration can therefore use the individual service names to reach each port, as seen in the example.
Caveats for Multi-port Pods
- Transparent proxy is not supported for multi-port Pods.
- Metrics and metrics merging are not supported for multi-port Pods.
- Upstreams will only be set on the first service's Envoy sidecar proxy for the pod.
  - This means that ServiceIntentions from a multi-port pod to elsewhere will need to use the first service's name, web in the example above, to accept connections from either web or web-admin. ServiceIntentions from elsewhere to a multi-port pod can use the individual service names within the multi-port Pod.
- Health checking is done on a per-Pod basis, so if any Kubernetes health checks (like readiness, liveness, etc.) are failing for any container on the Pod, the entire Pod is marked unhealthy, and any Consul service referencing that Pod will also be marked as unhealthy. So, if web has a failing health check, web-admin would also be marked as unhealthy for service mesh traffic.
Installation and Configuration
The service mesh sidecar proxy is injected via a mutating admission webhook called the connect injector, provided by the consul-k8s project. This enables the automatic pod mutation shown in the workload scenarios above. Installation of the mutating admission webhook is automated using the Helm chart.
To install the connect injector, enable the connect injection feature using Helm values and upgrade the installation using helm upgrade for existing installs or helm install for a fresh install.
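A minimal sketch of the Helm values that enable the connect injector:

```yaml
connectInject:
  # Installs the connect injector webhook.
  enabled: true
```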
This will configure the injector to inject when the injection annotation is set to true. Other values in the Helm chart can be used to limit the namespaces the injector runs in, enable injection by default, and more.
Verifying the Installation
To verify the installation, run the static-server example from the "Kubernetes Pods running as a deployment" section above. After running this example, run kubectl get pod static-server --output yaml. In the raw YAML output, you should see connect injected containers and an annotation consul.hashicorp.com/connect-inject-status set to injected. This confirms that injection is working properly.
If you do not see this, then use kubectl logs against the injector pod and note any errors.
Controlling Injection Via Annotation
By default, the injector will inject only when the injection annotation on the pod (not the deployment) is set to true:
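For example, in a Deployment the annotation belongs on the pod template metadata rather than on the Deployment's own metadata:

```yaml
# Fragment of a Deployment's spec.
template:
  metadata:
    annotations:
      # Set on the pod template, not on the Deployment itself.
      'consul.hashicorp.com/connect-inject': 'true'
```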
Injection Defaults
If you wish for the injector to always inject, you can set the default to true in the Helm chart:
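A minimal sketch of the corresponding Helm values:

```yaml
connectInject:
  enabled: true
  # Inject into every pod unless the pod opts out via annotation.
  default: true
```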
You can then exclude specific pods via annotation:
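A sketch of the opt-out annotation on a pod's metadata:

```yaml
# Fragment of a pod's metadata.
annotations:
  # Opt this pod out of injection when the default is true.
  'consul.hashicorp.com/connect-inject': 'false'
```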
Controlling Injection Via Namespace
You can control which Kubernetes namespaces are allowed to be injected via the k8sAllowNamespaces and k8sDenyNamespaces keys:
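A sketch of these keys in the Helm values, with an allow list that matches every namespace and an empty deny list:

```yaml
connectInject:
  enabled: true
  # "*" allows every namespace.
  k8sAllowNamespaces: ['*']
  # No namespaces are denied.
  k8sDenyNamespaces: []
```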
In the default configuration (shown above), services from all namespaces are allowed to be injected. Whether or not they're injected depends on the value of connectInject.default and the consul.hashicorp.com/connect-inject annotation.
If you wish to only enable injection in specific namespaces, you can list only those namespaces in the k8sAllowNamespaces key. In the configuration below only the my-ns-1 and my-ns-2 namespaces will be enabled for injection. All other namespaces will be ignored, even if the connect inject annotation is set.
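A sketch of Helm values that restrict injection to those two namespaces:

```yaml
connectInject:
  enabled: true
  # Only these namespaces are eligible for injection.
  k8sAllowNamespaces: ['my-ns-1', 'my-ns-2']
  k8sDenyNamespaces: []
```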
If you wish to enable injection in every namespace except specific namespaces, you can use * in the allow list to allow all namespaces and then specify the namespaces to exclude in the deny list:
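A sketch of such a configuration. The namespace names in the deny list are hypothetical:

```yaml
connectInject:
  enabled: true
  # Allow every namespace...
  k8sAllowNamespaces: ['*']
  # ...except these (hypothetical names).
  k8sDenyNamespaces: ['no-inject-ns-1', 'no-inject-ns-2']
```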
NOTE: The deny list takes precedence over the allow list. If a namespace is listed in both lists, it will not be injected.
NOTE: The kube-system and kube-public namespaces will never be injected.
Consul Enterprise Namespaces
Consul Enterprise 1.7+ supports Consul namespaces. When Kubernetes pods are registered into Consul, you can control which Consul namespace they are registered into.
There are three options available:
Single Destination Namespace – Register all Kubernetes pods, regardless of namespace, into the same Consul namespace.
This can be configured with:
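A sketch of the Helm values for a single destination namespace. The namespace name my-consul-ns is hypothetical:

```yaml
global:
  enableConsulNamespaces: true

connectInject:
  enabled: true
  consulNamespaces:
    # All pods register into this Consul namespace (hypothetical name).
    consulDestinationNamespace: 'my-consul-ns'
```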
NOTE: If the destination namespace does not exist we will create it.
Mirror Namespaces - Register each Kubernetes pod into a Consul namespace with the same name as its Kubernetes namespace. For example, pod foo in Kubernetes namespace ns-1 will be synced to the Consul namespace ns-1. If a mirrored namespace does not exist in Consul, it will be created.
This can be configured with:
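A sketch of the Helm values for mirroring:

```yaml
global:
  enableConsulNamespaces: true

connectInject:
  enabled: true
  consulNamespaces:
    # Register each pod into the Consul namespace named after its Kubernetes namespace.
    mirroringK8S: true
```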
Mirror Namespaces With Prefix - Register each Kubernetes pod into a Consul namespace with the same name as its Kubernetes namespace with a prefix. For example, given a prefix k8s-, pod foo in Kubernetes namespace ns-1 will be synced to the Consul namespace k8s-ns-1.
This can be configured with:
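A sketch of the Helm values for mirroring with the k8s- prefix:

```yaml
global:
  enableConsulNamespaces: true

connectInject:
  enabled: true
  consulNamespaces:
    mirroringK8S: true
    # Prepended to the Kubernetes namespace name.
    mirroringK8SPrefix: 'k8s-'
```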
Consul Enterprise Namespace Upstreams
When transparent proxy is enabled and ACLs are disabled, the upstreams will be configured automatically across Consul namespaces. When ACLs are enabled, you must configure it by specifying an intention, allowing services across Consul namespaces to talk to each other.
If you wish to specify an upstream explicitly via the consul.hashicorp.com/connect-service-upstreams annotation, use the format [service-name].[namespace]:[port]:[optional datacenter]:
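As an illustration of that format, the fragment below points at a static-server service in a hypothetical Consul namespace my-consul-ns and exposes it on local port 1234 (the port is also chosen for illustration):

```yaml
# Fragment of a pod template's metadata.
annotations:
  'consul.hashicorp.com/connect-inject': 'true'
  # [service-name].[namespace]:[port], with the optional datacenter omitted.
  'consul.hashicorp.com/connect-service-upstreams': 'static-server.my-consul-ns:1234'
```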
See consul.hashicorp.com/connect-service-upstreams for more details.
Note: When you specify upstreams via an upstreams annotation, you will need to use localhost:<port> with the port from the upstreams annotation instead of KubeDNS to connect to your upstream application.