Ways to display your Kubernetes / OpenShift network flows
Red Hat OpenShift Container Platform (OCP) has had monitoring capabilities from the start and Network Observability as Developer Preview since OCP 4.10 release.
Network Observability brings user interfaces in OpenShift web console administration to filter and visualize your cluster network flows as:
-
Graph (top 5 flow rates stacked with total example)
-
Table (showing sources and destinations sorted by time)
-
Topology (using directed graph layout)
If you are interested in Network Observability for your cluster, check official documentation for installation.
Network representations
Topology view is a great way to understand how the different components of your cluster interact internally and identify which ones are communicating with the external world. It can also help you define your network policies or highlight potential threats.
To avoid huge and unreadable network graphs, we introduced in Network Observability multiple options to focus on the content you are looking for.
-
Time range Showing by default the last 5 minutes, it allows you to focus on a particular time window
-
Filtering To select a particular set of records based on Source and / or Destination criterias such as
IPs
,Ports
,Protocol
,Kind
,Name
etc -
Level of details (scope display option) Allowing you to select from
Node
/Namespace
/Owner
/Resource
aggregating metrics into the specified layer -
Groups To arrange components by their ownership
Example of usage
What happens in your network when you deploy a httpd sample application ?
You can see that the final deployed pod in blue involved another pod called httpd-sample-1-build
that pulled the image from openshift-image-registry
(1), did some DNS resolution for provided image URL (2), pulled it from the resolved external IP (3) before creating our pod through kubernetes services (4).
Finally, some flows are showing between our pod and openshift-ingress
after opening the hosted page (5).
After moving my time window or waiting a bit, the httpd-sample-1-build
pod disappears as it's status is now Completed
.
Complex representations
Sometimes we need to see more than a single application to troubleshoot network issues or to highlight bandwidth usage at cluster level.
2D network topology may start to show its limits as you need to switch between multiple options and never get an overview into a single render.
We can do the following observation from this:
- it is almost impossible to have a good overview of your entire network
- drawing each connection on a flat representation is hard to read
- it is difficult to understand ownership and relations between objects without a concrete representation
Another dimension
Here is where 3D Topology makes the scene !
3D representations are pretty old and you can find a lot of them online such as networkmaps, vagrant-mesh-net or even more generic ones such as splunk-3D-graph-network-topology-viz. Each of these has its own rendering philosophy to resolve specific use cases.
This is a way to render your network using a representation everybody knows: "buildings".
To enable this feature in Network Observability, you will need to use Network Observability v1.0 and add "&3d=preview" in the URL.
Then go to Topology tab -> Show advanced options -> Display options
From there set Scope option as "Resource" and Layout as "3D".
- Building drawn by dotted lines represent your
Cluster
Every internal communication is inside these lines Each floor is represented by a line - Bricks surrounding the building in a circle shape account for
external ips
- Ground floor with purple bricks represents your
Nodes
You can have multiple nodes all located at ground level - Upper floors in green shows your
Namespaces
Namespaces are aligned between nodes so each floor represent a single namespace If a namespace is not carried by a node, you will see an empty space - Rooms stand for owners such as
Deployments
in dark blue orStatefulSets
in light blue You can have multiple owners per namespaces They will repeat on each node + namespace combination if the load is distributed - Finally, rooms contents depict
Pods
These are represented in the proper node + namespace + owner combination according to its kindred
This representation emphasizes the entire ownership chain from Node
to Pod
passing by Namespace
.
It also pin up how your load is balanced between nodes.
The important part of your network traffic is highlighted since lines are less likely to cross than in the 2D view and their size and color differ according to bytes rate. A thin black line will represent a smaller amount than a heavy red line.
Conclusion
We need to identify the various use cases to elaborate proper representations. This is a daily step by step work between internal teams on both engineering and UI / UX and customers. We are likely to implement more views in the future to highlight network issues, health, threats and usage. One of the next implementations is going to be sankey chart that will be useful for connection tracking.
We need you !
Feel free to contribute by commenting this post, opening issues in netobserv console plugin or opening pull request in any netobserv component
Tell us more about your expectations, the way you currently solve issues and what could help your daily experience.