
Conversation

odinuge
Member

@odinuge odinuge commented Apr 10, 2025

The HUBBLE_NODE_NAME env var is useful for adding extra metadata to the "node_name" field of exported events. The Kubernetes node name alone is often not very descriptive, so additional information is useful. This can be achieved by, e.g., overriding the following Helm value, where Kubernetes replaces $(NODE_NAME) with the existing NODE_NAME env var defined in the Helm chart, which is populated from the actual node name.

tetragon.extraEnv=[{"name":"HUBBLE_NODE_NAME", "value":"$(NODE_NAME)-additional-info-here.domain.tld"}]

Prior to bcf9429 ("Watcher: fix NODE_NAME if missing") this worked as expected: exported events kept this value, and the Kubernetes pod watcher used the existing NODE_NAME. After that commit, the pod watcher started using HUBBLE_NODE_NAME instead, so it no longer received any pod events and pod attribution stopped working for exported events.
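
For illustration, a minimal Go sketch of the intended split between the two env vars (hypothetical function names, not Tetragon's actual code): the exported "node_name" may carry the user-supplied HUBBLE_NODE_NAME, while the pod watcher must filter on the real Kubernetes node name from NODE_NAME.

package main

import "os"

// exportNodeName is the name written to the "node_name" field of
// exported events; it may carry user-supplied extra metadata.
func exportNodeName() string {
	if v := os.Getenv("HUBBLE_NODE_NAME"); v != "" {
		return v // e.g. "kind-control-plane-additional-info-here.domain.tld"
	}
	return os.Getenv("NODE_NAME")
}

// watcherNodeName is the name the pod watcher filters pods on (via a
// spec.nodeName field selector); it must match the real Kubernetes
// node name, so it must not pick up HUBBLE_NODE_NAME.
func watcherNodeName() string {
	return os.Getenv("NODE_NAME")
}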

Fixes: bcf9429 ("Watcher: fix NODE_NAME if missing")


Changelog

events: fix source pod attribution when env var HUBBLE_NODE_NAME is set

@odinuge
Member Author

odinuge commented Apr 10, 2025

When installing using:

$ helm install tetragon cilium/tetragon -n kube-system '--set-json=tetragon.extraEnv=[{"name":"HUBBLE_NODE_NAME", "value":"$(NODE_NAME)-additional-info-here.domain.tld"}]'

Current main:

{
  "process_exec": {
    "process": {
      "exec_id": "a2luZC1jb250cm9sLXBsYW5lLWFkZGl0aW9uYWwtaW5mby1oZXJlLmRvbWFpbi50bGQ6MzEzNDAyODE0ODA3MjE5Ojg2OTIyOA==",
      "pid": 869228,
      "uid": 0,
      "cwd": "/",
      "binary": "/usr/bin/wget",
      "arguments": "tetragon.io",
      "flags": "execve rootcwd",
      "start_time": "2025-04-10T08:29:58.907019715Z",
      "auid": 4294967295,
      "docker": "97efb07e10aefbfacb68ee439eb1099",
      "parent_exec_id": "a2luZC1jb250cm9sLXBsYW5lLWFkZGl0aW9uYWwtaW5mby1oZXJlLmRvbWFpbi50bGQ6MDo4Njg5OTY=",
      "tid": 869228,
      "in_init_tree": false
    }
  },
  "node_name": "kind-control-plane-additional-info-here.domain.tld",
  "time": "2025-04-10T08:29:58.906997952Z"
}

With this patch:

{
  "process_exec": {
    "process": {
      "exec_id": "a2luZC1jb250cm9sLXBsYW5lLWFkZGl0aW9uYWwtaW5mby1oZXJlLmRvbWFpbi50bGQ6MzE1MzI2MDIzMTU0Mzg4Ojg5MjMxOA==",
      "pid": 892318,
      "uid": 0,
      "cwd": "/",
      "binary": "/usr/bin/wget",
      "arguments": "tetragon.io",
      "flags": "execve rootcwd",
      "start_time": "2025-04-10T09:02:02.143911508Z",
      "auid": 4294967295,
      "pod": {
        "namespace": "default",
        "name": "sh",
        "container": {
          "id": "containerd://97efb07e10aefbfacb68ee439eb109998d40639ea2a04dc4184b53b7efffecd1",
          "name": "sh",
          "image": {
            "id": "docker.io/library/alpine@sha256:a8560b36e8b8210634f77d9f7f9efd7ffa463e380b75e2e74aff4511df3ef88c",
            "name": "docker.io/library/alpine:latest"
          },
          "start_time": "2025-04-10T08:27:23Z",
          "pid": 81
        },
        "pod_labels": {
          "run": "sh"
        },
        "workload": "sh",
        "workload_kind": "Pod"
      },
      "docker": "97efb07e10aefbfacb68ee439eb1099",
      "parent_exec_id": "a2luZC1jb250cm9sLXBsYW5lLWFkZGl0aW9uYWwtaW5mby1oZXJlLmRvbWFpbi50bGQ6MDo4Njg5OTY=",
      "tid": 892318,
      "in_init_tree": false
    }
  },
  "node_name": "kind-control-plane-additional-info-here.domain.tld",
  "time": "2025-04-10T09:02:02.143938494Z"
}

Notice that the pod field is missing from the former output, and that in both cases the node_name field contains the additional information we provided.
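
As a side note, the exec_id values above are base64-encoded and embed the node name, so decoding one shows which name an event was attributed to. A quick Go check (the decoded node:ktime:pid layout is an assumption based on the IDs above):

package main

import (
	"encoding/base64"
	"fmt"
)

func main() {
	// exec_id taken from the patched output above.
	id := "a2luZC1jb250cm9sLXBsYW5lLWFkZGl0aW9uYWwtaW5mby1oZXJlLmRvbWFpbi50bGQ6MzE1MzI2MDIzMTU0Mzg4Ojg5MjMxOA=="
	b, err := base64.StdEncoding.DecodeString(id)
	if err != nil {
		panic(err)
	}
	// Expect something like:
	// kind-control-plane-additional-info-here.domain.tld:<ktime>:<pid>
	fmt.Println(string(b))
}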

@odinuge
Member Author

odinuge commented Apr 10, 2025

cc @kevsecurity since you made the initial PR. I'm not 110% sold we need the fallback for k8s node since the helm chart always adds the NODE_NAME env var, and I'm not fully sure about the intention of #2824 - but I'm keeping the fallback for now.

We could also log a line on startup with the actual name for easier debugging, since we spent a lot of time debugging why our setup broke.
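
Something as simple as the following would have saved us that time (a rough sketch using the standard library, not Tetragon's actual logger):

package main

import (
	"log"
	"os"
)

func main() {
	// Log both names at startup so a mismatch between the name used for
	// exported events and the name used by the pod watcher is visible
	// at a glance.
	log.Printf("node names: HUBBLE_NODE_NAME=%q NODE_NAME=%q",
		os.Getenv("HUBBLE_NODE_NAME"), os.Getenv("NODE_NAME"))
}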

@kkourt kkourt added the release-note/bug This PR fixes an issue in a previous release of Tetragon. label Apr 10, 2025
Copy link
Contributor

@kkourt kkourt left a comment


LGTM, thanks.

@odinuge odinuge marked this pull request as ready for review April 10, 2025 09:41
@odinuge odinuge requested a review from a team as a code owner April 10, 2025 09:41
@odinuge odinuge requested a review from tixxdz April 10, 2025 09:41
@kkourt kkourt added needs-backport/1.2 This PR needs backporting to 1.2 needs-backport/1.3 This PR needs backporting to 1.3 needs-backport/1.4 labels Apr 10, 2025
@kevsecurity
Contributor

cc @kevsecurity since you made the initial PR. I'm not 110% sold we need the fallback for k8s node since the helm chart always adds the NODE_NAME env var, and I'm not fully sure about the intention of #2824 - but I'm keeping the fallback for now.

We could also log a line on startup with the actual name for easier debugging, since we spent a lot of time debugging why our setup broke.

Happy to revert. If the original situation that prompted my change arises again, I'll document better and take this use case into account.

@kkourt
Contributor

kkourt commented Apr 10, 2025

cc @kevsecurity since you made the initial PR. I'm not 110% sold we need the fallback for k8s node since the helm chart always adds the NODE_NAME env var, and I'm not fully sure about the intention of #2824 - but I'm keeping the fallback for now.

For some context, one reason for this fallback is setups where the Tetragon agent runs on a k8s node but is deployed not via a Helm-managed daemonset, but via, for example, a systemd service.
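
In such setups NODE_NAME may be unset, so the watcher's name resolution would look roughly like this (a sketch of the fallback being discussed, not the actual implementation):

package main

import "os"

// watcherNodeName resolves the node name the pod watcher filters on:
// prefer the real NODE_NAME, fall back to HUBBLE_NODE_NAME, and as a
// last resort use the OS hostname, which outside Helm-managed
// daemonsets is often the node name.
func watcherNodeName() string {
	if v := os.Getenv("NODE_NAME"); v != "" {
		return v
	}
	if v := os.Getenv("HUBBLE_NODE_NAME"); v != "" {
		return v
	}
	host, _ := os.Hostname() // best effort; empty string on error
	return host
}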

@kkourt kkourt merged commit 29e9ebe into cilium:main Apr 10, 2025
43 of 44 checks passed
@kkourt kkourt mentioned this pull request Apr 10, 2025
@odinuge odinuge deleted the node-name branch April 10, 2025 14:49