[ASCII-2626] Add FIPS linux cipher compliance tests for agent flavor #32366

jeremy-hanna · 2024-12-18T20:46:51Z

What does this PR do?

For FIPS compliance, we need to confirm that the FIPS flavor of the agent we provide uses approved ciphers.

This PR adds the e2e tests proposed in #29560, with the correct CI dependency setup for the current build.

Prerequisite PRs:

fips flavor builds: Add foundations for FIPS flavor #31004
install script support: [ASCII-2593] Add datadog-fips-agent to the linux install script hash as viable flavor agent-linux-install-script#306
e2e flavor install support: Add flavor to Agent PackageVersion test-infra-definitions#1286

Motivation

Describe how you validated your changes

This PR only adds tests.

Possible Drawbacks / Trade-offs

Additional Notes

agent-platform-auto-pr · 2024-12-18T20:51:46Z

Gitlab CI Configuration Changes

Added Jobs

new-e2e-fips-compliance-test

new-e2e-fips-compliance-test:
  after_script:
  - $CI_PROJECT_DIR/tools/ci/junit_upload.sh
  artifacts:
    expire_in: 2 weeks
    paths:
    - $E2E_OUTPUT_DIR
    - junit-*.tgz
    reports:
      annotations:
      - $EXTERNAL_LINKS_PATH
    when: always
  before_script:
  - mkdir -p $GOPATH/pkg/mod/cache && tar xJf modcache_e2e.tar.xz -C $GOPATH/pkg/mod/cache
    || exit 101
  - rm -f modcache_e2e.tar.xz
  - mkdir -p ~/.aws
  - $CI_PROJECT_DIR/tools/ci/fetch_secret.sh $AGENT_QA_E2E profile >> ~/.aws/config
    || exit $?
  - export AWS_PROFILE=agent-qa-ci
  - $CI_PROJECT_DIR/tools/ci/fetch_secret.sh $AGENT_QA_E2E ssh_public_key_rsa > $E2E_AWS_PUBLIC_KEY_PATH
    || exit $?
  - touch $E2E_AWS_PRIVATE_KEY_PATH && chmod 600 $E2E_AWS_PRIVATE_KEY_PATH && $CI_PROJECT_DIR/tools/ci/fetch_secret.sh
    $AGENT_QA_E2E ssh_key_rsa > $E2E_AWS_PRIVATE_KEY_PATH || exit $?
  - $CI_PROJECT_DIR/tools/ci/fetch_secret.sh $AGENT_QA_E2E ssh_public_key_rsa > $E2E_AZURE_PUBLIC_KEY_PATH
    || exit $?
  - touch $E2E_AZURE_PRIVATE_KEY_PATH && chmod 600 $E2E_AZURE_PRIVATE_KEY_PATH &&
    $CI_PROJECT_DIR/tools/ci/fetch_secret.sh $AGENT_QA_E2E ssh_key_rsa > $E2E_AZURE_PRIVATE_KEY_PATH
    || exit $?
  - $CI_PROJECT_DIR/tools/ci/fetch_secret.sh $AGENT_QA_E2E ssh_public_key_rsa > $E2E_GCP_PUBLIC_KEY_PATH
    || exit $?
  - touch $E2E_GCP_PRIVATE_KEY_PATH && chmod 600 $E2E_GCP_PRIVATE_KEY_PATH && $CI_PROJECT_DIR/tools/ci/fetch_secret.sh
    $AGENT_QA_E2E ssh_key_rsa > $E2E_GCP_PRIVATE_KEY_PATH || exit $?
  - pulumi login "s3://dd-pulumi-state?region=us-east-1&awssdk=v2&profile=$AWS_PROFILE"
  - ARM_CLIENT_ID=$($CI_PROJECT_DIR/tools/ci/fetch_secret.sh $E2E_AZURE client_id)
    || exit $?; export ARM_CLIENT_ID
  - ARM_CLIENT_SECRET=$($CI_PROJECT_DIR/tools/ci/fetch_secret.sh $E2E_AZURE token)
    || exit $?; export ARM_CLIENT_SECRET
  - ARM_TENANT_ID=$($CI_PROJECT_DIR/tools/ci/fetch_secret.sh $E2E_AZURE tenant_id)
    || exit $?; export ARM_TENANT_ID
  - ARM_SUBSCRIPTION_ID=$($CI_PROJECT_DIR/tools/ci/fetch_secret.sh $E2E_AZURE subscription_id)
    || exit $?; export ARM_SUBSCRIPTION_ID
  - $CI_PROJECT_DIR/tools/ci/fetch_secret.sh $E2E_GCP credentials_json > ~/gcp-credentials.json
    || exit $?
  - export GOOGLE_APPLICATION_CREDENTIALS=~/gcp-credentials.json
  - inv -e gitlab.generate-ci-visibility-links --output=$EXTERNAL_LINKS_PATH
  image: registry.ddbuild.io/ci/test-infra-definitions/runner$TEST_INFRA_DEFINITIONS_BUILDIMAGES_SUFFIX:$TEST_INFRA_DEFINITIONS_BUILDIMAGES
  needs:
  - go_e2e_deps
  - qa_agent_fips
  - deploy_deb_testing-a7_x64
  rules:
  - if: $RUN_E2E_TESTS == "off"
    when: never
  - if: $CI_COMMIT_BRANCH =~ /^mq-working-branch-/
    when: never
  - if: $RUN_E2E_TESTS == "on"
    when: on_success
  - if: $CI_COMMIT_BRANCH == "main"
    when: on_success
  - if: $CI_COMMIT_BRANCH =~ /^[0-9]+\.[0-9]+\.x$/
    when: on_success
  - if: $CI_COMMIT_TAG =~ /^[0-9]+\.[0-9]+\.[0-9]+-rc\.[0-9]+$/
    when: on_success
  - changes:
      compare_to: main
      paths:
      - .gitlab/e2e/e2e.yml
      - test/new-e2e/pkg/**/*
      - test/new-e2e/go.mod
      - flakes.yaml
  - changes:
      compare_to: main
      paths:
      - cmd/**/*
      - pkg/**/*
      - comp/**/*
      - test/new-e2e/tests/agent-shared-components/**/*
  - if: $CI_COMMIT_BRANCH =~ /^mq-working-branch-/
    when: never
  - allow_failure: true
    when: manual
  script:
  - inv -e new-e2e-tests.run --targets $TARGETS -c ddagent:imagePullRegistry=669783387624.dkr.ecr.us-east-1.amazonaws.com
    -c ddagent:imagePullUsername=AWS -c ddagent:imagePullPassword=$(aws ecr get-login-password)
    --junit-tar junit-${CI_JOB_ID}.tgz ${EXTRA_PARAMS} --test-washer --logs-folder=$E2E_OUTPUT_DIR/logs
    --logs-post-processing --logs-post-processing-test-depth=$E2E_LOGS_PROCESSING_TEST_DEPTH
  stage: e2e
  tags:
  - arch:amd64
  variables:
    E2E_AWS_PRIVATE_KEY_PATH: /tmp/agent-qa-aws-ssh-key
    E2E_AWS_PUBLIC_KEY_PATH: /tmp/agent-qa-aws-ssh-key.pub
    E2E_AZURE_PRIVATE_KEY_PATH: /tmp/agent-qa-azure-ssh-key
    E2E_AZURE_PUBLIC_KEY_PATH: /tmp/agent-qa-azure-ssh-key.pub
    E2E_COMMIT_SHA: $CI_COMMIT_SHORT_SHA
    E2E_GCP_PRIVATE_KEY_PATH: /tmp/agent-qa-gcp-ssh-key
    E2E_GCP_PUBLIC_KEY_PATH: /tmp/agent-qa-gcp-ssh-key.pub
    E2E_KEY_PAIR_NAME: datadog-agent-ci-rsa
    E2E_LOGS_PROCESSING_TEST_DEPTH: 1
    E2E_OUTPUT_DIR: $CI_PROJECT_DIR/e2e-output
    E2E_PIPELINE_ID: $CI_PIPELINE_ID
    EXTERNAL_LINKS_PATH: external_links_$CI_JOB_ID.json
    KUBERNETES_CPU_REQUEST: 6
    KUBERNETES_MEMORY_LIMIT: 16Gi
    KUBERNETES_MEMORY_REQUEST: 12Gi
    SHOULD_RUN_IN_FLAKES_FINDER: 'true'
    TARGETS: ./tests/fips-compliance
    TEAM: agent-shared-components
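
GitLab evaluates a job's `rules` list top to bottom and stops at the first rule that matches. The sketch below models that first-match behaviour for a subset of the rules above, to show that the second `mq-working-branch-` rule can never fire because the earlier identical rule always matches first. It is a simplified illustration, not GitLab code: the `Rule` type and the `branch_matches` and `evaluate` helpers are hypothetical, and the tag rule and both `changes` rules are left out for brevity.

```python
import re
from dataclasses import dataclass
from typing import Callable, Dict, List

Env = Dict[str, str]


@dataclass
class Rule:
    name: str                      # label used only in this sketch
    matches: Callable[[Env], bool]  # stands in for the rule's `if:` clause
    when: str                      # outcome if this rule is the first match


def branch_matches(pattern: str) -> Callable[[Env], bool]:
    # An unset CI_COMMIT_BRANCH is treated as an empty string, which none
    # of the patterns below match.
    return lambda env: re.search(pattern, env.get("CI_COMMIT_BRANCH", "")) is not None


# Subset of the job's rules, in the same order as the configuration above.
RULES: List[Rule] = [
    Rule("e2e-off", lambda env: env.get("RUN_E2E_TESTS") == "off", "never"),
    Rule("merge-queue", branch_matches(r"^mq-working-branch-"), "never"),
    Rule("e2e-on", lambda env: env.get("RUN_E2E_TESTS") == "on", "on_success"),
    Rule("main", lambda env: env.get("CI_COMMIT_BRANCH") == "main", "on_success"),
    Rule("release-branch", branch_matches(r"^[0-9]+\.[0-9]+\.x$"), "on_success"),
    Rule("merge-queue-duplicate", branch_matches(r"^mq-working-branch-"), "never"),
    Rule("fallback", lambda env: True, "manual"),
]


def evaluate(env: Env) -> Rule:
    """Return the first rule that matches, mirroring first-match semantics."""
    for rule in RULES:
        if rule.matches(env):
            return rule
    raise AssertionError("unreachable: the fallback rule always matches")


if __name__ == "__main__":
    for env in (
        {"CI_COMMIT_BRANCH": "mq-working-branch-abc", "RUN_E2E_TESTS": "on"},
        {"CI_COMMIT_BRANCH": "main"},
        {"CI_COMMIT_BRANCH": "some-feature-branch"},
    ):
        rule = evaluate(env)
        print(env, "->", rule.name, rule.when)
```

In this model, a merge-queue branch is excluded even when `RUN_E2E_TESTS` is `on`, and any branch that matches nothing else falls through to the manual fallback, which the real job also marks `allow_failure: true`.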

qa_agent_fips

qa_agent_fips:
  image: registry.ddbuild.io/ci/datadog-agent-buildimages/docker_x64$DATADOG_AGENT_BUILDIMAGES_SUFFIX:$DATADOG_AGENT_BUILDIMAGES
  needs:
  - docker_build_fips_agent7
  - docker_build_fips_agent7_arm64
  - docker_build_fips_agent7_windows2022_core
  rules:
  - if: $CI_COMMIT_BRANCH =~ /^mq-working-branch-/
    when: never
  - if: $RUN_E2E_TESTS == "off"
    when: never
  - when: on_success
  script:
  - GITLAB_TOKEN=$($CI_PROJECT_DIR/tools/ci/fetch_secret.sh $GITLAB_TOKEN write_api)
    || exit $?; export GITLAB_TOKEN
  - "if [[ \"$BUCKET_BRANCH\" == \"nightly\" && ( \"$IMG_SOURCES\" =~ \"$SRC_AGENT\"\
    \ || \"$IMG_SOURCES\" =~ \"$SRC_DCA\" || \"$IMG_SOURCES\" =~ \"$SRC_CWS_INSTRUMENTATION\"\
    \ || \"$IMG_VARIABLES\" =~ \"$SRC_AGENT\" || \"$IMG_VARIABLES\" =~ \"$SRC_DCA\"\
    \ || \"$IMG_VARIABLES\" =~ \"$SRC_CWS_INSTRUMENTATION\" ) ]]; then\n  export ECR_RELEASE_SUFFIX=\"\
    -nightly\"\nelse\n  export ECR_RELEASE_SUFFIX=\"${CI_COMMIT_TAG+-release}\"\n\
    fi\n"
  - IMG_VARIABLES="$(sed -E "s#(${SRC_AGENT}|${SRC_DSD}|${SRC_DCA}|${SRC_CWS_INSTRUMENTATION})#\1${ECR_RELEASE_SUFFIX}#g"
    <<<"$IMG_VARIABLES")"
  - IMG_SOURCES="$(sed -E "s#(${SRC_AGENT}|${SRC_DSD}|${SRC_DCA}|${SRC_CWS_INSTRUMENTATION})#\1${ECR_RELEASE_SUFFIX}#g"
    <<<"$IMG_SOURCES")"
  - inv pipeline.trigger-child-pipeline --project-name DataDog/public-images --git-ref
    main --timeout 1800 --variable IMG_VARIABLES --variable IMG_REGISTRIES --variable
    IMG_SOURCES --variable IMG_DESTINATIONS --variable IMG_SIGNING --variable APPS
    --variable BAZEL_TARGET --variable DDR --variable DDR_WORKFLOW_ID --variable TARGET_ENV
    --variable DYNAMIC_BUILD_RENDER_TARGET_FORWARD_PARAMETERS
  stage: dev_container_deploy
  tags:
  - arch:amd64
  variables:
    IMG_DESTINATIONS: agent:${CI_PIPELINE_ID}-${CI_COMMIT_SHORT_SHA}-fips
    IMG_REGISTRIES: agent-qa
    IMG_SIGNING: ''
    IMG_SOURCES: ${SRC_AGENT}:v${CI_PIPELINE_ID}-${CI_COMMIT_SHORT_SHA}-7-fips-amd64,${SRC_AGENT}:v${CI_PIPELINE_ID}-${CI_COMMIT_SHORT_SHA}-7-fips-arm64,${SRC_AGENT}:v${CI_PIPELINE_ID}-${CI_COMMIT_SHORT_SHA}-7-fips-winltsc2022-servercore-amd64
    IMG_VARIABLES: ''
    SRC_AGENT: registry.ddbuild.io/ci/datadog-agent/agent
    SRC_CWS_INSTRUMENTATION: registry.ddbuild.io/ci/datadog-agent/cws-instrumentation
    SRC_DCA: registry.ddbuild.io/ci/datadog-agent/cluster-agent
    SRC_DSD: registry.ddbuild.io/ci/datadog-agent/dogstatsd
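
The suffix logic in the `qa_agent_fips` script is dense once YAML escaping is applied, so here is a rough Python rendering of what the shell appears to do. It is a sketch under assumptions: `release_suffix` and `add_suffix` are hypothetical names, `commit_tag=None` stands in for an unset `CI_COMMIT_TAG`, and `re.escape` makes the match literal, whereas the `sed -E` pattern treats the dots in the registry names as wildcards.

```python
import re
from typing import Optional

# Values copied from the job's `variables` block.
SRC = {
    "SRC_AGENT": "registry.ddbuild.io/ci/datadog-agent/agent",
    "SRC_DSD": "registry.ddbuild.io/ci/datadog-agent/dogstatsd",
    "SRC_DCA": "registry.ddbuild.io/ci/datadog-agent/cluster-agent",
    "SRC_CWS_INSTRUMENTATION": "registry.ddbuild.io/ci/datadog-agent/cws-instrumentation",
}


def release_suffix(bucket_branch: str, img_sources: str, img_variables: str,
                   commit_tag: Optional[str]) -> str:
    """Mirror the if/else that exports ECR_RELEASE_SUFFIX."""
    # The shell test only looks for these three sources (not SRC_DSD).
    nightly_sources = (SRC["SRC_AGENT"], SRC["SRC_DCA"], SRC["SRC_CWS_INSTRUMENTATION"])
    mentioned = any(s in img_sources or s in img_variables for s in nightly_sources)
    if bucket_branch == "nightly" and mentioned:
        return "-nightly"
    # ${CI_COMMIT_TAG+-release}: "-release" when the variable is set
    # (even to an empty string), otherwise an empty string.
    return "-release" if commit_tag is not None else ""


def add_suffix(images: str, suffix: str) -> str:
    """Mirror the sed call: append the suffix after each known source name."""
    pattern = "|".join(re.escape(value) for value in SRC.values())
    return re.sub(f"({pattern})", lambda m: m.group(1) + suffix, images)


if __name__ == "__main__":
    sources = SRC["SRC_AGENT"] + ":v123-abcdef0-7-fips-amd64"
    suffix = release_suffix("nightly", sources, "", None)
    print(add_suffix(sources, suffix))
```

On an ordinary branch pipeline with no tag, the suffix is empty and the image references pass through unchanged.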

Changes Summary

Removed	Modified	Added	Renamed
0	0	2	0

ℹ️ Diff available in the job log.

agent-platform-auto-pr · 2024-12-18T21:10:48Z

[Fast Unit Tests Report]

On pipeline 52591054 (CI Visibility), the following jobs did not run any unit tests:

Jobs:

tests_deb-arm64-py3
tests_deb-x64-py3
tests_flavor_dogstatsd_deb-x64
tests_flavor_heroku_deb-x64
tests_flavor_iot_deb-x64
tests_rpm-arm64-py3
tests_rpm-x64-py3
tests_windows-x64

If you modified Go files and expected unit tests to run in these jobs, please double-check the job logs. If you think tests should have been executed, reach out to #agent-devx-help.

agent-platform-auto-pr · 2024-12-18T21:15:03Z

Uncompressed package size comparison

Comparison with ancestor 3f997d247e11871e8395be4aa8cae1250c6f7c14

Diff per package

package	diff	status	size	ancestor	threshold
datadog-agent-arm64-deb	0.00MB	✅	941.95MB	941.94MB	0.50MB
datadog-agent-x86_64-rpm	0.00MB	✅	1022.08MB	1022.08MB	0.50MB
datadog-agent-x86_64-suse	0.00MB	✅	1022.08MB	1022.08MB	0.50MB
datadog-agent-aarch64-rpm	0.00MB	✅	951.24MB	951.24MB	0.50MB
datadog-agent-amd64-deb	0.00MB	✅	1012.77MB	1012.77MB	0.50MB
datadog-dogstatsd-amd64-deb	0.00MB	✅	58.82MB	58.82MB	0.50MB
datadog-dogstatsd-x86_64-rpm	0.00MB	✅	58.90MB	58.90MB	0.50MB
datadog-dogstatsd-x86_64-suse	0.00MB	✅	58.90MB	58.90MB	0.50MB
datadog-dogstatsd-arm64-deb	0.00MB	✅	56.33MB	56.33MB	0.50MB
datadog-heroku-agent-amd64-deb	0.00MB	✅	506.56MB	506.56MB	0.50MB
datadog-iot-agent-amd64-deb	0.00MB	✅	114.01MB	114.01MB	0.50MB
datadog-iot-agent-x86_64-rpm	0.00MB	✅	114.08MB	114.08MB	0.50MB
datadog-iot-agent-x86_64-suse	0.00MB	✅	114.08MB	114.08MB	0.50MB
datadog-iot-agent-arm64-deb	0.00MB	✅	109.43MB	109.43MB	0.50MB
datadog-iot-agent-aarch64-rpm	0.00MB	✅	109.50MB	109.50MB	0.50MB

Decision

✅ Passed
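
For readers unfamiliar with this report, the check it implies can be sketched as follows. The comparison rule (growth over the ancestor must not exceed the threshold) is an assumption inferred from the table columns, not taken from the bot's source, and `PackageSize` and `within_threshold` are hypothetical names.

```python
from typing import NamedTuple


class PackageSize(NamedTuple):
    package: str
    size_mb: float       # uncompressed size on this PR
    ancestor_mb: float   # uncompressed size on the ancestor commit
    threshold_mb: float  # allowed growth


def within_threshold(p: PackageSize) -> bool:
    # Assumed rule: a package passes if it grew by no more than the threshold.
    return (p.size_mb - p.ancestor_mb) <= p.threshold_mb


if __name__ == "__main__":
    # Two rows taken from the table above.
    rows = [
        PackageSize("datadog-agent-arm64-deb", 941.95, 941.94, 0.50),
        PackageSize("datadog-agent-amd64-deb", 1012.77, 1012.77, 0.50),
    ]
    for row in rows:
        print(row.package, "ok" if within_threshold(row) else "too large")
```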

agent-platform-auto-pr · 2024-12-18T21:17:40Z

Test changes on VM

Use this command from test-infra-definitions to manually test this PR's changes on a VM:

inv aws.create-vm --pipeline-id=52591054 --os-family=ubuntu

Note: This applies to commit 61b96d1

cit-pr-commenter · 2024-12-18T21:40:50Z

Regression Detector

Regression Detector Results

Metrics dashboard
Target profiles
Run ID: da145734-322e-493d-aa8d-9feb6e1a68c6

Baseline: 832149a
Comparison: 342f769
Diff

Optimization Goals: ✅ No significant changes detected

Fine details of change detection per experiment

perf	experiment	goal	Δ mean %	Δ mean % CI	trials	links
➖	quality_gate_logs	% cpu utilization	+2.43	[-0.86, +5.73]	1	Logs
➖	tcp_syslog_to_blackhole	ingress throughput	+0.74	[+0.67, +0.81]	1	Logs
➖	uds_dogstatsd_to_api_cpu	% cpu utilization	+0.62	[-0.07, +1.30]	1	Logs
➖	file_to_blackhole_1000ms_latency	egress throughput	+0.23	[-0.54, +1.00]	1	Logs
➖	file_to_blackhole_1000ms_latency_linear_load	egress throughput	+0.21	[-0.26, +0.69]	1	Logs
➖	file_to_blackhole_0ms_latency	egress throughput	+0.12	[-0.72, +0.96]	1	Logs
➖	file_to_blackhole_100ms_latency	egress throughput	+0.07	[-0.58, +0.73]	1	Logs
➖	file_to_blackhole_500ms_latency	egress throughput	+0.06	[-0.72, +0.84]	1	Logs
➖	file_to_blackhole_300ms_latency	egress throughput	+0.01	[-0.63, +0.65]	1	Logs
➖	tcp_dd_logs_filter_exclude	ingress throughput	+0.00	[-0.01, +0.01]	1	Logs
➖	uds_dogstatsd_to_api	ingress throughput	-0.00	[-0.13, +0.12]	1	Logs
➖	file_tree	memory utilization	-0.01	[-0.15, +0.13]	1	Logs
➖	file_to_blackhole_0ms_latency_http1	egress throughput	-0.04	[-0.97, +0.89]	1	Logs
➖	file_to_blackhole_0ms_latency_http2	egress throughput	-0.07	[-0.97, +0.83]	1	Logs
➖	quality_gate_idle_all_features	memory utilization	-0.15	[-0.24, -0.05]	1	Logs bounds checks dashboard
➖	quality_gate_idle	memory utilization	-0.22	[-0.26, -0.18]	1	Logs bounds checks dashboard

Bounds Checks: ✅ Passed

perf	experiment	bounds_check_name	replicates_passed	links
✅	file_to_blackhole_0ms_latency	lost_bytes	10/10
✅	file_to_blackhole_0ms_latency	memory_usage	10/10
✅	file_to_blackhole_0ms_latency_http1	lost_bytes	10/10
✅	file_to_blackhole_0ms_latency_http1	memory_usage	10/10
✅	file_to_blackhole_0ms_latency_http2	lost_bytes	10/10
✅	file_to_blackhole_0ms_latency_http2	memory_usage	10/10
✅	file_to_blackhole_1000ms_latency	memory_usage	10/10
✅	file_to_blackhole_1000ms_latency_linear_load	memory_usage	10/10
✅	file_to_blackhole_100ms_latency	lost_bytes	10/10
✅	file_to_blackhole_100ms_latency	memory_usage	10/10
✅	file_to_blackhole_300ms_latency	lost_bytes	10/10
✅	file_to_blackhole_300ms_latency	memory_usage	10/10
✅	file_to_blackhole_500ms_latency	lost_bytes	10/10
✅	file_to_blackhole_500ms_latency	memory_usage	10/10
✅	quality_gate_idle	memory_usage	10/10	bounds checks dashboard
✅	quality_gate_idle_all_features	memory_usage	10/10	bounds checks dashboard
✅	quality_gate_logs	lost_bytes	10/10
✅	quality_gate_logs	memory_usage	10/10

Explanation

Confidence level: 90.00%
Effect size tolerance: |Δ mean %| ≥ 5.00%

Performance changes are noted in the perf column of each table:

✅ = significantly better comparison variant performance
❌ = significantly worse comparison variant performance
➖ = no significant change in performance

A regression test is an A/B test of target performance in a repeatable rig, where "performance" is measured as "comparison variant minus baseline variant" for an optimization goal (e.g., ingress throughput). Due to intrinsic variability in measuring that goal, we can only estimate its mean value for each experiment; we report uncertainty in that value as a 90.00% confidence interval denoted "Δ mean % CI".

For each experiment, we decide whether a change in performance is a "regression" -- a change worth investigating further -- if all of the following criteria are true:

Its estimated |Δ mean %| ≥ 5.00%, indicating the change is big enough to merit a closer look.
Its 90.00% confidence interval "Δ mean % CI" does not contain zero, indicating that if our statistical model is accurate, there is at least a 90.00% chance there is a difference in performance between baseline and comparison variants.
Its configuration does not mark it "erratic".
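
The three criteria above combine into a single predicate. A minimal sketch, assuming the 5.00% tolerance stated in the explanation; the `Experiment` type and `is_regression` function are hypothetical names and not part of the Regression Detector itself:

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Experiment:
    name: str
    delta_mean_pct: float  # "Δ mean %"
    ci_low: float          # lower bound of "Δ mean % CI"
    ci_high: float         # upper bound of "Δ mean % CI"
    erratic: bool = False  # whether the configuration marks it "erratic"


def is_regression(exp: Experiment, tolerance_pct: float = 5.0) -> bool:
    """Apply the three criteria: big enough, CI excludes zero, not erratic."""
    big_enough = abs(exp.delta_mean_pct) >= tolerance_pct
    ci_excludes_zero = exp.ci_low > 0 or exp.ci_high < 0
    return big_enough and ci_excludes_zero and not exp.erratic


if __name__ == "__main__":
    # Two rows taken from the results table above.
    rows = [
        Experiment("quality_gate_logs", 2.43, -0.86, 5.73),
        Experiment("tcp_syslog_to_blackhole", 0.74, 0.67, 0.81),
    ]
    for row in rows:
        print(row.name, is_regression(row))
```

The `tcp_syslog_to_blackhole` row shows why all three criteria are needed: its interval excludes zero, but the change is far below the 5.00% tolerance, so it is not flagged.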

CI Pass/Fail Decision

✅ Passed. All Quality Gates passed.

quality_gate_idle_all_features, bounds check memory_usage: 10/10 replicas passed. Gate passed.
quality_gate_logs, bounds check lost_bytes: 10/10 replicas passed. Gate passed.
quality_gate_logs, bounds check memory_usage: 10/10 replicas passed. Gate passed.
quality_gate_idle, bounds check memory_usage: 10/10 replicas passed. Gate passed.

Copy FIPS linux cipher compliance tests from PR 29560 and hook up e2e…

aef35cd

… job dependencies

jeremy-hanna added the changelog/no-changelog and qa/no-code-change (No code change in Agent code requiring validation) labels Dec 18, 2024

jeremy-hanna requested review from a team as code owners December 18, 2024 20:46

github-actions bot added the medium review (PR review might take time) label Dec 18, 2024

Update CODEOWNERS for new test suite

416bcd8

jeremy-hanna added 16 commits December 18, 2024 17:28

Use correctly flavor agentparam in e2e test

f704971

Add missed ci job dependencies

0796c16

Merge branch 'main' into jeremy.hanna/add-fips-compliance-e2e-tests

0adbb01

Update e2e provisioner import to reflect main

c67945a

Fix import naming issue from master and update docker provisioner to …

50122d0

…use fips images

Improve the test case and use envvar for cipher templates in fips-ser…

8b1279b

…ver entrypoint

Fix linter naming issue

7d9f1ff

Fix linter issue with gofmt

49d5a34

use docker-compose command instead of docker compose to stop service

9fd7626

Supply the compose file for the compose commands

e10407d

Green build passing formatted compose files to fips-server service st…

0712ed2

…art/stop commands

Use down and up instead of stop and run for fips-server service to pi…

31a700c

…ck up new env and clear logs

Fix flag ordering with docker-compose up

8bbd4fb

Merge branch 'main' into jeremy.hanna/add-fips-compliance-e2e-tests

08dd804

Change compose tmpl to yaml and remove disabled FIPS tests to get gre…

285ebc8

…en build

Merge branch 'main' into jeremy.hanna/add-fips-compliance-e2e-tests

4b9dc3d

jeremy-hanna added 6 commits January 8, 2025 16:56

Run the diagnose command with GOFIPS=1

7443668

Dont delete tests on failure for troubleshooting

e29f1b6

Fix linter error

f8b150c

Sprintf the exec command

342f769

Merge branch 'main' into jeremy.hanna/add-fips-compliance-e2e-tests

ba89b54

try using the image in the compose yaml

61b96d1
