Resource Health Deployment Guide⚓︎

The Resource Health Building Block (BB) provides a flexible framework for monitoring the health and status of resources within the EOEPCA platform. This includes core platform services as well as derived or user-provided resources such as datasets, workflows, or user applications.

Introduction⚓︎

The Resource Health BB allows you to:

Define and schedule automated health checks (e.g. daily, hourly).
Observe and visualise check outcomes via a web dashboard.
Integrate with external services (e.g. IAM for OIDC authentication, Data Access, Resource Catalogue).
Store results in OpenSearch, optionally visualising them with OpenSearch Dashboards.
Collect telemetry via OpenTelemetry, enabling advanced monitoring and alerting.

Components Overview⚓︎

Resource Health Web

Dashboard and front-end for viewing health checks and results.
By default, can be secured with OIDC authentication (e.g. via Keycloak).

Resource Health API(s)

Telemetry API for gathering check results and metrics.
Health Checks API (Check Manager) for listing, scheduling, and managing checks.

Health Check Runner

A flexible engine that executes your custom health checks at scheduled intervals.

Mock API (optional sample)

An example test resource used in demonstration checks (e.g. an hourly check to a mock endpoint).

OpenSearch & OpenSearch Dashboards

Stores logs, results, and trace data from your checks.
Provides advanced visualisation and analytics features.

OpenTelemetry Collector

Receives telemetry from health checks and forwards it to OpenSearch.

Prerequisites⚓︎

Before deploying the Resource Health Building Block, ensure you have the following:

Component	Requirement	Documentation Link
Kubernetes	Cluster (tested on v1.28)	Installation Guide
Git	Properly installed	Installation Guide
Helm	Version 3.5 or newer	Installation Guide
Helm plugins	`helm-git`: Version 1.3.0 tested	Installation Guide
kubectl	Configured for cluster access	Installation Guide
Ingress Controller	Properly installed (e.g., NGINX)	Installation Guide
Internal TLS Certificates	ClusterIssuer for internal certificates	Internal TLS Setup

Clone the Deployment Guide Repository:

git clone https://github.com/EOEPCA/deployment-guide
cd deployment-guide/scripts/resource-health

Validate your environment:

bash check-prerequisites.sh

This script checks common prerequisites, including your Kubernetes/Helm installation, Git, and any required Helm plugins.

Deployment Steps⚓︎

1. Run the Configuration Script⚓︎

The configure-resource-health.sh script gathers basic configuration inputs (such as your internal ClusterIssuer for TLS, storage class, etc.) and generates a generated-values.yaml that tailors the Resource Health deployment to your environment.

bash configure-resource-health.sh

During execution, you will be prompted for:

INGRESS_HOST: Base hostname for the platform ingress. The dashboard is served at resource-health.${INGRESS_HOST}.
INTERNAL_CLUSTER_ISSUER: Name of the cert-manager ClusterIssuer for internal TLS. (Default: eoepca-ca-clusterissuer)
PERSISTENT_STORAGECLASS: Storage class for persistent volumes. (Default: standard)

2. Create a Keycloak Client⚓︎

Use the create-client.sh script in the /scripts/utils/ directory. This script prompts you for basic details and automatically creates a Keycloak client in your chosen realm:

bash ../utils/create-client.sh

When prompted:

Keycloak Admin Username and Password: Enter the credentials of your Keycloak admin user (these are also in ~/.eoepca/state if you have them set).
Keycloak base domain: e.g. auth.example.com
Realm: Typically eoepca.
Confidential Client?: Specify true to create a CONFIDENTIAL client.
Client ID: For the Resource Health, you should use resource-health.
Client name and description: Provide any helpful text (e.g. Resource Health).
Client secret: Enter the Client Secret that was generated during the configuration script (check ~/.eoepca/state).
Subdomain: Use resource-health.
Additional Subdomains: Leave blank.
Additional Hosts: Leave blank.

After it completes, you should see a JSON snippet confirming the newly created client.

3. Deploy the Resource Health BB (Helm)⚓︎

Apply Secrets

bash apply-secrets.sh

This script creates the necessary secrets for the Resource Health BB.

Install or upgrade Resource Health

Note: While the Resource Health BB is not yet in the official EOEPCA Helm charts, you can install it directly from the GitHub repository.

Clone the Resource Health repository and update dependencies:

git clone -b 2.0.0 https://github.com/EOEPCA/resource-health.git reference-repo
helm dependency update reference-repo/resource-health-reference-deployment

Install or upgrade the Resource Health Helm chart:

helm upgrade -i resource-health reference-repo/resource-health-reference-deployment \
  -f generated-values.yaml \
  -n resource-health --create-namespace

This deployment includes a preconfigured health check that runs every minute.

4. Configure Ingress⚓︎

Resource Health is designed to work with different Ingress and OIDC configurations.

For the purpose of this guide, the configuration script created a sample Ingress resource in generated-ingress.yaml that you can apply or adapt to your environment. The output depends on the ingress controller you have set in the ~/.eoepca/state file.

APISIX

kubectl apply -f apisix/plugin-api-auth.yaml -n resource-health
kubectl apply -f apisix/plugin-browser-auth.yaml -n resource-health
kubectl apply -f generated-ingress.yaml -n resource-health

NGINX

kubectl apply -f generated-ingress.yaml -n resource-health

5. Configure Keycloak Client⚓︎

To ensure your Keycloak user has proper permissions in OpenSearch, you must configure role mapping explicitly.

Step 1: Create a Keycloak Realm Role⚓︎

Log in to your Keycloak instance (auth.${INGRESS_HOST}).
Navigate to your realm (eoepca).
Click on Realm Roles, then click Create Role.
Create a new role named opensearch_user.

Step 2: Assign the Role to your Keycloak User⚓︎

Still in Keycloak, go to Users and select your user (e.g. eoepcauser).
Click on the Role Mappings tab.
Assign the newly created opensearch_user realm role to this user.

Step 3: Add the Realm Role Mapper to your Keycloak Client⚓︎

Go to Clients and select your resource-health client.
Navigate to Client Scopes → resource-health-dedicated and click Add Mapper.
Configure the User Realm Role template mapper as follows:

Field	Value
Mapper Type	`User Realm Role`
Name	`realm roles`
Multivalued	`ON` ✅
Token Claim Name	`roles`
Claim JSON Type	`String`
Add to ID token	`ON` ✅
Add to Access token	`ON` ✅
Add to Userinfo	`ON` (recommended) ✅

This configuration ensures Keycloak will correctly include realm roles in the JWT.
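
One way to confirm that the mapper is working is to decode the payload of a token issued for the resource-health client and look for the roles claim. The sketch below uses only the Python standard library and does not verify the token signature, so it is suitable for inspection only. The function names decode_jwt_payload and has_realm_role are illustrative and not part of Resource Health.

```python
import base64
import json


def decode_jwt_payload(token: str) -> dict:
    """Decode the payload segment of a JWT without verifying its signature."""
    segments = token.split(".")
    if len(segments) != 3:
        raise ValueError("expected a JWT with three dot-separated segments")
    payload = segments[1]
    # JWT segments are unpadded base64url; restore the padding before decoding.
    payload += "=" * (-len(payload) % 4)
    return json.loads(base64.urlsafe_b64decode(payload))


def has_realm_role(token: str, role: str = "opensearch_user", claim: str = "roles") -> bool:
    """Return True if the given claim (default "roles") contains the role."""
    roles = decode_jwt_payload(token).get(claim, [])
    # A mapper without Multivalued enabled may emit a single string instead of a list.
    if isinstance(roles, str):
        roles = [roles]
    return role in roles
```

Calling has_realm_role(token) with an access token obtained for your user should return True once the role and mapper from the steps above are in place.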

6. Monitor the Deployment⚓︎

Once deployed, you will have to wait a minute until the first health check runs before you can access the Resource Health Web dashboard.

After the Helm installation finishes, check that all pods are running in the resource-health namespace:

kubectl get all -n resource-health
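
If you prefer a scripted check over reading the kubectl output by eye, the pod list can be fetched as JSON and filtered by phase. This is a minimal sketch, assuming kubectl is on the PATH and configured for the cluster. The helper names unhealthy_pods and fetch_pods are hypothetical.

```python
import json
import subprocess

# Pods of completed jobs report "Succeeded"; long-running services report "Running".
HEALTHY_PHASES = {"Running", "Succeeded"}


def unhealthy_pods(pod_list: dict) -> list[str]:
    """Return the names of pods whose phase is neither Running nor Succeeded."""
    return [
        item["metadata"]["name"]
        for item in pod_list.get("items", [])
        if item.get("status", {}).get("phase") not in HEALTHY_PHASES
    ]


def fetch_pods(namespace: str = "resource-health") -> dict:
    """Fetch the pod list for a namespace as parsed JSON via kubectl."""
    result = subprocess.run(
        ["kubectl", "get", "pods", "-n", namespace, "-o", "json"],
        check=True,
        capture_output=True,
        text=True,
    )
    return json.loads(result.stdout)
```

Running unhealthy_pods(fetch_pods()) returns an empty list when every pod is in a healthy phase. Note that the Running phase alone does not guarantee that all containers in a pod are ready.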

Validation⚓︎

Run the validation script:

bash validation.sh

Access the Resource Health Web dashboard at:

https://resource-health.${INGRESS_HOST}

Access the Health Checks at:

https://resource-health.${INGRESS_HOST}/api/healthchecks/v1/checks/
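
The same endpoint can be queried from a script. The sketch below builds the URL shown above from INGRESS_HOST and sends a GET request. It assumes the API accepts an OIDC access token as a Bearer header and returns JSON; neither is confirmed by this guide, so adapt the request to your deployment. The function names are illustrative.

```python
import json
import urllib.request


def checks_url(ingress_host: str) -> str:
    """Build the Health Checks API URL for a given INGRESS_HOST."""
    return f"https://resource-health.{ingress_host}/api/healthchecks/v1/checks/"


def fetch_checks(ingress_host: str, access_token: str, timeout: float = 10.0):
    """GET the list of checks and parse the response body as JSON."""
    request = urllib.request.Request(
        checks_url(ingress_host),
        headers={
            # Assumption: the API is protected by OIDC and accepts Bearer tokens.
            "Authorization": f"Bearer {access_token}",
            "Accept": "application/json",
        },
    )
    with urllib.request.urlopen(request, timeout=timeout) as response:
        return json.load(response)
```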

Usage⚓︎

1. Defining Health Checks⚓︎

Health checks can be defined either in the Helm chart’s values under resource-health.healthchecks.checks or via the UI. Each check has:

name
schedule (a cron expression like "@hourly" or "0 8 * * *")
requirements (optional Python packages)
script (the actual test logic)
env (environment variables, e.g. references to external services)
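
A malformed schedule is an easy mistake to make when writing checks by hand. The following shallow sanity check accepts either a common cron macro or an expression with exactly five whitespace-separated fields. It is not a full cron parser, and the set of macros that Resource Health itself accepts is an assumption here.

```python
# Assumption: the common cron macros; the set supported by Resource Health may differ.
CRON_MACROS = {"@yearly", "@annually", "@monthly", "@weekly", "@daily", "@midnight", "@hourly"}


def looks_like_cron(expression: str) -> bool:
    """Shallow check: a known @macro or exactly five whitespace-separated fields."""
    expression = expression.strip()
    if expression.startswith("@"):
        return expression in CRON_MACROS
    return len(expression.split()) == 5
```

For example, looks_like_cron("0 8 * * *") and looks_like_cron("@hourly") both pass, while "0 8 * *" is rejected because it has only four fields.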

Helm-based (check inside the generated-values.yaml):

resource-health:
  healthchecks:
    checks:
      - name: daily-trivial-check
        schedule: "0 8 * * *"
        requirements: "https://example.com/requirements.txt"
        script: "https://example.com/trivial_check.py"
        env:
          - name: SOME_HOST
            value: "https://some-endpoint.example.com"

Apply with:

helm upgrade -i resource-health reference-repo/resource-health-reference-deployment -f generated-values.yaml -n resource-health

UI-based:

Visit the Resource Health Web dashboard (resource-health.${INGRESS_HOST}) and select the Create new check dropdown to define a new health check.

Fill in the form with the same fields as in the Helm-based approach, or supply your own test script.

Uninstallation⚓︎

To remove all Resource Health components and the namespace:

helm uninstall resource-health -n resource-health
kubectl delete namespace resource-health