
Processing - OGC API Processes Engine⚓︎

Introduction⚓︎

The OGC API Processes Engine lets users deploy, manage, and execute OGC Application Packages through a standardised API. It’s built on the ZOO-Project zoo-project-dru implementation, which supports OGC WPS 1.0.0/2.0.0 and OGC API Processes Parts 1 & 2.

The engine supports multiple execution backends depending on your infrastructure:

| Execution Engine | Backend | Best For |
|---|---|---|
| Calrissian | Kubernetes jobs in dedicated namespaces | Pure Kubernetes environments |
| Toil | HPC batch schedulers (HTCondor, Slurm, PBS, LSF, etc.) | Hybrid Kubernetes + HPC environments |

Both backends use the same OGC API Processes interface - the difference is where the actual computation runs.


Prerequisites⚓︎

Common Requirements⚓︎

| Component | Requirement | Documentation Link |
|---|---|---|
| Kubernetes | Cluster (tested on v1.32) | Installation Guide |
| Helm | Version 3.5 or newer | Installation Guide |
| kubectl | Configured for cluster access | Installation Guide |
| Ingress | Properly installed | Installation Guide |
| TLS Certificates | Managed via cert-manager or manually | TLS Certificate Management Guide |
| Stage-In S3 | Accessible | MinIO Deployment Guide |
| Stage-Out S3 | Accessible | MinIO Deployment Guide |

Calrissian-Specific Requirements⚓︎

No additional requirements beyond the common prerequisites. Calrissian runs CWL workflows as Kubernetes jobs, so everything stays within your cluster.
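During execution this means that Calrissian-managed pods appear alongside your other workloads. As a rough way to observe them while a job is running (pod and namespace names depend on the zoo-project-dru configuration, so treat this as a sketch rather than a fixed naming scheme):

# Watch pods across all namespaces while a job is executing
kubectl get pods -A --watch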

Toil/HPC-Specific Requirements⚓︎

You’ll need access to an HPC cluster running one of the batch schedulers that Toil supports: HTCondor, Slurm, PBS/Torque/PBS Pro, LSF, or Grid Engine.

Setting up a Local HTCondor (Development/Testing Only)⚓︎

This section is only needed for Toil. If you are using Calrissian, skip it.

Warning: This setup is for development and testing purposes only. Do not use this in production - use your organisation’s HPC infrastructure instead.

If you don’t have access to an HPC cluster and want to test the Toil integration locally, you can install MiniCondor, a single-node HTCondor package designed for testing.

Install HTCondor using the official script:

# Download and run the HTCondor installer (installs minicondor by default)
curl -fsSL https://get.htcondor.org | sudo /bin/bash -s -- --no-dry-run

# Verify HTCondor is running
condor_status

You should see output showing your local machine as a condor slot. If condor_status returns an error, check that the condor service is running:

sudo systemctl status condor
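If the service is not active, it can usually be started and enabled via systemd (assuming the installer registered the condor unit, which is the default on most distributions):

# Start HTCondor now and on every boot
sudo systemctl enable --now condor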

Configure Docker for HTCondor jobs:

HTCondor needs to run containers for CWL workflows. Add your user to the docker group and create a wrapper to mount /etc/hosts for DNS resolution:

# Add your user to the docker group
sudo usermod -a -G docker $USER

# Create a docker wrapper for DNS resolution in containers
sudo tee /usr/local/bin/docker > /dev/null << 'EOF'
#!/usr/bin/python3
import sys, os
n = sys.argv
n[0] = "/usr/bin/docker"
if "run" in n:
    n.insert(n.index("run") + 1, "-v=/etc/hosts:/etc/hosts:ro")
os.execv(n[0], n)
EOF
sudo chmod +x /usr/local/bin/docker

# Log out and back in for the docker group change to take effect

After logging back in, verify HTCondor can see your machine:

condor_status
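It is also worth confirming that the docker group change has taken effect and that containers can be started without sudo before handing jobs to HTCondor:

# Check the current session is in the docker group
id -nG | grep -w docker

# Check containers run without sudo
docker run --rm hello-world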

Setting up Toil WES⚓︎

Already have a Toil WES service? Skip to Clone the Deployment Guide Repository.

If you need to set up Toil WES on your HPC cluster (or local MiniCondor), follow these steps. The examples use HTCondor, but the process is similar for other schedulers.

Install Toil

Install Toil in a Python virtual environment on storage accessible to all compute nodes:

# Create directories for Toil venv and job storage
mkdir -p ~/toil ~/toil/storage
python3 -m venv --prompt toil ~/toil/venv

# Activate and install Toil with required extras
source ~/toil/venv/bin/activate
python3 -m pip install toil[cwl,htcondor,server,aws] htcondor

Note: Replace htcondor with your batch system if different (e.g., toil[cwl,slurm,server,aws] for Slurm).

Test the Installation

Run a sample CWL workflow to verify everything works:

source ~/toil/venv/bin/activate

# Download a test application
wget https://github.com/EOEPCA/deployment-guide/raw/refs/heads/main/scripts/processing/oapip/examples/convert-url-app.cwl

# Create test directories and parameters
jobid=$(uuidgen)
mkdir -p ~/toil/storage/test/{work_dir,job_store}
cat <<EOF > ~/toil/storage/test/work_dir/$jobid.params.yaml
fn: resize
url: https://eoepca.org/media_portal/images/logo6_med.original.png
size: 50%
EOF

# Run the test (adjust --batchSystem for your scheduler)
toil-cwl-runner \
    --batchSystem htcondor \
    --workDir ~/toil/storage/test/work_dir \
    --jobStore ~/toil/storage/test/job_store/$jobid \
    convert-url-app.cwl#convert-url \
    ~/toil/storage/test/work_dir/$jobid.params.yaml

If successful, you’ll see JSON output representing a STAC Item. Clean up:

rm -rf ~/toil/storage/test convert-url-app.cwl
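To confirm that the workflow really went through the batch scheduler rather than running locally, check the recent HTCondor job history (for other schedulers, use the equivalent accounting command, e.g. sacct for Slurm):

# Show the most recent HTCondor jobs
condor_history -limit 5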

Start the Toil WES Service

The WES service needs RabbitMQ as a message broker and Celery as the worker that processes the job queue.

Start RabbitMQ:

docker run -d --restart=always --name toil-wes-rabbitmq -p 127.0.0.1:5672:5672 rabbitmq:alpine
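You can quickly confirm the broker container is up before starting Celery:

# The container should be listed with an "Up" status
docker ps --filter name=toil-wes-rabbitmq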

Start Celery:

source ~/toil/venv/bin/activate
celery --broker=amqp://guest:guest@127.0.0.1:5672// -A toil.server.celery_app multi start w1 \
   --loglevel=INFO --pidfile=$HOME/celery.pid --logfile=$HOME/celery.log
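To check that the worker has registered with the broker, query its status (mirroring the broker URL used above) or inspect the worker log:

source ~/toil/venv/bin/activate
celery --broker=amqp://guest:guest@127.0.0.1:5672// -A toil.server.celery_app status

# Alternatively, check the worker log
tail -n 20 $HOME/celery.log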

Start the Toil WES server:

source ~/toil/venv/bin/activate
mkdir -p $HOME/toil/storage/workdir $HOME/toil/storage/workflows

TOIL_WES_BROKER_URL=amqp://guest:guest@127.0.0.1:5672// nohup toil server \
    --host 0.0.0.0 \
    --work_dir $HOME/toil/storage/workflows \
    --opt=--batchSystem=htcondor \
    --opt=--workDir=$HOME/toil/storage/workdir \
    --logFile $HOME/toil.log \
    --logLevel INFO \
    -w 1 &>$HOME/toil_run.log </dev/null &

echo "$!" > $HOME/toil.pid
sleep 5

Note: Adjust --batchSystem=htcondor to match your scheduler.

Verify the WES Service

curl -s http://localhost:8080/ga4gh/wes/v1/service-info | jq

You should see JSON service information. Your WES endpoint URL will be:

http://<your-hpc-host>:8080/ga4gh/wes/v1/


The remaining steps apply to both Calrissian and Toil.

Clone the Deployment Guide Repository⚓︎

git clone https://github.com/EOEPCA/deployment-guide
cd deployment-guide/scripts/processing/oapip

Validate your environment:

bash check-prerequisites.sh


Deployment⚓︎

Run the Configuration Script⚓︎

bash configure-oapip.sh

Common Configuration Parameters⚓︎

  • INGRESS_HOST: Base domain for ingress hosts.
    • Example: example.com
  • CLUSTER_ISSUER (if using cert-manager): Name of the ClusterIssuer.
    • Example: letsencrypt-http01-apisix
  • PERSISTENT_STORAGECLASS: Storage class for persistent volumes.
    • Example: standard
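
The script writes its answers into generated-values.yaml, which is used later by the Helm deployment. It can be worth reviewing this file before deploying, and (once the zoo-project Helm repository has been added, as shown in the Deploy step) comparing it against the chart defaults:

# Review the values produced by the configuration script
cat generated-values.yaml

# Compare against the chart's default values
helm show values zoo-project/zoo-project-dru --version 0.9.1 | less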

Workspace Integration⚓︎

The engine supports two options for stage-out of processing results:

  • With the EOEPCA+ Workspace BB - results go directly to the user’s workspace bucket
  • With a dedicated S3 bucket - results go to a pre-configured shared bucket

This is controlled by:

  • USE_WORKSPACE_API: Set to true to integrate with user Workspace storage

If using Workspace integration:

  • The Workspace BB must already be deployed
  • The username from the JWT Bearer token (or path prefix for open services) determines which workspace bucket to use, following the ws-<username> naming convention
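
For example, a job submitted under user eoepcauser stages its results out to the bucket ws-eoepcauser. If you have an mc alias configured for the MinIO instance backing the Workspace (the alias name below is a placeholder, substitute your own), you can confirm the workspace bucket exists:

# <alias> is whatever mc alias you configured for your MinIO instance
mc ls <alias>/ws-eoepcauser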

Stage-Out S3 Configuration⚓︎

Ensure you have an S3-compatible object store set up. See the MinIO Deployment Guide if needed.

  • S3_ENDPOINT, S3_ACCESS_KEY, S3_SECRET_KEY, S3_REGION: Credentials for Stage-Out storage

Stage-In S3 Configuration⚓︎

If your input data is hosted separately from output storage:

  • STAGEIN_S3_ENDPOINT, STAGEIN_S3_ACCESS_KEY, STAGEIN_S3_SECRET_KEY, STAGEIN_S3_REGION

OIDC Configuration⚓︎

Note: The EOEPCA OIDC protection requires the APISIX Ingress Controller. If you’re using a different ingress controller, OIDC will not be available and you can skip this configuration.

If using APISIX, you can enable OIDC authentication during configuration. When prompted for the Client ID, we recommend oapip-engine.

See the IAM Building Block guide for IAM setup, and Enable OIDC with Keycloak below for post-deployment configuration.

Calrissian Configuration⚓︎

When prompted for execution engine, select calrissian. You’ll need to configure:

  • NODE_SELECTOR_KEY: Determines which nodes run processing workflows
  • NODE_SELECTOR_VALUE: Value for the node selector
    • Example: linux
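
The selector must match a label that is actually present on the nodes intended to run processing workloads. If necessary, inspect and add labels (the node name, key and value below are placeholders):

# Inspect existing node labels
kubectl get nodes --show-labels

# Add a label matching NODE_SELECTOR_KEY / NODE_SELECTOR_VALUE
kubectl label node <node-name> <node-selector-key>=<node-selector-value>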

Toil Configuration⚓︎

When prompted for execution engine, select toil. You’ll need to configure:

  • OAPIP_TOIL_WES_URL: Your Toil WES endpoint; it must end with /ga4gh/wes/v1/
  • OAPIP_TOIL_WES_USER: WES service username
    • Example: test
  • OAPIP_TOIL_WES_PASSWORD: WES service password (htpasswd format)
    • Example: $2y$12$ci.4U63YX83CwkyUrjqxAucnmi2xXOIlEF6T/KdP9824f1Rf1iyNG

Note: If you set up Toil WES without authentication (as in the setup guide above), use placeholder credentials - they’ll be ignored.
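
If your WES service does use basic authentication, the password is expected as an htpasswd-style bcrypt hash, which can be generated with the htpasswd tool (from apache2-utils / httpd-tools); the username and password below are examples only:

# Generate a bcrypt hash for user "test"
htpasswd -nbB test 'my-secret-password'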

Important: Network Reachability

The WES URL must be reachable from within the Kubernetes cluster.

  • If Toil runs on the same machine as Kubernetes, use the host’s IP address (e.g., http://192.168.1.100:8080/ga4gh/wes/v1/)
  • If Toil runs on a separate HPC system, ensure network routing and firewall rules allow traffic from the Kubernetes pod network to the WES endpoint

You can verify connectivity from within the cluster:

kubectl run -it --rm debug --image=alpine --restart=Never -- \
  wget -qO- http://<your-wes-host>:8080/ga4gh/wes/v1/service-info

Deploy the Helm Chart⚓︎

helm repo add zoo-project https://zoo-project.github.io/charts/
helm repo update zoo-project
helm upgrade -i zoo-project-dru zoo-project/zoo-project-dru \
  --version 0.9.1 \
  --values generated-values.yaml \
  --namespace processing \
  --create-namespace

Optional: Enable OIDC with Keycloak⚓︎

This requires the APISIX Ingress Controller. If you’re using a different Ingress Controller, skip to Validation.

Skip this section if you don’t need IAM protection right now - the engine will work, just without access restrictions.

To protect OAPIP endpoints with Keycloak tokens and policies, follow these steps after enabling OIDC in the configuration script.

First, ensure you’ve followed the IAM Deployment Guide and have Keycloak running.

Create a Keycloak Client⚓︎

source ~/.eoepca/state
cat <<EOF | kubectl apply -f -
apiVersion: v1
kind: Secret
metadata:
  name: ${OAPIP_CLIENT_ID}-keycloak-client
  namespace: iam-management
stringData:
  client_secret: ${OAPIP_CLIENT_SECRET}
---
apiVersion: openidclient.keycloak.m.crossplane.io/v1alpha1
kind: Client
metadata:
  name: ${OAPIP_CLIENT_ID}
  namespace: iam-management
spec:
  forProvider:
    realmId: ${REALM}
    clientId: ${OAPIP_CLIENT_ID}
    name: Processing OAPIP Engine
    description: Processing OAPIP Engine OIDC
    enabled: true
    accessType: CONFIDENTIAL
    rootUrl: ${HTTP_SCHEME}://zoo.${INGRESS_HOST}
    baseUrl: ${HTTP_SCHEME}://zoo.${INGRESS_HOST}
    adminUrl: ${HTTP_SCHEME}://zoo.${INGRESS_HOST}
    serviceAccountsEnabled: true
    directAccessGrantsEnabled: true
    standardFlowEnabled: true
    oauth2DeviceAuthorizationGrantEnabled: true
    useRefreshTokens: true
    authorization:
      - allowRemoteResourceManagement: false
        decisionStrategy: UNANIMOUS
        keepDefaults: true
        policyEnforcementMode: ENFORCING
    validRedirectUris:
      - "/*"
    webOrigins:
      - "/*"
    clientSecretSecretRef:
      name: ${OAPIP_CLIENT_ID}-keycloak-client
      key: client_secret
  providerConfigRef:
    name: provider-keycloak
    kind: ProviderConfig
EOF
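
Crossplane reconciles this manifest into an actual Keycloak client. Before proceeding, you can check that the managed resource reports Synced and Ready:

source ~/.eoepca/state
kubectl -n iam-management get client.openidclient.keycloak.m.crossplane.io ${OAPIP_CLIENT_ID}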

Protect the User’s Processing Context⚓︎

The ZOO-Project uses a path prefix to establish user context (e.g., /<username>/ogc-api/processes/...). You can protect this so only the owning user can access it.

This example protects the context for eoepcauser (see Create Test Users):

source ~/.eoepca/state
export OAPIP_USER="${KEYCLOAK_TEST_USER}"
envsubst < protect-oapip-user.yaml | kubectl apply -f -

This creates: eoepcauser-group, eoepcauser-membership, eoepcauser-resource, eoepcauser-policy, eoepcauser-access.

Create APISIX Route Ingress⚓︎

kubectl apply -f generated-ingress.yaml

Confirm Protection⚓︎

Wait for the ingress and TLS to be established first.

bash resource-protection-validation.sh

If you see 401 Unauthorized errors when using a valid token, check your token and the resource protection configuration.

For more detailed testing, see Resource Protection with Keycloak Policies.


Validation⚓︎

Automated Validation⚓︎

bash validation.sh

Web Endpoints⚓︎

Check these are accessible:

  • ZOO-Project Swagger UI - https://zoo.${INGRESS_HOST}/swagger-ui/oapip/
  • OGC API Processes Landing Page - https://zoo.${INGRESS_HOST}/ogc-api/processes/

Expected Kubernetes Resources⚓︎

kubectl get pods -n processing

All pods should be Running with no CrashLoopBackOff or Error states.

Using the API⚓︎

This walkthrough covers deploying, executing, monitoring, and retrieving results from a sample application.

Prefer a notebook? Run ../../../notebooks/run.sh and open the OAPIP Engine Validation notebook at http://localhost:8888.

Initialise Environment⚓︎

bash -l
source ~/.eoepca/state

List Processes⚓︎

source oapip-utils.sh
curl --silent --show-error \
  -X GET "${OAPIP_HOST}/${OAPIP_USER}/ogc-api/processes" \
  ${OAPIP_AUTH_HEADER:+-H "$OAPIP_AUTH_HEADER"} \
  -H "Accept: application/json" | jq

Deploy Process convert⚓︎

source oapip-utils.sh
curl --silent --show-error \
  -X POST "${OAPIP_HOST}/${OAPIP_USER}/ogc-api/processes" \
  ${OAPIP_AUTH_HEADER:+-H "$OAPIP_AUTH_HEADER"} \
  -H "Content-Type: application/json" \
  -H "Accept: application/json" \
  -d @- <<EOF | jq
{
  "executionUnit": {
    "href": "https://raw.githubusercontent.com/EOEPCA/deployment-guide/refs/heads/main/scripts/processing/oapip/examples/convert-url-app.cwl",
    "type": "application/cwl"
  }
}
EOF

Verify it’s deployed:

source oapip-utils.sh
curl --silent --show-error \
  -X GET "${OAPIP_HOST}/${OAPIP_USER}/ogc-api/processes/convert-url" \
  ${OAPIP_AUTH_HEADER:+-H "$OAPIP_AUTH_HEADER"} \
  -H "Accept: application/json" | jq

Execute Process convert⚓︎

source oapip-utils.sh
JOB_ID=$(
  curl --silent --show-error \
    -X POST "${OAPIP_HOST}/${OAPIP_USER}/ogc-api/processes/convert-url/execution" \
    ${OAPIP_AUTH_HEADER:+-H "$OAPIP_AUTH_HEADER"} \
    -H "Content-Type: application/json" \
    -H "Accept: application/json" \
    -H "Prefer: respond-async" \
    -d @- <<EOF | jq -r '.jobID'
  {
    "inputs": {
      "fn": "resize",
      "url":  "https://eoepca.org/media_portal/images/logo6_med.original.png",
      "size": "50%"
    }
  }
EOF
)

echo "JOB ID: ${JOB_ID}"

Check Execution Status⚓︎

source oapip-utils.sh
curl --silent --show-error \
  -X GET "${OAPIP_HOST}/${OAPIP_USER}/ogc-api/jobs/${JOB_ID}" \
  ${OAPIP_AUTH_HEADER:+-H "$OAPIP_AUTH_HEADER"} \
  -H "Accept: application/json" | jq

The status field shows running, successful, or failed.
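
Rather than re-issuing the request by hand, you can poll until the job leaves the running state; a small convenience loop built on the same call:

source oapip-utils.sh
while true; do
  STATUS=$(curl --silent --show-error \
    -X GET "${OAPIP_HOST}/${OAPIP_USER}/ogc-api/jobs/${JOB_ID}" \
    ${OAPIP_AUTH_HEADER:+-H "$OAPIP_AUTH_HEADER"} \
    -H "Accept: application/json" | jq -r '.status')
  echo "$(date +%T) job ${JOB_ID}: ${STATUS}"
  [ "${STATUS}" = "running" ] || break
  sleep 10
done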

Check Execution Results⚓︎

Once the job completes successfully:

source oapip-utils.sh
curl --silent --show-error \
  -X GET "${OAPIP_HOST}/${OAPIP_USER}/ogc-api/jobs/${JOB_ID}/results" \
  ${OAPIP_AUTH_HEADER:+-H "$OAPIP_AUTH_HEADER"} \
  -H "Accept: application/json" | jq

Use the MinIO mc CLI or the MinIO console (installed as per the MinIO Deployment Guide) to access the output file in the Stage-Out bucket.

source ~/.eoepca/state
if [ "${USE_WORKSPACE_API=}" = "true" ]; then BUCKET_NAME="ws-${OAPIP_USER}"; else BUCKET_NAME="eoepca"; fi
xdg-open "https://console-minio.${INGRESS_HOST}/browser/${BUCKET_NAME}/processing-results/"
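
Alternatively, with the mc CLI (the alias name below is a placeholder for whatever you configured when setting up MinIO):

source ~/.eoepca/state
if [ "${USE_WORKSPACE_API:-}" = "true" ]; then BUCKET_NAME="ws-${OAPIP_USER}"; else BUCKET_NAME="eoepca"; fi
mc ls --recursive <alias>/${BUCKET_NAME}/processing-results/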

Undeploy Process convert⚓︎

source oapip-utils.sh
curl --silent --show-error \
  -X DELETE "${OAPIP_HOST}/${OAPIP_USER}/ogc-api/processes/convert-url" \
  ${OAPIP_AUTH_HEADER:+-H "$OAPIP_AUTH_HEADER"} \
  -H "Accept: application/json" | jq

Monitoring Jobs on HPC (Toil only)⚓︎

When using Toil, you can also monitor jobs directly on the HPC cluster:

Toil WES logs:

tail -n 20 ~/celery.log

HPC queue status:

For HTCondor:

condor_q -all

For Slurm:

squeue -u $USER


Uninstallation⚓︎

Remove the OAPIP Engine⚓︎

source ~/.eoepca/state
export OAPIP_USER="${KEYCLOAK_TEST_USER}"
kubectl delete -f generated-ingress.yaml
envsubst < protect-oapip-user.yaml | kubectl delete -f -
kubectl -n iam-management delete client.openidclient.keycloak.m.crossplane.io ${OAPIP_CLIENT_ID}
kubectl -n iam-management delete secret ${OAPIP_CLIENT_ID}-keycloak-client
helm -n processing uninstall zoo-project-dru
kubectl delete ns processing

Stop Toil WES (Toil only)⚓︎

If you set up Toil WES on your HPC cluster:

# Stop Toil server
kill $(cat $HOME/toil.pid)

# Stop Celery
celery --broker=amqp://guest:guest@127.0.0.1:5672// -A toil.server.celery_app multi stop w1 \
   --pidfile=$HOME/celery.pid

# Stop RabbitMQ
docker stop toil-wes-rabbitmq
docker rm toil-wes-rabbitmq


Further Reading⚓︎

General:

Calrissian:

Toil:


Feedback⚓︎

If you have any issues or suggestions, please open an issue on the EOEPCA+ Deployment Guide GitHub Repository.