Inference

Goal

Wrap the inference Python command line tool as a Common Workflow Language CommandLineTool and execute it with a CWL runner.

Lab

This step has a dedicated lab available at /workspace/inference-eoap/practice-labs/CommandLineTool.ipynb

How to wrap a step as a CWL CommandLineTool

The CWL document below shows the inference Python command line tool step wrapped as a CWL CommandLineTool:

cwl-cli/inference
cwlVersion: v1.0

class: CommandLineTool
id: inference
requirements:
  InlineJavascriptRequirement: {}
  EnvVarRequirement:
    envDef:
      PATH: /usr/local/sbin:/usr/local/bin:/usr/sbin:/usr/bin:/sbin:/bin
      PYTHONPATH: /app
  ResourceRequirement:
    coresMax: 2
    ramMax: 4096
hints:
  DockerRequirement:
    dockerPull: localhost/inference:latest
baseCommand: ["python", "-m", "app"]
arguments: []
inputs:
  input-item:
    type: Directory
    inputBinding:
      prefix: --input-item
outputs:
  stac_catalog:
    outputBinding:
      glob: .
    type: Directory

Let's break down the key components of this CWL document:

cwlVersion: v1.0: Specifies the version of the CWL specification that this document follows.
class: CommandLineTool: Indicates that this CWL document defines a command-line tool.
id: crop: Provides a unique identifier for this tool, which can be used to reference it in workflows.
requirements: Specifies the requirements and dependencies of the tool. In this case, it defines the following:
- InlineJavascriptRequirement: This requirement allows the use of inline JavaScript expressions in the tool.
- EnvVarRequirement: It sets environment variables. In this case, it sets the PYTHONPATH environment variable to "/app."
- ResourceRequirement: Specifies resource requirements for running the tool, including the maximum number of CPU cores and maximum RAM.
- DockerRequirement: This requirement specifies the Docker container to be used. It indicates that the tool should be executed in a Docker container with the image localhost/crop:latest.
baseCommand: Defines the base command to be executed in the container. In this case, it's running a Python module called "app" with the command python -m app.
arguments: This section is empty, meaning there are no additional command-line arguments specified here. The tool is expected to receive its arguments via the input parameters.
inputs: Describes the input parameters for the tool, including their types and how they are bound to command-line arguments. The tool expects the following inputs:
- --input-item: the path to the folder where a Sentinel-2 Level-1C has been staged and converted to COG.
outputs: Specifies the tool's output. It defines an output parameter named stac_catalog, which is of type Directory. The outputBinding section specifies that the tool is expected to produce one or more files (glob: .) as output.

Steps

Run the CWL document using the cwltool CWL runner to execute the tile based classification inference:

terminal

export WORKSPACE=/workspace/inference-eoap

# check if the results.json file exists
if [ ! -f ${WORKSPACE}/runs/results.json ]; then
    echo "results.json file not found, run the stage-and-cog.cwl workflow first."
    exit 1
fi

command -v podman >/dev/null 2>&1 && { 
    flag="--podman"
}

cwltool ${flag} \
    --outdir ${WORKSPACE}/runs \
    ${WORKSPACE}/cwl-cli/inference.cwl \
    $( cat ${WORKSPACE}/runs/results.json | jq -r .stac_catalog.path )

Expected outcome

The folder /workspace/inference-eoap/runs contains:

(base) jovyan@jupyter-fbrito--training:~/inference-eoap$ tree runs
runs
├── am8yu9it
│   ├── S2A_T33TVM_20240125T100359_L1C-classification
│   │   ├── S2A_T33TVM_20240125T100359_L1C-classification.json
│   │   └── classification.tif
│   └── catalog.json
├── results.json
└── ydmns4od
    ├── S2A_T33TVM_20240125T100359_L1C
    │   ├── S2A_T33TVM_20240125T100359_L1C.json
    │   ├── blue.tif
    │   ├── cirrus.tif
    │   ├── coastal.tif
    │   ├── green.tif
    │   ├── nir.tif
    │   ├── nir08.tif
    │   ├── nir09.tif
    │   ├── red.tif
    │   ├── rededge1.tif
    │   ├── rededge2.tif
    │   ├── rededge3.tif
    │   ├── swir16.tif
    │   └── swir22.tif
    └── catalog.json

4 directories, 19 files