How-to Guide: Creating Files at Runtime¶
This guide explains how to create files dynamically at runtime in a CWL workflow using the InitialWorkDirRequirement.
The focus will be on defining a shell script (run.sh) as part of the workflow to handle runtime commands.
Objective¶
Dynamically create a run.sh script at runtime that executes specific commands for each tool in the workflow.
Relevant Block:
InitialWorkDirRequirement:
  listing:
- entryname: run.sh
entry: |-
        #!/bin/bash
        rio stack $@
- InitialWorkDirRequirement: Specifies files or directories to be created dynamically in the working directory.
- entryname: The name of the file to be created (- run.sh).
- entry: The contents of the file, which in this case is a shell script.
Steps¶
- Understand the Workflow Structure
The workflow (runtime-file.cwl) processes a Sentinel-2 STAC item to:
- Fetch band URLs (stactool).
- Stack TIFFs (rio_stacktool) using a runtime script.
- Apply color correction (rio_colortool) using another runtime script.
- Define the InitialWorkDirRequirement
For rio_stack Tool
The run.sh script stacks the provided TIFFs into a single file (stacked.tif):
InitialWorkDirRequirement:
  listing:
- entryname: run.sh
entry: |-
        #!/bin/bash
        rio stack $@
- Contents:- The script (run.sh) executes rio stack with all TIFF file paths passed as arguments ($@).
 
- The script (
For rio_color Tool
The run.sh script applies color correction to the stacked TIFF file and produces an RGB TIFF (rgb.tif):
InitialWorkDirRequirement:
  listing:
- entryname: run.sh
entry: |-
        #!/bin/bash
        rio color -j -1 --out-dtype uint8 $1 rgb.tif "gamma 3 0.95, sigmoidal rgb 35 0.13"
- Contents:- The script (run.sh) applies color correction to the file passed as the first argument ($1).
 
- The script (
- Specify the baseCommand
Each tool uses /bin/bash to execute the dynamically created run.sh script:
baseCommand: ["/bin/bash", "run.sh"]
This ensures the created script run.sh is executed in a shell environment.
- Complete Workflow Example
Here’s the full CWL example (runtime-file.cwl):
cwlVersion: v1.2
$graph:
- class: Workflow
id: main
requirements:
      InlineJavascriptRequirement: {}
      NetworkAccess:
networkAccess: true
ScatterFeatureRequirement: {}
    inputs:
      stac-item:
type: string
bands:
type: string[]
outputs:
      rgb-tif:
outputSource: step_color/rgb
type: File
steps:
      step_curl:
        in:
stac_item: stac-item
common_band_name: bands
out:
          - hrefs
run: "#stac"
scatter: common_band_name
scatterMethod: dotproduct
step_stack:
        in:
          tiffs:
source: step_curl/hrefs
out:
          - stacked
run: "#rio_stack"
      step_color:
        in:
          stacked:
source: step_stack/stacked
out:
          - rgb
run: "#rio_color"
- class: CommandLineTool
id: stac
requirements:
      DockerRequirement:
dockerPull: docker.io/curlimages/curl:latest
baseCommand: curl
stdout: message
arguments:
      - $( inputs.stac_item )
inputs:
      stac_item:
type: string
common_band_name:
type: string
outputs:
      hrefs:
type: string
outputBinding:
glob: message
loadContents: true
outputEval: |
            ${
              const assets = JSON.parse(self[0].contents).assets;
              const bandKey = Object.keys(assets).find(key =>
                assets[key]['eo:bands'] &&
                assets[key]['eo:bands'].length === 1 &&
                assets[key]['eo:bands'].some(band => band.common_name === inputs.common_band_name)
              );
              if (!bandKey) {
                throw new Error(`No valid asset found for band: ${inputs.common_band_name}`);
              }
              return assets[bandKey].href;
            }
- class: CommandLineTool
id: rio_stack
requirements:
      DockerRequirement:
dockerPull: ghcr.io/eoap/how-to/rio:1.0.0
EnvVarRequirement:
        envDef:
GDAL_TIFF_INTERNAL_MASK: YES
GDAL_HTTP_MERGE_CONSECUTIVE_RANGES: YES
CPL_VSIL_CURL_ALLOWED_EXTENSIONS: ".tif"
      InitialWorkDirRequirement:
        listing:
- entryname: run.sh
entry: |-
              #!/bin/bash
              rio stack $@
baseCommand: ["/bin/bash", "run.sh"]
    arguments:
- valueFrom: "${ \n var arr = [];\n for(var i=0; i<inputs.tiffs.length; i++) {\n arr.push(inputs.tiffs[i]); \n }\n return arr; \n }\n"
      - stacked.tif
inputs:
      tiffs:
type: string[]
outputs:
      stacked:
type: File
outputBinding:
glob: stacked.tif
- class: CommandLineTool
id: rio_color
requirements:
      DockerRequirement:
dockerPull: ghcr.io/eoap/how-to/rio:1.0.0
InitialWorkDirRequirement:
        listing:
- entryname: run.sh
entry: |-
              #!/bin/bash
              rio color -j -1 --out-dtype uint8 $1 rgb.tif "gamma 3 0.95, sigmoidal rgb 35 0.13"
baseCommand: ["/bin/bash", "run.sh"]
    arguments:
      - $( inputs.stacked.path )
inputs:
      stacked:
type: File
outputs:
      rgb:
type: File
outputBinding:
glob: rgb.tif
Graphical representation:
- Run the Workflow
Run the workflow with the following command:
cwltool runtime-file.cwl \
  --stac-item https://earth-search.aws.element84.com/v0/collections/sentinel-s2-l2a-cogs/items/S2B_53HPA_20210723_0_L2A \
  --bands red \
  --bands green \
  --bands blue
INFO /opt/hostedtoolcache/Python/3.13.3/x64/bin/cwltool 3.1.20250110105449
INFO Resolved '../cwl-workflows/runtime-file.cwl' to 'file:///home/runner/work/how-to/how-to/cwl-workflows/runtime-file.cwl'
INFO [workflow ] start
INFO [workflow ] starting step step_curl
INFO [step step_curl] start
INFO [job step_curl] /tmp/512t_7s9$ docker \
run \
-i \
--mount=type=bind,source=/tmp/512t_7s9,target=/QKdTXN \
--mount=type=bind,source=/tmp/szvap4h7,target=/tmp \
--workdir=/QKdTXN \
--read-only=true \
--log-driver=none \
--user=1001:118 \
--rm \
--cidfile=/tmp/m99a85zl/20250620072003-458470.cid \
--env=TMPDIR=/tmp \
--env=HOME=/QKdTXN \
docker.io/curlimages/curl:latest \
curl \
https://earth-search.aws.element84.com/v0/collections/sentinel-s2-l2a-cogs/items/S2B_53HPA_20210723_0_L2A > /tmp/512t_7s9/message
% Total % Received % Xferd Average Speed Time Time Time Current
Dload Upload Total Spent Left Speed
0 0 0 0 0 0 0 0 --:--:-- --:--:-- --:--:-- 0
100 10156 100 10156 0 0 62852 0 --:--:-- --:--:-- --:--:-- 63080
INFO [job step_curl] completed success
INFO [step step_curl] start
INFO [job step_curl_2] /tmp/xjffa47j$ docker \
run \
-i \
--mount=type=bind,source=/tmp/xjffa47j,target=/QKdTXN \
--mount=type=bind,source=/tmp/ydbzzqx4,target=/tmp \
--workdir=/QKdTXN \
--read-only=true \
--log-driver=none \
--user=1001:118 \
--rm \
--cidfile=/tmp/6qc__ldf/20250620072004-465669.cid \
--env=TMPDIR=/tmp \
--env=HOME=/QKdTXN \
docker.io/curlimages/curl:latest \
curl \
https://earth-search.aws.element84.com/v0/collections/sentinel-s2-l2a-cogs/items/S2B_53HPA_20210723_0_L2A > /tmp/xjffa47j/message
% Total % Received % Xferd Average Speed Time Time Time Current
Dload Upload Total Spent Left Speed
0 0 0 0 0 0 0 0 --:--:-- --:--:-- --:--:-- 0
100 10156 100 10156 0 0 32627 0 --:--:-- --:--:-- --:--:-- 32655
INFO [job step_curl_2] completed success
INFO [step step_curl] start
INFO [job step_curl_3] /tmp/nokz76j5$ docker \
run \
-i \
--mount=type=bind,source=/tmp/nokz76j5,target=/QKdTXN \
--mount=type=bind,source=/tmp/ajtt_ptt,target=/tmp \
--workdir=/QKdTXN \
--read-only=true \
--log-driver=none \
--user=1001:118 \
--rm \
--cidfile=/tmp/aqe_082j/20250620072005-472792.cid \
--env=TMPDIR=/tmp \
--env=HOME=/QKdTXN \
docker.io/curlimages/curl:latest \
curl \
https://earth-search.aws.element84.com/v0/collections/sentinel-s2-l2a-cogs/items/S2B_53HPA_20210723_0_L2A > /tmp/nokz76j5/message
% Total % Received % Xferd Average Speed Time Time Time Current
Dload Upload Total Spent Left Speed
0 0 0 0 0 0 0 0 --:--:-- --:--:-- --:--:-- 0
100 10156 100 10156 0 0 53916 0 --:--:-- --:--:-- --:--:-- 54021
INFO [job step_curl_3] completed success
INFO [step step_curl] completed success
INFO [workflow ] starting step step_stack
INFO [step step_stack] start
INFO [job step_stack] /tmp/on6baw_r$ docker \
run \
-i \
--mount=type=bind,source=/tmp/on6baw_r,target=/QKdTXN \
--mount=type=bind,source=/tmp/6s_9nz_0,target=/tmp \
--workdir=/QKdTXN \
--read-only=true \
--user=1001:118 \
--rm \
--cidfile=/tmp/yy5rppvw/20250620072006-493196.cid \
--env=TMPDIR=/tmp \
--env=HOME=/QKdTXN \
--env=CPL_VSIL_CURL_ALLOWED_EXTENSIONS=.tif \
--env=GDAL_HTTP_MERGE_CONSECUTIVE_RANGES=YES \
--env=GDAL_TIFF_INTERNAL_MASK=YES \
ghcr.io/eoap/how-to/rio:1.0.0 \
/bin/bash \
run.sh \
https://sentinel-cogs.s3.us-west-2.amazonaws.com/sentinel-s2-l2a-cogs/53/H/PA/2021/7/S2B_53HPA_20210723_0_L2A/B04.tif \
https://sentinel-cogs.s3.us-west-2.amazonaws.com/sentinel-s2-l2a-cogs/53/H/PA/2021/7/S2B_53HPA_20210723_0_L2A/B03.tif \
https://sentinel-cogs.s3.us-west-2.amazonaws.com/sentinel-s2-l2a-cogs/53/H/PA/2021/7/S2B_53HPA_20210723_0_L2A/B02.tif \
stacked.tif
INFO [job step_stack] Max memory used: 1281MiB
INFO [job step_stack] completed success
INFO [step step_stack] completed success
INFO [workflow ] starting step step_color
INFO [step step_color] start
INFO [job step_color] /tmp/e6l9yjuo$ docker \
run \
-i \
--mount=type=bind,source=/tmp/e6l9yjuo,target=/QKdTXN \
--mount=type=bind,source=/tmp/ipkr8q24,target=/tmp \
--mount=type=bind,source=/tmp/on6baw_r/stacked.tif,target=/var/lib/cwl/stg660feec5-9086-4f85-a010-eedd9a9b8eee/stacked.tif,readonly \
--workdir=/QKdTXN \
--read-only=true \
--user=1001:118 \
--rm \
--cidfile=/tmp/wqhbm494/20250620072027-356593.cid \
--env=TMPDIR=/tmp \
--env=HOME=/QKdTXN \
ghcr.io/eoap/how-to/rio:1.0.0 \
/bin/bash \
run.sh \
/var/lib/cwl/stg660feec5-9086-4f85-a010-eedd9a9b8eee/stacked.tif
INFO [job step_color] Max memory used: 742MiB
INFO [job step_color] completed success
INFO [step step_color] completed success
INFO [workflow ] completed success
INFO Final process status is success
- Expected Output
- Final Output (rgb.tif): An RGB TIFF file generated by the rio_color tool.
{
  "rgb-tif": {
"location": "file:///home/runner/work/how-to/how-to/docs/rgb.tif",
"basename": "rgb.tif",
"class": "File",
"checksum": "sha1$1be3372cc826482bc689219f417971852b209460",
"size": 361747464,
"path": "/home/runner/work/how-to/how-to/docs/rgb.tif"
}
}
Key Takeaways¶
- InitialWorkDirRequirement:
- Dynamically creates files like scripts or configuration files.
- Use entryname to define the file name and entry for the content.
- Flexibility:
- Simplifies command-line definitions by embedding logic in runtime scripts.
This approach allows you to customize workflows with runtime behavior using dynamically created files.