Core concepts
Tasks
A Task is the basic unit of execution in Tork. Tasks are executed in the order they appear on a job.
Example
- name: say hello
var: task1
image: ubuntu:mantic
run: |
echo -n hello world > $TORK_OUTPUT
When using the default Docker runtime, tasks execute within a Docker container. The type of container (or image) is specified using the image
property.
Tasks can use any of the publicly available docker images, and support for private repositories coming in the near future.
The work to be done in the container is specified in the run
property.
Image
When using the default Docker runtime image
, spcifies the Docker image to use for the task.
You can use images from any publicly available registries.
- name: say hello
var: task1
# uses Docker hub's ubuntu image
image: ubuntu:mantic
run: |
echo -n hello world > $TORK_OUTPUT
Private registries
You can also use private docker registries by using the registry
property.
- name: populate a variable
image: myregistry.com/my_image:latest
registry:
username: user
password: mypassword
run: |
echo "do work"
Avoid having your registry credentials in the clear. As an alternative, you can create a Docker config file on your Tork host with the necessary credentials:
{
"auths": {
"myregistry.com": {
"auth": "base64encodedusername:base64encodedpassword"
}
}
}
And then pass the path to the config file as a parameter to the Tork worker using the TORK_RUNTIME_DOCKER_CONFIG
environment variable.
Queue
Queues are the primary mechanism for routing tasks in Tork.
Tasks are always routed to a queue. When not specified, tasks are routed to the default
queue.
Suppose you have a task that is very CPU heavy. But since large machines are typically more expensive than smaller machines you'd like to route only specific tasks to this queue, while sending the rest of your workload to the default
queue.
To solve for this, you create a new pool of Tork workers and have them subscribe to the arbitrarily named highcpu
queue. Then in your job definitions, you send all "heavy" tasks to that queue.
name: my job
tasks:
- name: easy task
queue: default # does not have to be specified
image: ubuntu:mantic
run: |
echo "do some light lifting"
- name: say hello
# will route traffic to Tork workers that are subscribed
# to the 'highcpu' queue.
queue: highcpu
image: ubuntu:mantic
run: |
echo "do some heavy lifting"
Output
Tasks may produce output by directing their output to the file specified in the $TORK_OUTPUT
environment variable and specifying the key to store the task's output in the job's context using the var
property.
The output from a task can be used by subsequent tasks. Example:
name: example job
tasks:
- name: populate a variable
image: ubuntu:mantic
# The task must specify the name of the
# variable under which its results will be
# stored in the job's context
var: task1
run: |
echo -n "world" > "$TORK_OUTPUT"
- name: say hello
image: ubuntu:mantic
env:
# refer to the outputs of the previous task
NAME: '{{ tasks.task1 }}'
run: |
echo -n hello $NAME
Expressions
Tork uses the expr expression language to:
- Evaluate C-style embedded expressions in the job defintion.
- Evaluate a task's
if
condition to determine whether a task should run.
Most expressions use the job's context which has the following namespaces:
inputs
- to access any values from the job'sinputs
block.secrets
- to access any values from the job'ssecrets
block.tasks
- to access the results of previous tasks.job
- to access the job's metadata.
Examples:
When an if
expression evaluates to anything except false
, the task will run.
name: example job
inputs:
run: 'false'
tasks:
- name: say something
if: "{{ inputs.run == 'true' }}"
image: ubuntu:mantic
run: |
echo "this should not execute"
name: example job
inputs:
message: hello world
tasks:
- name: say something
image: ubuntu:mantic
env:
MESSAGE: '{{ inputs.message }}'
run: |
echo $MESSAGE
name: hello job
tasks:
- name: do something
var: someOutput
image: ubuntu:mantic
run: |
echo -n hello world > $TORK_OUTPUT
- name: print result of previous task
image: ubuntu:mantic
run: |
echo -n $OUTPUT
env:
OUTPUT: '{{tasks.someOutput}}'
name: my job
secrets:
api_key: 1111-1111-1111-1111
tasks:
- name: my task
queue: default
image: alpine:latest
run: |
curl -X POST -H "API_KEY: $API_KEY" http://example.com
env:
# use the 'secrets' namespace to inject a secret
API_KEY: '{{secrets.api_key}}'
Functions
There are a number of built-in and additional functions that can be used in expressions.
- name: print the length of a string
image: ubuntu:mantic
env:
LENGTH: '{{ len("hello world") }}'
run: |
echo "The length of the string is: $LENGTH"
Environment Variables
You can set custom environment variables for a given task by using the env
property:
- name: print a message
image: ubuntu:mantic
env:
INTRO: hello world
OUTRO: bye world
run: |
echo $INTRO
echo $OUTRO
Environment variables can also be populated using expressions.
name: example job
inputs:
message: hello world
tasks:
- name: say something
image: ubuntu:mantic
env:
MESSAGE: '{{ inputs.message }}'
run: |
echo $MESSAGE
Secrets
Sensitive values can be specified in the job's secrets
block so they can be auto-redacted from API responses.
name: my job
secrets:
api_key: 1111-1111-1111-1111
tasks:
- name: my task
queue: default
image: alpine:latest
run: |
curl -X POST -H "API_KEY: $API_KEY" http://example.com
env:
# use the 'secrets' namespace to inject a secret
API_KEY: '{{secrets.api_key}}'
Warning!
Tork automatically redacts secrets printed to the log, but you should avoid printing secrets to the log intentionally.
Files
Files is a convenient means to create arbitrary files in the task's working directory.
- name: Get the post
image: python:3
files:
script.py: |
import requests
url = "https://jsonplaceholder.typicode.com/posts/1"
response = requests.get(url)
data = response.json()
print(data['title'])
run: |
pip install requests
python script.py > $TORK_OUTPUT
Parallel Task
To run a group of tasks concurrently, wrap them in a parallel
task.
Example:
- name: a parallel task
parallel:
tasks:
- image: ubuntu:mantic
run: sleep 2
- image: ubuntu:mantic
run: sleep 1
- image: ubuntu:mantic
run: sleep 3
Each Task
Executes the task to for each item
in list
, in parallel.
You can optionally control the maximum number of tasks that would execute across the cluster using the concurrency
property.
Examples:
- name: sample each task
each:
list: '{{ sequence(1,5) }}'
concurrency: 3 # max 3 tasks would execute concurrently
task:
image: ubuntu:mantic
env:
ITEM: '{{ item.value }}'
INDEX: '{{ item.index }}'
run: echo -n HELLO $ITEM at $INDEX
name: my job
inputs:
people: '[{"name":"Michael"},{"name":"George"}]'
tasks:
- name: print each person
each:
list: '{{fromJSON(inputs.people)}}'
task:
name: my first task
run: echo $NAME
image: alpine:3.18.3
env:
NAME: '{{item.value.name}}'
Sub-Job Task
A task can start another job. When a sub-job completes or fails it marks its parent task as COMPLETED
or FAILED
respectively.
- name: a task that starts a sub-job
subjob:
name: my sub job
tasks:
- name: hello sub task
image: ubuntu:mantic
run: echo start of sub-job
- name: bye task
image: ubuntu:mantic
run: echo end of sub-job
Sub jobs may also be spawned in detached
mode, meaning that the parent/spawning job will not wait for their completion but would simply "fire and forget" these jobs. Example:
- name: a task that starts a detached job
subjob:
name: my sub job
detached: true
tasks:
- name: hello sub task
image: ubuntu:mantic
run: echo some work
Mounts
Mounts are often used to share state between the task and its pre
and post
tasks (see Pre/Post tasks) but can also be used to access persistent data on the host.
When using the default Docker runtime there are three types of mounts available:
volume
- a Docker volume based mount. Volumes are removed at the termination of the task.
- name: convert the first 5 seconds of a video
image: jrottenberg/ffmpeg:3.4-alpine
run: ffmpeg -i /tmp/my_video.mov -t 5 /tmp/output.mp4
mounts:
- type: volume
target: /tmp
pre:
- name: download the remote file
image: alpine:3.18.3
run: wget http://example.com/my_video.mov
bind
- used to mount a host path to a container path.
- name: convert the first 5 seconds of a video
image: jrottenberg/ffmpeg:3.4-alpine
run: ffmpeg -i /mnt/videos/my_video.mov -t 5 /mnt/videos/output.mp4
mounts:
- type: bind
target: /mnt/videos
source: /host/path
tmpfs
- atmpfs
mount is temporary, and only persisted in the host memory. When the container stops, thetmpfs
mount is removed, and files written there won't be persisted.
Pre/Post Tasks
Worker nodes are stateless by design. Which means that no state is left on the worker node after a task terminates. Moreover tasks can execute on any of the available worker so there's no guarantee that a task that is scheduled to execute will execute on the same node that the task just prior to it executed.
However, it is sometimes desireable to execute a task - potentially even using a different image - before or after a task executes and share the state of that execution with the "main" task we want to execute. This is where pre
and post
tasks come in.
Each task can define a set of tasks that will be executed prior to its execution, and after its execution.
The pre
and post
tasks always execute on the same worker node which will execute the task itself and are considered to be an atomic part of the task. That is, a failure in any of the pre/post tasks is considered a failure of the entire task.
Additionally, any mounts
and/or networks
defined on the primary task are also accessible to the pre
and post
tasks.
Example:
- name: convert the first 5 seconds of a video
image: jrottenberg/ffmpeg:3.4-alpine
run: |
ffmpeg -i /tmp/my_video.mov -t 5 /tmp/output.mp4
mounts:
- type: volume
target: /tmp
pre:
- name: download the remote file
image: alpine:3.18.3
run: |
wget \
http://example.com/my_video.mov \
-O /tmp/my_video.mov
post:
- name: upload the converted file
image: alpine:3.18.3
run: |
wget \
--post-file=/tmp/output.mp4 \
https://devnull-as-a-service.com/dev/null
Retry
Tasks can be retried on failure using the retry
property. This property allows you to specify the number of retry attempts, the initial delay before the first retry, and an optional scaling factor to increase the delay between subsequent retries.
Example:
retry:
limit: 5 # will retry up to 5 times
initialDelay: 5s # optional: default 1s (max: 5m)
scalingFactor: # optional: default 2 (max: 10)
Priority
To increase the priority of a task in its queue, use the priority
property.
Acceptable values are between 0
(no priority) and 9
(highest priority).
name: my job
tasks:
- name: my first task
image: alpine:3.18.3
run: sleep 3
priority: 1
You can also set the default priority for all tasks at the job level:
name: my job
defaults:
priority: 1
tasks:
- name: my first task
image: alpine:3.18.3
run: sleep 3
Limits
By default, a task has no resource constraints and can use as much of a given resource as the host’s kernel scheduler allows.
For more fine-grained control, default limits can be overridden at an individual task level:
- name: some task
image: alpine:3.18.3
run: |
echo "do some work"
limits:
cpus: .5
memory: 10m
Timeout
Tasks can specify a maximum time of execution using the timeout
property.
- name: some task
image: alpine:3.18.3
# this task will automatically fail
# after 5 seconds if not completed
timeout: 5s
run: |
sleep 30
GPUs
When using the default Docker runtime, you can use the gpus
property to access the host's GPU resources.
This property maps to Docker's --gpus
flag.
- name: some task
image: alpine:3.18.3
gpus: all
run: |
echo "do some gpu based work"
Tags
Tasks may specify an arbitrary list of metadata tags.
- name: some task
image: alpine:3.18.3
tags:
- some-tag
- another-tag
run: |
echo "do some work"
Workdir
You can use the workdir
property to set the task's default working directory.
- name: some task
image: alpine:3.18.3
workdir: /workspace
run: |
echo "do some work"