Redun .md .md

Here, we’ll see how to track redun workflows with lamindb.

Amend the workflow

Here is how to instrument a redun workflow for tracking with lamindb:

  1. Add ln.track() to the main() task (see on GitHub)

  2. Register desired output files or folders by creating artifacts for them (see on GitHub):

    ln.Artifact(output_path, key="data/results.tar.gz").save()
    
  3. Add a finish() task that calls ln.finish() (see on GitHub)

  4. Optionally cache/stage input files (see on GitHub)

Why not use @ln.flow() for main()?

Because main() in redun is typically a scheduler/executor task rather than a task that performs the actual computation. ln.flow() would then just track the execution time of scheduling, and not an actual compute run.

If one wanted to use @ln.flow() it’s advisable to wrap the scheduling main() task:

@ln.flow()
def run_pipeline(...):
    scheduler = Scheduler()
    result = scheduler.run(main(...))  # run the main task
    ln.Artifact(result.path, key="data/results.tgz").save()
    return result

Run redun

Let’s see what the input files are:

!ls ./fasta
Hide code cell output
KLF4.fasta  MYC.fasta  PO5F1.fasta  SOX2.fasta

Create a lamindb test instance:

# pip install lamindb redun git+http://github.com/laminlabs/redun-lamin-fasta
!lamin init --storage ./test-redun-lamin
Hide code cell output
 initialized lamindb: testuser1/test-redun-lamin

Register each input file individually as an artifact:

import lamindb as ln
import json

ln.Artifact.from_dir("./fasta").save()
Hide code cell output
 connected lamindb: testuser1/test-redun-lamin
! folder is outside existing storage location, will copy files from ./fasta to /home/runner/work/redun-lamin/redun-lamin/docs/test-redun-lamin/fasta
! no run & transform got linked, call `ln.track()` & re-run
! no run & transform got linked, call `ln.track()` & re-run
! no run & transform got linked, call `ln.track()` & re-run
! no run & transform got linked, call `ln.track()` & re-run
SQLRecordList([Artifact(uid='hA9U6ZtQqHkzx5a90000', key='fasta/PO5F1.fasta', description=None, suffix='.fasta', kind=None, otype=None, size=477, hash='-7iJgveFO9ia0wE1bqVu6g', n_files=None, n_observations=None, branch_id=1, created_on_id=1, space_id=1, storage_id=1, run_id=None, schema_id=None, created_by_id=1, created_at=2026-05-05 13:59:50 UTC, is_locked=False, version_tag=None, is_latest=True),
               Artifact(uid='k4hkSYSa7s2hAhHi0000', key='fasta/MYC.fasta', description=None, suffix='.fasta', kind=None, otype=None, size=536, hash='WGbEtzPw-3bQEGcngO_pHQ', n_files=None, n_observations=None, branch_id=1, created_on_id=1, space_id=1, storage_id=1, run_id=None, schema_id=None, created_by_id=1, created_at=2026-05-05 13:59:50 UTC, is_locked=False, version_tag=None, is_latest=True),
               Artifact(uid='LOqkdWPP3qu0NKb60000', key='fasta/SOX2.fasta', description=None, suffix='.fasta', kind=None, otype=None, size=414, hash='C5q_yaFXGk4SAEpfdqBwnQ', n_files=None, n_observations=None, branch_id=1, created_on_id=1, space_id=1, storage_id=1, run_id=None, schema_id=None, created_by_id=1, created_at=2026-05-05 13:59:50 UTC, is_locked=False, version_tag=None, is_latest=True),
               Artifact(uid='9UI3oCBqkiKfwGO10000', key='fasta/KLF4.fasta', description=None, suffix='.fasta', kind=None, otype=None, size=609, hash='LyuoYkWs4SgYcH7P7JLJtA', n_files=None, n_observations=None, branch_id=1, created_on_id=1, space_id=1, storage_id=1, run_id=None, schema_id=None, created_by_id=1, created_at=2026-05-05 13:59:50 UTC, is_locked=False, version_tag=None, is_latest=True)])

Run the redun workflow:

!redun run workflow.py main --input-dir ./fasta --tag run=test-run  1> run_logs.txt 2>run_logs.txt

Inspect the logs:

!cat run_logs.txt
Hide code cell output
 connected lamindb: testuser1/test-redun-lamin
 script invoked with: run workflow.py main --input-dir ./fasta --tag run=test-run
 created Transform('npwXN8JbIYjp0000', key='workflow.py'), started new Run('AJCH7fVhsKcjv9LM') at 2026-05-05 13:59:57 UTC
 recommendation: to identify the script across renames, pass the uid: ln.track("npwXN8JbIYjp")
File(path=/home/runner/work/redun-lamin/redun-lamin/docs/test-redun-lamin/data/results.tgz, hash=b33841c9)
_tutorial.lib.digest_protein_task(input_fasta=File(path=/home/runner/work/redun-lamin/redun-lamin/docs/test-redun-lamin/.lamindb/hA9U6ZtQqHkzx5a90000.fasta, hash=bb2d0ad8), enzyme_regex='[KR]', missed_cleavages=0, min_length=4, max_length=75) on default
[redun] Run    Job b129d2fb:  bioinformatics_pipeline_tutorial.lib.digest_protein_task(input_fasta=File(path=/home/runner/work/redun-lamin/redun-lamin/docs/test-redun-lamin/.lamindb/k4hkSYSa7s2hAhHi0000.fasta, hash=29eb7edd), enzyme_regex='[KR]', missed_cleavages=0, min_length=4, max_length=75) on default
[redun] Run    Job ba55ae45:  bioinformatics_pipeline_tutorial.lib.digest_protein_task(input_fasta=File(path=/home/runner/work/redun-lamin/redun-lamin/docs/test-redun-lamin/.lamindb/LOqkdWPP3qu0NKb60000.fasta, hash=405fde63), enzyme_regex='[KR]', missed_cleavages=0, min_length=4, max_length=75) on default
[redun] Run    Job 57d4da21:  bioinformatics_pipeline_tutorial.lib.digest_protein_task(input_fasta=File(path=/home/runner/work/redun-lamin/redun-lamin/docs/test-redun-lamin/.lamindb/9UI3oCBqkiKfwGO10000.fasta, hash=b99ed1fc), enzyme_regex='[KR]', missed_cleavages=0, min_length=4, max_length=75) on default
[redun] Run    Job e9b20b16:  bioinformatics_pipeline_tutorial.lib.count_amino_acids_task(input_fasta=File(path=/home/runner/work/redun-lamin/redun-lamin/docs/test-redun-lamin/.lamindb/hA9U6ZtQqHkzx5a90000.fasta, hash=bb2d0ad8), input_peptides=File(path=/home/runner/work/redun-lamin/redun-lamin/docs/test-redun-lamin/data/hA9U6ZtQqHkzx5a90000.peptides.txt, hash=a5403017), amino_acid='C') on default
[redun] Run    Job bb0ac837:  bioinformatics_pipeline_tutorial.lib.count_amino_acids_task(input_fasta=File(path=/home/runner/work/redun-lamin/redun-lamin/docs/test-redun-lamin/.lamindb/k4hkSYSa7s2hAhHi0000.fasta, hash=29eb7edd), input_peptides=File(path=/home/runner/work/redun-lamin/redun-lamin/docs/test-redun-lamin/data/k4hkSYSa7s2hAhHi0000.peptides.txt, hash=5dbb91a6), amino_acid='C') on default
[redun] Run    Job f17891bc:  bioinformatics_pipeline_tutorial.lib.count_amino_acids_task(input_fasta=File(path=/home/runner/work/redun-lamin/redun-lamin/docs/test-redun-lamin/.lamindb/LOqkdWPP3qu0NKb60000.fasta, hash=405fde63), input_peptides=File(path=/home/runner/work/redun-lamin/redun-lamin/docs/test-redun-lamin/data/LOqkdWPP3qu0NKb60000.peptides.txt, hash=c9765d8a), amino_acid='C') on default
[redun] Run    Job 44db0d96:  bioinformatics_pipeline_tutorial.lib.count_amino_acids_task(input_fasta=File(path=/home/runner/work/redun-lamin/redun-lamin/docs/test-redun-lamin/.lamindb/9UI3oCBqkiKfwGO10000.fasta, hash=b99ed1fc), input_peptides=File(path=/home/runner/work/redun-lamin/redun-lamin/docs/test-redun-lamin/data/9UI3oCBqkiKfwGO10000.peptides.txt, hash=27991639), amino_acid='C') on default
[redun] Run    Job 5963b163:  bioinformatics_pipeline_tutorial.lib.plot_count_task(input_count=File(path=/home/runner/work/redun-lamin/redun-lamin/docs/test-redun-lamin/data/hA9U6ZtQqHkzx5a90000.count.tsv, hash=d4998b08)) on default
[redun] Run    Job 70eca016:  bioinformatics_pipeline_tutorial.lib.plot_count_task(input_count=File(path=/home/runner/work/redun-lamin/redun-lamin/docs/test-redun-lamin/data/k4hkSYSa7s2hAhHi0000.count.tsv, hash=5d781eeb)) on default
[redun] Run    Job 93e31252:  bioinformatics_pipeline_tutorial.lib.plot_count_task(input_count=File(path=/home/runner/work/redun-lamin/redun-lamin/docs/test-redun-lamin/data/LOqkdWPP3qu0NKb60000.count.tsv, hash=2d5939a1)) on default
[redun] Run    Job 6c23fe12:  bioinformatics_pipeline_tutorial.lib.plot_count_task(input_count=File(path=/home/runner/work/redun-lamin/redun-lamin/docs/test-redun-lamin/data/9UI3oCBqkiKfwGO10000.count.tsv, hash=4e7f4828)) on default
[redun] Run    Job ef2679b0:  bioinformatics_pipeline_tutorial.lib.get_report_task(input_counts=[File(path=/home/runner/work/redun-lamin/redun-lamin/docs/test-redun-lamin/data/hA9U6ZtQqHkzx5a90000.count.tsv, hash=d4998b08), File(path=/home/runner/work/redun-lamin/redun-lamin/docs/test-redun-l...) on default
[redun] Run    Job 7051ef6a:  bioinformatics_pipeline_tutorial.lib.archive_results_task(inputs_plots=[File(path=/home/runner/work/redun-lamin/redun-lamin/docs/test-redun-lamin/data/hA9U6ZtQqHkzx5a90000.plot.png, hash=b551562c), File(path=/home/runner/work/redun-lamin/redun-lamin/docs/test-redun-la..., input_report=File(path=/home/runner/work/redun-lamin/redun-lamin/docs/test-redun-lamin/data/protein_report.tsv, hash=ac7bac69)) on default
[redun] Run    Job da809bc8:  redun_lamin_fasta.finish(results_archive=File(path=/home/runner/work/redun-lamin/redun-lamin/docs/test-redun-lamin/data/results.tgz, hash=b33841c9)) on default
[redun] 
[redun] | JOB STATUS 2026/05/05 14:00:14
[redun] | TASK                                                        PENDING RUNNING  FAILED  CACHED    DONE   TOTAL
[redun] | 
[redun] | ALL                                                               0       0       0       0      16      16
[redun] | bioinformatics_pipeline_tutorial.lib.archive_results_task         0       0       0       0       1       1
[redun] | bioinformatics_pipeline_tutorial.lib.count_amino_acids_task       0       0       0       0       4       4
[redun] | bioinformatics_pipeline_tutorial.lib.digest_protein_task          0       0       0       0       4       4
[redun] | bioinformatics_pipeline_tutorial.lib.get_report_task              0       0       0       0       1       1
[redun] | bioinformatics_pipeline_tutorial.lib.plot_count_task              0       0       0       0       4       4
[redun] | redun_lamin_fasta.finish                                          0       0       0       0       1       1
[redun] | redun_lamin_fasta.main                                            0       0       0       0       1       1
[redun] 
[redun] 
[redun] Execution duration: 16.71 seconds

View data lineage

artifact = ln.Artifact.get(key="data/results.tgz")
artifact.view_lineage()
Hide code cell output
_images/766f47128129f340ab77fdff3e2c2b33c14a851487402be6179c9cf09f783549.svg
artifact.transform.describe()
Hide code cell output
Transform: workflow.py (0000)
|   description: CLI: redun
├── uid: npwXN8JbIYjp0000                                     
hash: zjZMEeofUMp3FV6fxbgjYg         type: script         
branch: main                         space: all           
created_at: 2026-05-05 13:59:57 UTC  created_by: testuser1
└── source_code: 
    """workflow.py."""
    
    # This code is based on a copy from https://github.com/ricomnl/bioinformatics-pi …
    # Copyright Rico Meinl 2022
    from enum import Enum
    
    import lamindb as ln
    from redun import File, task
    
    import redun_lamin_fasta
    from redun_lamin_fasta.lib import (
        archive_results_task,
        count_amino_acids_task,
        digest_protein_task,
        get_report_task,
        plot_count_task,
    )
    
    redun_namespace = redun_lamin_fasta.__name__
    
    
    class Executor(str, Enum):
        default = "default"
        process = "process"
        batch = "batch"
        batch_debug = "batch_debug"
    
    
    @task()
    def finish(results_archive: File) -> File:
    │ …
artifact.run.describe()
Hide code cell output
Run: AJCH7fV (workflow.py)
├── uid: AJCH7fVhsKcjv9LM                transform: workflow.py (0000)       
started_at: 2026-05-05 13:59:57 UTC  finished_at: 2026-05-05 14:00:14 UTC
status: completed                                                        
branch: main                         space: all                          
created_at: 2026-05-05 13:59:57 UTC  created_by: testuser1               
├── cli_args: 
run workflow.py main --input-dir ./fasta --tag run=test-run
├── report: FqLep2H
→ connected lamindb: testuser1/test-redun-lamin
→ created Transform('npwXN8JbIYjp0000', key='workflow.py'), started new Run('AJC …
• recommendation: to identify the script across renames, pass the uid: ln.track( …
└── environment: 2E9XBLh
    aiobotocore==3.6.0
    aiohappyeyeballs==2.6.1
    aiohttp==3.13.5
    aioitertools==0.13.0
    │ …

Explore the run on the hub

lamin.ai/laminlabs/lamindata/transform/taasWKawCiNA

View the database content

ln.view()
Hide code cell output
Artifact
uid key description suffix kind otype size hash n_files n_observations ... is_latest is_locked created_at branch_id created_on_id space_id storage_id run_id schema_id created_by_id
id
6 tXKcZmE5FdD8SByW0000 data/results.tgz None .tgz None None 94846 hD05koRlKOku6N6FQhzQVQ None None ... True False 2026-05-05 14:00:14.200000+00:00 1 1 1 1 1.0 None 1
4 9UI3oCBqkiKfwGO10000 fasta/KLF4.fasta None .fasta None None 609 LyuoYkWs4SgYcH7P7JLJtA None None ... True False 2026-05-05 13:59:50.385000+00:00 1 1 1 1 NaN None 1
3 LOqkdWPP3qu0NKb60000 fasta/SOX2.fasta None .fasta None None 414 C5q_yaFXGk4SAEpfdqBwnQ None None ... True False 2026-05-05 13:59:50.384000+00:00 1 1 1 1 NaN None 1
2 k4hkSYSa7s2hAhHi0000 fasta/MYC.fasta None .fasta None None 536 WGbEtzPw-3bQEGcngO_pHQ None None ... True False 2026-05-05 13:59:50.384000+00:00 1 1 1 1 NaN None 1
1 hA9U6ZtQqHkzx5a90000 fasta/PO5F1.fasta None .fasta None None 477 -7iJgveFO9ia0wE1bqVu6g None None ... True False 2026-05-05 13:59:50.383000+00:00 1 1 1 1 NaN None 1

5 rows × 21 columns

Run
uid name description entrypoint started_at finished_at params reference reference_type cli_args ... created_at branch_id created_on_id space_id transform_id report_id environment_id plan_id created_by_id initiated_by_run_id
id
1 AJCH7fVhsKcjv9LM None None None 2026-05-05 13:59:57.692115+00:00 2026-05-05 14:00:14.233479+00:00 None None None run workflow.py main --input-dir ./fasta --tag... ... 2026-05-05 13:59:57.693000+00:00 1 1 1 1 7 5 None 1 None

1 rows × 21 columns

Storage
uid root description type region instance_uid is_locked created_at branch_id created_on_id space_id created_by_id run_id
id
1 Q2wir5f0SujZ /home/runner/work/redun-lamin/redun-lamin/docs... None local None iQlBPgD8uaqR False 2026-05-05 13:59:49.073000+00:00 1 1 1 1 None
Transform
uid key description kind source_code hash reference reference_type version_tag is_latest is_locked created_at branch_id created_on_id space_id environment_id plan_id created_by_id
id
1 npwXN8JbIYjp0000 workflow.py CLI: redun script """workflow.py."""\n\n# This code is based on ... zjZMEeofUMp3FV6fxbgjYg None None None True False 2026-05-05 13:59:57.690000+00:00 1 1 1 None None 1

Appendix

Map the redun execution id

If we want to be able to query LaminDB for redun execution ID, this here is a way to get it:

# export the run information from redun
!redun log --exec --exec-tag run=test-run --format json --no-pager > redun_exec.json
# load the redun execution id from the JSON and store it in the LaminDB run record
with open("redun_exec.json") as file:
    redun_exec = json.loads(file.readline())
artifact.run.reference = redun_exec["id"]
artifact.run.reference_type = "redun_id"
artifact.run.save()
Hide code cell output
Run(uid='AJCH7fVhsKcjv9LM', name=None, description=None, entrypoint=None, started_at=2026-05-05 13:59:57 UTC, finished_at=2026-05-05 14:00:14 UTC, params=None, reference='9e4991e3-9c7b-4f57-8f41-fceeb9564c46', reference_type='redun_id', cli_args='run workflow.py main --input-dir ./fasta --tag run=test-run', branch_id=1, created_on_id=1, space_id=1, transform_id=1, report_id=7, environment_id=5, plan_id=None, created_by_id=1, initiated_by_run_id=None, created_at=2026-05-05 13:59:57 UTC, is_locked=False)

Map the redun run report

While lamindb auto-tracks the logs of the main python process you might also want to link the dedicated redun logs:

report = ln.Artifact(
    "run_logs.txt",
    description=f"Redun run report of {redun_exec['id']}",
    run=False,
    kind="__lamindb_run__",  # mark as auxiliary artifact for the run
).save()
artifact.run.report = report
artifact.run.save()
Hide code cell output
Run(uid='AJCH7fVhsKcjv9LM', name=None, description=None, entrypoint=None, started_at=2026-05-05 13:59:57 UTC, finished_at=2026-05-05 14:00:14 UTC, params=None, reference='9e4991e3-9c7b-4f57-8f41-fceeb9564c46', reference_type='redun_id', cli_args='run workflow.py main --input-dir ./fasta --tag run=test-run', branch_id=1, created_on_id=1, space_id=1, transform_id=1, report_id=8, environment_id=5, plan_id=None, created_by_id=1, initiated_by_run_id=None, created_at=2026-05-05 13:59:57 UTC, is_locked=False)