Redun .md .md

Here, we’ll see how to track redun workflows with lamindb.

Amend the workflow

Here is how to instrument a redun workflow for tracking with lamindb:

  1. Add ln.track() to the main() task (see on GitHub)

  2. Register desired output files or folders by creating artifacts for them (see on GitHub):

    ln.Artifact(output_path, key="data/results.tar.gz").save()
    
  3. Add a finish() task that calls ln.finish() (see on GitHub)

  4. Optionally cache/stage input files (see on GitHub)

Why not use @ln.flow() for main()?

Because main() in redun is typically a scheduler/executor task rather than a task that performs the actual computation. ln.flow() would then just track the execution time of scheduling, and not an actual compute run.

If one wanted to use @ln.flow() it’s advisable to wrap the scheduling main() task:

@ln.flow()
def run_pipeline(...):
    scheduler = Scheduler()
    result = scheduler.run(main(...))  # run the main task
    ln.Artifact(result.path, key="data/results.tgz").save()
    return result

Run redun

Let’s see what the input files are:

!ls ./fasta
Hide code cell output
KLF4.fasta  MYC.fasta  PO5F1.fasta  SOX2.fasta

Create a lamindb test instance:

# pip install lamindb redun git+http://github.com/laminlabs/redun-lamin-fasta
!lamin init --storage ./test-redun-lamin
Hide code cell output
 initialized lamindb: testuser1/test-redun-lamin

Register each input file individually as an artifact:

import lamindb as ln
import json

ln.Artifact.from_dir("./fasta").save()
Hide code cell output
 connected lamindb: testuser1/test-redun-lamin
! folder is outside existing storage location, will copy files from ./fasta to /home/runner/work/redun-lamin/redun-lamin/docs/test-redun-lamin/fasta
! no run & transform got linked, call `ln.track()` & re-run
! no run & transform got linked, call `ln.track()` & re-run
! no run & transform got linked, call `ln.track()` & re-run
! no run & transform got linked, call `ln.track()` & re-run
SQLRecordList([Artifact(uid='VP2IcbXMqE9HJFyQ0000', key='fasta/SOX2.fasta', description=None, suffix='.fasta', kind=None, otype=None, size=414, hash='C5q_yaFXGk4SAEpfdqBwnQ', n_files=None, n_observations=None, branch_id=1, created_on_id=1, space_id=1, storage_id=1, run_id=None, schema_id=None, created_by_id=1, created_at=2026-05-26 09:52:24 UTC, is_locked=False, version_tag=None, is_latest=True),
               Artifact(uid='SXk23iDiSKMixy7L0000', key='fasta/MYC.fasta', description=None, suffix='.fasta', kind=None, otype=None, size=536, hash='WGbEtzPw-3bQEGcngO_pHQ', n_files=None, n_observations=None, branch_id=1, created_on_id=1, space_id=1, storage_id=1, run_id=None, schema_id=None, created_by_id=1, created_at=2026-05-26 09:52:24 UTC, is_locked=False, version_tag=None, is_latest=True),
               Artifact(uid='iCAKiuVm0j3GEYQZ0000', key='fasta/PO5F1.fasta', description=None, suffix='.fasta', kind=None, otype=None, size=477, hash='-7iJgveFO9ia0wE1bqVu6g', n_files=None, n_observations=None, branch_id=1, created_on_id=1, space_id=1, storage_id=1, run_id=None, schema_id=None, created_by_id=1, created_at=2026-05-26 09:52:24 UTC, is_locked=False, version_tag=None, is_latest=True),
               Artifact(uid='mFy8dmPlZvgsVbIX0000', key='fasta/KLF4.fasta', description=None, suffix='.fasta', kind=None, otype=None, size=609, hash='LyuoYkWs4SgYcH7P7JLJtA', n_files=None, n_observations=None, branch_id=1, created_on_id=1, space_id=1, storage_id=1, run_id=None, schema_id=None, created_by_id=1, created_at=2026-05-26 09:52:24 UTC, is_locked=False, version_tag=None, is_latest=True)])

Run the redun workflow:

!redun run workflow.py main --input-dir ./fasta --tag run=test-run  1> run_logs.txt 2>run_logs.txt

Inspect the logs:

!cat run_logs.txt
Hide code cell output
 connected lamindb: testuser1/test-redun-lamin
 script invoked with: run workflow.py main --input-dir ./fasta --tag run=test-run
 created Transform('1aJV8PnPoZP70000', key='workflow.py'), started new Run('2dZ1MV7VuJJLQ0qm') at 2026-05-26 09:52:31 UTC
 recommendation: to identify the script across renames, pass the uid: ln.track("1aJV8PnPoZP7")
File(path=/home/runner/work/redun-lamin/redun-lamin/docs/test-redun-lamin/data/results.tgz, hash=ac61a9e6)
_tutorial.lib.digest_protein_task(input_fasta=File(path=/home/runner/work/redun-lamin/redun-lamin/docs/test-redun-lamin/.lamindb/VP2IcbXMqE9HJFyQ0000.fasta, hash=8fc65cd8), enzyme_regex='[KR]', missed_cleavages=0, min_length=4, max_length=75) on default
[redun] Run    Job f977dee9:  bioinformatics_pipeline_tutorial.lib.digest_protein_task(input_fasta=File(path=/home/runner/work/redun-lamin/redun-lamin/docs/test-redun-lamin/.lamindb/SXk23iDiSKMixy7L0000.fasta, hash=1684b760), enzyme_regex='[KR]', missed_cleavages=0, min_length=4, max_length=75) on default
[redun] Run    Job 0074292b:  bioinformatics_pipeline_tutorial.lib.digest_protein_task(input_fasta=File(path=/home/runner/work/redun-lamin/redun-lamin/docs/test-redun-lamin/.lamindb/iCAKiuVm0j3GEYQZ0000.fasta, hash=e9fa08f9), enzyme_regex='[KR]', missed_cleavages=0, min_length=4, max_length=75) on default
[redun] Run    Job a4fa370b:  bioinformatics_pipeline_tutorial.lib.digest_protein_task(input_fasta=File(path=/home/runner/work/redun-lamin/redun-lamin/docs/test-redun-lamin/.lamindb/mFy8dmPlZvgsVbIX0000.fasta, hash=70fe8e5b), enzyme_regex='[KR]', missed_cleavages=0, min_length=4, max_length=75) on default
[redun] Run    Job 281f49a7:  bioinformatics_pipeline_tutorial.lib.count_amino_acids_task(input_fasta=File(path=/home/runner/work/redun-lamin/redun-lamin/docs/test-redun-lamin/.lamindb/VP2IcbXMqE9HJFyQ0000.fasta, hash=8fc65cd8), input_peptides=File(path=/home/runner/work/redun-lamin/redun-lamin/docs/test-redun-lamin/data/VP2IcbXMqE9HJFyQ0000.peptides.txt, hash=80ae9f3e), amino_acid='C') on default
[redun] Run    Job 1230276f:  bioinformatics_pipeline_tutorial.lib.count_amino_acids_task(input_fasta=File(path=/home/runner/work/redun-lamin/redun-lamin/docs/test-redun-lamin/.lamindb/SXk23iDiSKMixy7L0000.fasta, hash=1684b760), input_peptides=File(path=/home/runner/work/redun-lamin/redun-lamin/docs/test-redun-lamin/data/SXk23iDiSKMixy7L0000.peptides.txt, hash=d3daf12f), amino_acid='C') on default
[redun] Run    Job 9b0cde28:  bioinformatics_pipeline_tutorial.lib.count_amino_acids_task(input_fasta=File(path=/home/runner/work/redun-lamin/redun-lamin/docs/test-redun-lamin/.lamindb/iCAKiuVm0j3GEYQZ0000.fasta, hash=e9fa08f9), input_peptides=File(path=/home/runner/work/redun-lamin/redun-lamin/docs/test-redun-lamin/data/iCAKiuVm0j3GEYQZ0000.peptides.txt, hash=51a89167), amino_acid='C') on default
[redun] Run    Job 284e5d83:  bioinformatics_pipeline_tutorial.lib.count_amino_acids_task(input_fasta=File(path=/home/runner/work/redun-lamin/redun-lamin/docs/test-redun-lamin/.lamindb/mFy8dmPlZvgsVbIX0000.fasta, hash=70fe8e5b), input_peptides=File(path=/home/runner/work/redun-lamin/redun-lamin/docs/test-redun-lamin/data/mFy8dmPlZvgsVbIX0000.peptides.txt, hash=08eef679), amino_acid='C') on default
[redun] Run    Job 0968c91b:  bioinformatics_pipeline_tutorial.lib.plot_count_task(input_count=File(path=/home/runner/work/redun-lamin/redun-lamin/docs/test-redun-lamin/data/VP2IcbXMqE9HJFyQ0000.count.tsv, hash=abed6107)) on default
[redun] Run    Job f33d0705:  bioinformatics_pipeline_tutorial.lib.plot_count_task(input_count=File(path=/home/runner/work/redun-lamin/redun-lamin/docs/test-redun-lamin/data/SXk23iDiSKMixy7L0000.count.tsv, hash=86dfa400)) on default
[redun] Run    Job 6529f7a2:  bioinformatics_pipeline_tutorial.lib.plot_count_task(input_count=File(path=/home/runner/work/redun-lamin/redun-lamin/docs/test-redun-lamin/data/iCAKiuVm0j3GEYQZ0000.count.tsv, hash=b5bc6bca)) on default
[redun] Run    Job 7460aab4:  bioinformatics_pipeline_tutorial.lib.plot_count_task(input_count=File(path=/home/runner/work/redun-lamin/redun-lamin/docs/test-redun-lamin/data/mFy8dmPlZvgsVbIX0000.count.tsv, hash=22556de9)) on default
[redun] Run    Job 628b59d1:  bioinformatics_pipeline_tutorial.lib.get_report_task(input_counts=[File(path=/home/runner/work/redun-lamin/redun-lamin/docs/test-redun-lamin/data/VP2IcbXMqE9HJFyQ0000.count.tsv, hash=abed6107), File(path=/home/runner/work/redun-lamin/redun-lamin/docs/test-redun-l...) on default
[redun] Run    Job 1dd6cb58:  bioinformatics_pipeline_tutorial.lib.archive_results_task(inputs_plots=[File(path=/home/runner/work/redun-lamin/redun-lamin/docs/test-redun-lamin/data/VP2IcbXMqE9HJFyQ0000.plot.png, hash=bcfd2a5f), File(path=/home/runner/work/redun-lamin/redun-lamin/docs/test-redun-la..., input_report=File(path=/home/runner/work/redun-lamin/redun-lamin/docs/test-redun-lamin/data/protein_report.tsv, hash=b6ec6640)) on default
[redun] Run    Job 34b84f2f:  redun_lamin_fasta.finish(results_archive=File(path=/home/runner/work/redun-lamin/redun-lamin/docs/test-redun-lamin/data/results.tgz, hash=ac61a9e6)) on default
[redun] 
[redun] | JOB STATUS 2026/05/26 09:52:43
[redun] | TASK                                                        PENDING RUNNING  FAILED  CACHED    DONE   TOTAL
[redun] | 
[redun] | ALL                                                               0       0       0       0      16      16
[redun] | bioinformatics_pipeline_tutorial.lib.archive_results_task         0       0       0       0       1       1
[redun] | bioinformatics_pipeline_tutorial.lib.count_amino_acids_task       0       0       0       0       4       4
[redun] | bioinformatics_pipeline_tutorial.lib.digest_protein_task          0       0       0       0       4       4
[redun] | bioinformatics_pipeline_tutorial.lib.get_report_task              0       0       0       0       1       1
[redun] | bioinformatics_pipeline_tutorial.lib.plot_count_task              0       0       0       0       4       4
[redun] | redun_lamin_fasta.finish                                          0       0       0       0       1       1
[redun] | redun_lamin_fasta.main                                            0       0       0       0       1       1
[redun] 
[redun] 
[redun] Execution duration: 11.58 seconds

View data lineage

artifact = ln.Artifact.get(key="data/results.tgz")
artifact.view_lineage()
Hide code cell output
_images/bff4133b741335b2de7e914009762eb1fa045420dbb6a8ba2554820b38f12c70.svg
artifact.transform.describe()
Hide code cell output
Transform: workflow.py (0000)
|   description: CLI: redun
├── uid: 1aJV8PnPoZP70000                                     
hash: zjZMEeofUMp3FV6fxbgjYg         type: script         
branch: main                         space: all           
created_at: 2026-05-26 09:52:31 UTC  created_by: testuser1
└── source_code: 
    """workflow.py."""
    
    # This code is based on a copy from https://github.com/ricomnl/bioinformatics-pi …
    # Copyright Rico Meinl 2022
    from enum import Enum
    
    import lamindb as ln
    from redun import File, task
    
    import redun_lamin_fasta
    from redun_lamin_fasta.lib import (
        archive_results_task,
        count_amino_acids_task,
        digest_protein_task,
        get_report_task,
        plot_count_task,
    )
    
    redun_namespace = redun_lamin_fasta.__name__
    
    
    class Executor(str, Enum):
        default = "default"
        process = "process"
        batch = "batch"
        batch_debug = "batch_debug"
    
    
    @task()
    def finish(results_archive: File) -> File:
    │ …
artifact.run.describe()
Hide code cell output
Run: 2dZ1MV7 (workflow.py)
├── uid: 2dZ1MV7VuJJLQ0qm                transform: workflow.py (0000)       
started_at: 2026-05-26 09:52:31 UTC  finished_at: 2026-05-26 09:52:43 UTC
status: completed                                                        
branch: main                         space: all                          
created_at: 2026-05-26 09:52:31 UTC  created_by: testuser1               
├── cli_args: 
run workflow.py main --input-dir ./fasta --tag run=test-run
├── report: oAtg6DF
→ connected lamindb: testuser1/test-redun-lamin
→ created Transform('1aJV8PnPoZP70000', key='workflow.py'), started new Run('2dZ …
• recommendation: to identify the script across renames, pass the uid: ln.track( …
└── environment: Pdp0ziC
    aiobotocore==3.7.0
    aiohappyeyeballs==2.6.2
    aiohttp==3.13.5
    aioitertools==0.13.0
    │ …

Explore the run on the hub

lamin.ai/laminlabs/lamindata/transform/taasWKawCiNA

View the database content

ln.view()
Hide code cell output
Artifact
uid key description suffix kind otype size hash n_files n_observations ... is_latest is_locked created_at branch_id created_on_id space_id storage_id run_id schema_id created_by_id
id
6 ICviUUaHazXucpt00000 data/results.tgz None .tgz None None 94355 KuFRbl-HXdjOF6a6LptSTQ None None ... True False 2026-05-26 09:52:43.093000+00:00 1 1 1 1 1.0 None 1
4 mFy8dmPlZvgsVbIX0000 fasta/KLF4.fasta None .fasta None None 609 LyuoYkWs4SgYcH7P7JLJtA None None ... True False 2026-05-26 09:52:24.433000+00:00 1 1 1 1 NaN None 1
3 iCAKiuVm0j3GEYQZ0000 fasta/PO5F1.fasta None .fasta None None 477 -7iJgveFO9ia0wE1bqVu6g None None ... True False 2026-05-26 09:52:24.432000+00:00 1 1 1 1 NaN None 1
2 SXk23iDiSKMixy7L0000 fasta/MYC.fasta None .fasta None None 536 WGbEtzPw-3bQEGcngO_pHQ None None ... True False 2026-05-26 09:52:24.432000+00:00 1 1 1 1 NaN None 1
1 VP2IcbXMqE9HJFyQ0000 fasta/SOX2.fasta None .fasta None None 414 C5q_yaFXGk4SAEpfdqBwnQ None None ... True False 2026-05-26 09:52:24.431000+00:00 1 1 1 1 NaN None 1

5 rows × 21 columns

Run
uid name description entrypoint started_at finished_at params reference reference_type cli_args ... created_at branch_id created_on_id space_id transform_id report_id environment_id plan_id created_by_id initiated_by_run_id
id
1 2dZ1MV7VuJJLQ0qm None None None 2026-05-26 09:52:31.716689+00:00 2026-05-26 09:52:43.120272+00:00 None None None run workflow.py main --input-dir ./fasta --tag... ... 2026-05-26 09:52:31.717000+00:00 1 1 1 1 7 5 None 1 None

1 rows × 21 columns

Storage
uid root description type region instance_uid is_locked created_at branch_id created_on_id space_id created_by_id run_id
id
1 ehvHoUeYWs2Y /home/runner/work/redun-lamin/redun-lamin/docs... None local None iQlBPgD8uaqR False 2026-05-26 09:52:23.114000+00:00 1 1 1 1 None
Transform
uid key description kind source_code hash reference reference_type version_tag is_latest is_locked created_at branch_id created_on_id space_id environment_id plan_id created_by_id
id
1 1aJV8PnPoZP70000 workflow.py CLI: redun script """workflow.py."""\n\n# This code is based on ... zjZMEeofUMp3FV6fxbgjYg None None None True False 2026-05-26 09:52:31.714000+00:00 1 1 1 None None 1

Appendix

Map the redun execution id

If we want to be able to query LaminDB for redun execution ID, this here is a way to get it:

# export the run information from redun
!redun log --exec --exec-tag run=test-run --format json --no-pager > redun_exec.json
# load the redun execution id from the JSON and store it in the LaminDB run record
with open("redun_exec.json") as file:
    redun_exec = json.loads(file.readline())
artifact.run.reference = redun_exec["id"]
artifact.run.reference_type = "redun_id"
artifact.run.save()
Hide code cell output
Run(uid='2dZ1MV7VuJJLQ0qm', name=None, description=None, entrypoint=None, started_at=2026-05-26 09:52:31 UTC, finished_at=2026-05-26 09:52:43 UTC, params=None, reference='86c585cd-ded0-43bc-9cf4-22cf04467d75', reference_type='redun_id', cli_args='run workflow.py main --input-dir ./fasta --tag run=test-run', branch_id=1, created_on_id=1, space_id=1, transform_id=1, report_id=7, environment_id=5, plan_id=None, created_by_id=1, initiated_by_run_id=None, created_at=2026-05-26 09:52:31 UTC, is_locked=False)

Map the redun run report

While lamindb auto-tracks the logs of the main python process you might also want to link the dedicated redun logs:

report = ln.Artifact(
    "run_logs.txt",
    description=f"Redun run report of {redun_exec['id']}",
    run=False,
    kind="__lamindb_run__",  # mark as auxiliary artifact for the run
).save()
artifact.run.report = report
artifact.run.save()
Hide code cell output
Run(uid='2dZ1MV7VuJJLQ0qm', name=None, description=None, entrypoint=None, started_at=2026-05-26 09:52:31 UTC, finished_at=2026-05-26 09:52:43 UTC, params=None, reference='86c585cd-ded0-43bc-9cf4-22cf04467d75', reference_type='redun_id', cli_args='run workflow.py main --input-dir ./fasta --tag run=test-run', branch_id=1, created_on_id=1, space_id=1, transform_id=1, report_id=8, environment_id=5, plan_id=None, created_by_id=1, initiated_by_run_id=None, created_at=2026-05-26 09:52:31 UTC, is_locked=False)