Changelog 2024ΒΆ
Note
π‘ LaminDB implements βmigration-based versioningβ. When upgrading your LaminDB installation to a new minor
version in major.minor.patch
, you also migrate your database via lamin migrate deploy
.
Get notified by watching releases for git repositories: lamindb, laminhub, laminr, and bionty.
2024-12-01 db 0.77.3ΒΆ
π¨ Better names for sections in
.describe()
PR1 PR2 @sunnyosunπΈ Improve feedback on not-up-to-date notebook content PR @falexwolf
π Fix
curator.validate()
from public PR @sunnyosunπ Fix very long runtimes for Artifact.describe PR @Koncopd
π Fix organism in curator.standardize PR @sunnyosun
π Better error behavior for repeated calls of
standardize
PR @Zethsonπ Fix the error on special chars in search strings PR @Koncopd
π Fix display of (non-categorical)
str
features in.describe()
PR @sunnyosunπ Fix the error on existing cache on copy to cache in
Artifact.save()
PR @Koncopd
2024-12-02 R 0.3.0ΒΆ
2024-12-01 db 0.77.2ΒΆ
πΈ A more intuitive
artifact.describe()
PR @sunnyosun⨠Enable to easily join features onto artifacts via
Artifact.df()
PR @falexwolf⨠Support features with
dtype = 'str'
PR @falexwolf⨠Support features with
dtype = 'datetime'
and improve feature values handling PR @falexwolfπ¨ Let
.from_values()
returnRecordList
and better treat categorical PR @falexwolfπ¨ Add
.standardize()
toCurator
PR @sunnyosunπ¨ Make search in
bionty.base
consistent with lamindb PR @Koncopd
2024-11-25 hub 0.31ΒΆ
πΈ Improve speed and relevance search. Make search consistent with lamindb
0.77. PR @Koncopd @fredericenard
Details
Since lamindb
0.77 and laminhub
0.31 search isnβt fuzzy anymore. This leads to much more predictable and relevant results similar to what users know from GitHub, Slack and similar tools.
Search results for lamindb
and bionty
are now exemplified here faq/search.
More changes.
β‘ Faster loading speed on launch @awgaan
π Fixed incorrect sorting in the version selector @chaichontat
2024-11-21 R 0.2.0ΒΆ
β¨ Read and write data with LaminR, an R client for LaminDB. PR @rcannood @lazappi
install.packages("laminr", dependencies = TRUE) # install the laminr package from CRAN
library(laminr)
db <- connect() # connect to the instance you configured on the terminal
db$track(path = "./my-analysis.Rmd") # track a run of your notebook or script
artifact <- db$Artifact$get("3TNCsZZcnIBv2WGb0001") # get an artifact record by uid
df <- artifact$load() # load the artifact into memory, e.g., a DataFrame
# do your work
db$Artifact.from_path("./my_result_folder", description="My result").save() # save a folder
db$finish() # mark the run finished
2024-11-21 db 0.77.0 | bionty 0.53ΒΆ
β¨ Enable validation of Literal
and other field types. PR @sunnyosun
A
Literal
-typedCharField
is validated for anyRecord
:from lnschema_core import Record, fields CRISPRType = Literal[ "CRISPRi", "CRISPRa", ] class Treatment(Record): system: CRISPRType = fields.CharField() Treatment(system="crispr") #> FieldValidationError: #> system: crispr is not a valid value #> β valid values are: CRISPRa, CRISPRi
For custom field types, use Djangoβs rich field validation and subclass
ValidateFields
:from lnschema_core import Record, ValidateFields from django.core.validators import RegexValidator class Reference(Record, ValidateFields): url: str = fields.URLField() doi: str = fields.CharField( validators=[ RegexValidator( regex=r"^(?:https?://(?:dx\.)?doi\.org/|doi:|DOI:)?10\.\d+/.*$", message="Must be a DOI (10.1000/xyz123 or https://doi.org/10.1000/xyz123)", ) ], ) Reference(doi="abc.ef", url="myurl.com") #> FieldValidationError: #> url: myurl.com is not valid #> β Enter a valid URL. #> doi: abc.ef is not valid #> β Must be a DOI (10.1000/xyz123 or https://doi.org/10.1000/xyz123)
These possibilities are now leveraged in all schema modules.
More concise registry definitions: lnschema-core findrefs ourprojects clinicore cellregistry @sunnyosun
Squash migrations for faster instance creation: lnschema-core bionty wetlab @sunnyosun
π± Overhauled Nextflow integration. PR @Zethson
The registration script now leverages standardized
nf-core
metadata pathsExample use case is now more comprehensive and based on
nf-core/scrnaseq
Other enhancements.
πΈ Mark
.qmd
&.Rmd
files as notebooks, not scripts PR @falexwolfπΈ Suppress hf filesystem warning due to not being explicitly implemented in upath PR @Koncopd
2024-11-15 db 0.76.16ΒΆ
New features.
β¨ Support saving R code including
.qmd
and.Rmd
PR @falexwolf⨠Support registering artifacts on Hugging Face PR @Koncopd
Other enhancements.
πΈ Refactor
ln.track()
to improve logging and method signature PR @falexwolfπΈ Enable to query with records from a different database instance PR @falexwolf
πΈ Enable autocompletion for inherited methods in Jupyter PR @Koncopd
πΈ Add
pip install
extras for all schema modules PR @Zethson
Fixes.
π Fix transfering artifacts from a source instance with fewer schema modules PR @sunnyosun
π Fix registering Gene columns in
DataFrameCurator
PR @sunnyosun
Deprecations.
2024-11-13 hub 0.30ΒΆ
β¨ A link that omits the last 4 version-coding characters now links to the latest version of a record, e.g., 13VINnFk89PE β 13VINnFk89PE0006. PR @chaichontat
πΈ Filter selectors are more intuitive. PR @chaichontat
β¨ Plots can now visualize time series. PR @chaichontat @Golodhros
Further enhancements.
β¨ Table columns are now resizable PR @chaichontat @awgaan
β¨ UID copy button are now in custom tables PR @chaichontat
πΈ Invalid filter triggers error PR @chaichontat
Fixes.
π Fix intermittent page reloading PR @chaichontat
2024-10-29 db 0.76.15ΒΆ
β¨ Stream pyarrow.dataset
via Artifact.open()
PR @Koncopd
df = pd.DataFrame({"col1": [0, 0, 1, 1], "col2": [6, 7, 8, 9]})
df.to_parquet("df.parquet", engine="pyarrow", partition_cols=["feat1"])
artifact = ln.Artifact("df.parquet", description="A partitioned parquet").save()
with artifact.open() as dataset: # get pyarrow.dataset.Dataset
batches = dataset.to_batches() # get a streaming iterator over batches
dataset.to_table().to_pandas() # read into memory and convert to pandas Dataframe
β¨ Add .query_parents()
and .query_children()
to hierachical registries PR @Koncopd
import lamindb as ln
label1 = ln.ULabel(name="label1").save()
label2 = ln.ULabel(name="label2").save()
label3 = ln.ULabel(name="label3").save()
label1.children.add(label2)
label2.children.add(label3)
label3.query_parents() # returns a QuerySet with label1 and label2
label1.query_children() # returns a QuerySet with label2 and label3
More changes:
2024-10-18 db 0.76.14 | bionty 0.52ΒΆ
Features.
β¨ Safe handling of renaming internal features and labels PR @sunnyosun
β¨ Enable curating multiple categorical features per artifact against the same label registry PR @falexwolf
User experience.
πΈ Automate
Curator.add_validated_from
and remove it from the API Breaking PR @sunnyosunπΈ Remove unnecessary
BioRecord.list_source()
Breaking PR @sunnyosun
Bionty changes.
π¨ Add unique constraints and fix
gene_ref_is_symbol
tolabel_ref_is_name
PR @falexwolfπ¨ Remove update in
add_ontology_from_df()
PR @sunnyosun
Fixes.
π Fix loading artifact by key and give clear errors if no artifacts or transforms found PR @Koncopd
Deprecations.
2024-10-14 hub 0.29ΒΆ
β¨ Advanced filters. @chaichontat @adamdev21
β¨ Filter on click. @chaichontat
β¨ Links to filtered artifacts in record tables. @chaichontat
β¨ Links to artifacts in Features dashboard. @chaichontat
π Prettify plotting panels on Overview
page PR @chaichontat
Enhancements.
β¨ Make filtered views sharable by coding filters in the URL @chaichontat
π Prevent duplicates from showing up in filters @chaichontat
β¨ Add
created_by
filtering to transform list @chaichontat⨠Version warning banner @sunnyosun
π Use backend to search selectors @chaichontat
πΈ Allow hyphens in handles @chaichontat
π Simplify text on landing page PR @falexwolf
π Show instance description on user dashboard @adamdev21
β¨ Group ULabels by Features in detailed artifact view @chaichontat
π Remove ID from field sort hierarchy @chaichontat
πΈ Rename
Data
tab toArtifacts
and sort filters @chaichontat⨠Enable to add a schema module via the UI @adamdev21 @chaichontat @sunnyosun
Fixes.
π Do not display N/A fields in hover card
π Fix duplicated alias error with filters
π Fix collection copy button
π Deduplicate transform references in βOutputsβ section
π Fix hang when loading transform
π Fix transform pane collapsing behavior
π Fix flicker
Deprecations.
πΈ Beta API key is now stable; deprecate legacy API key
2024-10-11 db 0.76.13ΒΆ
πΈ Do not error if a schema module of an instance isnβt installed PR @falexwolf
πΈ More decent failing upon invalid
lamin init
calls PR @KoncopdπΈ Clearer feedback when somebody tries to switch the default instance PR @falexwolf
π Fix
.get()
with.using()
PR @sunnyosunπ For
.view_lineage()
for circular input/outputs PR @sunnyosunπ Fix transferring to target instances that have more schema modules than the source instance PR @sunnyosun
π Fix monkey-patching
__getitem__
onQuerySet
PR @sunnyosunπ Do not double track runs PR @falexwolf
2024-10-08 db 0.76.12 | bionty 0.51ΒΆ
β¨ Overhaul
save_vitessce_config()
to support multiple artifacts and non-.zarr
PR @keller-markπΈ Query with typed labels through
.features
PR @falexwolfπ Document how to query by dictionary-like run parameters PR @falexwolf
π Fix
lamin load ... --with-env
across servers PR @Koncopd
2024-10-02 hub 0.28ΒΆ
Major improvements.
β¨ SSO login + new API key @fredericenard @Koncopd @chaichontat @sunnyosun
π Display
lamin connect
on theOverview
andlamin load
on theTransform
page @chaichontatπ Overhauled
Features
page @chaichontatπ A
ULabels
page @adamdev21 PRπΈ Show only latest versions of versioned entities & add a version drop down to
Artifact
&Collection
in analogy toTransform
@adamdev21 @chaichontat PRπ Hover card for
ULabel
annotations @adamdev21β¨ Overhauled
Settings
page and enable to change handle & name @chaichontat
Minor changes.
π Fix
ulabel
filtering on Artifacts page @adamdev21π Fix collection page @sunnyosun
π Do not re-encode vitessce url if itβs non-lamin @sunnyosun
πΈ A friendly message when a user still drafts a transform and hasnβt provided source code or run report @sunnyosun
πΈ Use
key
as a download name for a transform @sunnyosunπΈ Only show linked labels in filters @adamdev21
π Order transforms & runs by
created_at
@sunnyosunπ On
Overview
page, show data formats, feature sets, and artifactsize
andcounts
@chaichontat⨠Support
refSpecUrl
inVitessceConfig
@sunnyosun
2024-10-01 db 0.76.11ΒΆ
β¨ Add a reference manager schema module:
findrefs
PR @falexwolfπ Fix label name display in
.describe()
PR @falexwolfπ Fix permission error when saving a new artifact on python 3.12 PR @Koncopd
π Strip
NotebookNotSaved
error from report afterln.finish()
PR @Koncopd
2024-09-30 db 0.76.10ΒΆ
πΈ Re-worked the CLI:
lamin load
βlamin connect
&lamin get
βlamin load
PR @Koncopd @apββ‘ Improve performance of
ln.connect()
,lamin connect
, andlamin load
for a notebook PR @falexwolfβ‘οΈ Speed up curation workflows through
from_values
PR @sunnyosunπΈ Improve
lamin load
UX for notebooks & scripts PR @falexwolfπΈ Transfer: Warn about inconsistencies between source & target instances PR @falexwolf
π¨ Move
.from_values()
fromRecord
toCanValidate
PR @falexwolfπ Document how to work with run parameters PR @falexwolf
β¨ Track transfers as transforms PR @falexwolf
β‘οΈ Speed up describe PR @sunnyosun
πΈ Minimal
ln.track()
PR @falexwolf⨠Enable to
lamin load
from on-prem domains PR @falexwolf
2024-09-26 db 0.76.9ΒΆ
β¨ Allow filtering
.obs
indices inMappedCollection
Guide PR @Koncopdπ Document notebook templates Guide PR @falexwolf
Curating perturbations
Genetic, compound, and environmental perturbations and their targets can be curated with the wetlab
schema.
This can be achieved at varying levels of detail. For example, with a high degree of detail:
import wetlab as wl
EGFR_kd = wl.GeneticTreatment(
system="CRISPR Cas9",
name="EGFR knockdown",
sequence="AGCTGACCGTGA",
on_target_score=85,
off_target_score=15
).save()
EGFR_gene = bt.Gene.from_source(symbol="EGFR").save()
EGFR_kd_target = wl.TreatmentTarget(name="cell growth").save()
EGFR_kd_target.genes.add(EGFR_gene)
artifact.genetic_treatmends.add(EGFR_kd)
2024-09-23 db 0.76.8ΒΆ
π Ensure
is_latest
is set toFalse
in previous version if matching onartifact.key
PR @falexwolf⨠Store artifacts under their virtual keys in cache PR @Koncopd
2024-09-18 db 0.76.7 | bionty 0.50ΒΆ
β¨ Enable getting the latest run environment for a transform PR @falexwolf
β¨ Enable displaying images via
artifact.load()
, add documentation for artifact loaders PR @falexwolfπΈ Do not throw an error but prompt upon
ln.context.track()
in a notebook PR @falexwolfπΈ Allow to use
Collection.mapped()
without saving the collection PR @KoncopdπΈ Simplify CLI commands PR @falexwolf
Technical changes
Ontology versions
2024-09-09 db 0.76.6 | bionty 0.50ΒΆ
β¨ Enable negations in
filter()
PR @falexwolfβ¨
lamin get
viakey
oruid
PR @falexwolfπ¨ Replace direct relation of
Collection
toFeatureSet
with indirect relation throughArtifact
PR @falexwolfπ¨ Remove backward relationships for
Run
,User
&Source
foreign keys PR PR @falexwolf @sunnyosunπ Reload in
Transform()
upon passing existinguid
PR @falexwolf
2024-09-05 db 0.76.5ΒΆ
π Fix
lamin save
for source code updates PR @falexwolfπ Allow login with Beta API key via CLI PR @fredericenard @Koncopd
2024-09-04 db 0.76.4ΒΆ
πΈ Make source code searchable Details & PR @falexwolf @chaichontat
π Fix marking latest transform as
is_latest
PR @falexwolf⨠Support lists as values in validation PR @sunnyosun
2024-08-30 db 0.76.3 | bionty 0.49ΒΆ
β¨ tiledbsoma
integration. Guide PR @Koncopd
Example
Create a tiledbsoma.Experiment
array store or append AnnData
objects to an existing store.
# create new versioned tiledbsoma.Experiment
artifact = ln.integrations.save_tiledbsoma_experiment(
adatas,
measurement_name="RNA"
)
# append to existing tiledbsoma.Experiment
revised_artifact = ln.integrations.save_tiledbsoma_experiment(
adatas,
measurement_name="RNA",
revises=artifact
)
Bionty updates.
More changes.
π Deprecate
Curate
in favor ofCurator
PR @sunnyosunπ₯ Remove
lamin register
and password argument oflamin login
PR PR @falexwolf
2024-08-26 hub 0.27ΒΆ
β¨ Instance overview page. @chaichontat
Screenshot
π Show Vitessce button next to dataset instead of VitessceConfig
file. @sunnyosun
ποΈ Much improved on-prem deployment. @fredericenard
2024-08-23 db 0.76.2ΒΆ
πΈ Simplify versioning. PR @falexwolf
Semantic version strings in .version
are now optional as in git.
For Artifact
& Transform
, you can now also create new versions by passing the key
argument.
artifact_v1 = ln.Artifact.from_df(df, key="my_datasets/my_study1.parquet").save()
# below automatically creates a new version of artifact_v1 because the `key` matches
artifact_v2 = ln.Artifact.from_df(df_updated, key="my_datasets/my_study1.parquet").save()
π Deprecate
is_new_version_of
argument in favor ofrevises
π Deprecate passing
version
to constructors; rather set.version
after creating records
More changes.
π Simpler icons PR @sunnyosun
2024-08-16 db 0.76.1ΒΆ
πΈ Overhauled context tracking experience with ln.context.track()
Details & PR @falexwolf @chaichontat
ln.context.uid = "FPnfDtJz8qbE0000" # <-- auto-generated by ln.context.track()
# track the execution of your notebook or script with inputs & outputs
ln.context.track()
What was the previous experience?
Now:
Previously:
How does it look on the hub?
If you donβt label with a semantic version tag, youβll get an auto-generated revision id.
β οΈ Breaking change: ln.track()
now returns None
instead of a Run
. Access the run via ln.context.run
instead.
More changes:
2024-08-14 db 0.76ΒΆ
β‘οΈ Add boolean field
IsVersioned.is_latest
to speed up queries for latest versions PR @falexwolf @fredericenard @chaichontatπ Fix length limitation for external hash parsing PR @falexwolf
β‘οΈ Speed up
BioRecord.import_from_source
PR @sunnyosunπ¨ Make more backward accessors private and type the rest PR @falexwolf
2024-08-10 hub 0.26ΒΆ
πΈ Backend search & filter via multiple relationships @chaichontat @fredericenard
β¨ Login via Microsoft @fredericenard
π Details in hover cards @chaichontat
2024-08-08 db 0.75.1ΒΆ
πΈ Improved the cellxgene_lamin
curation guide.
π Fixes for
lamindb.Curate
PR PR @sunnyosun⨠Allow excluding values in
Curate.validate
PR @sunnyosun
2024-08-08 bionty 0.48ΒΆ
New ontology versions.
2024-08-03 db 0.75ΒΆ
β¨ Track mutations of array stores. Guide PR @Koncopd
Artifacts that store mutable arrays can lead to non-reproducible queries.
To monitor reproduciblity and data lineage, mutations are now tracked when a context manager and
Artifact.open(mode="w")
fortiledbsoma
array stores is used:with artifact.open(mode="w") as array: # mutate `artifact` # `artifact` now points to a new version of the artifact with an updated hash
πΈ A better structured API. PR @falexwolf
πΈ Easier typing & maintenance of categorical fields via
typing.Literal
instead of Djangoβs migration-dependentCharField.choices
πΈ Less clutter in auto-complete
π All fields pointing to link records start with
links_
π Several fields for
Artifact
are now private via_
prefix:accessor
,key_is_virtual
,feature_values
,param_values
,hash_type
,previous_runs
π¨ More consistency
π Rename
Transform.parents
toTransform.predecessors
to disambiguate procedural/temporal from ontological/conceptual hierachiesπ¨ Feature names are now guaranteed to be unique in a lamindb instance
Feature.name
π¨ Consistent length of hash fields:
HASH_LENGTH=22
π Rename
input_of
toinput_of_runs
π¨
Transform.latest_report
is now a property point toTransform.latest_run.report
to simplify the schemaπ¨
Artifact.type
now defaults toNone
when passing apath
so that auxiliary files and folders arenβt labeled asdataset
πΈ Better definition of
Collection
π Rename fields
.artifact
to.meta_artifact
and.unordered_artifacts
to.artifacts
Iteration over an ordered
QuerySet
of artifacts is now possible via.ordered_artifacts
For collections that have a single data artifact, access it via
.data_artifact
ποΈ Towards searchable source code
π Rename
Transform.source_code
toTransform._source_code_artifact
Re-introduce
Transform.source_code
as a text field together with a fieldhash
Better storage management.
πΈ Enable deleting artifacts in all managed storage locations of the current instance PR @falexwolf
β»οΈ Do not write storage records to hub for local test instances PR @falexwolf
π Fix populating
storage.instance_uid
duringinit_instance
PR @falexwolf
Various updates.
2024-08-03 bionty 0.47ΒΆ
ποΈ Bionty is now a single Python package. PR PR PR PR PR
ποΈ
lnschema-bionty
andbionty-base
are integrated intobionty
πΈ Considerably simpler UX: see Manage biological registries PR @sunnyosun
β οΈ Migration: Once you load an instance, youβll be asked to uninstall lnschema_bionty
and lamin migrate deploy
On the SQL level, tables are now prefixed with
bionty_
instead oflnschema_bionty_
On the Django level, you can mount the
bionty
instead of thelnschema_bionty
apps
πΈ You can now import from in-house ontology sources. PR @sunnyosun
π Rename
PublicSource
toSource
&from_public
tofrom_source
Import from any parquet file into your registry, akin to how Bionty imports public ontology sources
User experience.
β‘ Performantly import bulk records via
.import_from_source()
πΈ More reliable
ontology_id
field recognition⨠Better error message for synonym duplications PR @Zethson
π¨
CellMarker.name
is now unique together withorganism
PR sunnyosun
New ontologies.
2024-07-26 hub 0.25ΒΆ
Overhauled the REST API: better performance and architecture.
β‘ Optimize query builder @fredericenard
β¨ GroupBy endpoint @fredericenard
β»οΈ Improved API schema @fredericenard @chaichontat
UI improvements.
π Add details in hover card @chaichontat
π Stop settings from flickering @chaichontat
2024-07-26 db 0.74.3ΒΆ
β‘ Speed up populating parent records by an order of magnitude, remove the parents
keyword (PR @sunnyosun).
Features.
β¨ Allow for multiple local storage locations with the same root path PR @falexwolf
β¨ Add
add_from_df
method toBioRecord
PR @sunnyosun
Chores.
β¬οΈ Upgrade to pydantic v2 PR @falexwolf
2024-07-22 db 0.74.2ΒΆ
The API is now cleaner and fields are typed.
Details
All users who donβt use Django outside of lamindb can set Djangoβs internal API that clutters the Record
name spaces by running: lamin set private-django-api
on the command line.
πΈ Cleaner API PR @falexwolf @Koncopd
β¨ Add global
private-django-api
setting PR @falexwolfπ·οΈ Add types to fields PR @falexwolf
tiledbsoma
is now better supported.
Better names.
π Deprecate
Artifact.backed()
in favor ofArtifact.open()
PR @Koncopdπ Deprecate
Annotate
in favor ofCurate
PR @falexwolfπ Deprecate
Registry
in favor ofRecord
PR @falexwolf
Better documentation.
π Improve the curation guide PR PR @sunnyosun @falexwolf
π Improve the CLI docs PR @falexwolf
Security updates & bug fixes.
π Enable Ruff security rules (bandit) & CodeQL PR @Zethson
π Fix return values of
.save()
for a few classes PR @falexwolf
2024-07-01 hub 0.24ΒΆ
β¨ Add a checkbox for instance setting
keep-artifacts-local
PR @chaichontat @sunnyosun⨠New endpoint:
create-instance
PR @fredericenardπ More detailed feature view PR @chaichontat
β¨ Image preview PR @chaichontat
β¨ Artifact backlinks PR @chaichontat
2024-06-26 db 0.74.1ΒΆ
β»οΈ Refactor ln.settings
PR @falexwolf.
β¨ Pass custom names for scripts via
ln.settings.transform.name = "My script"
β οΈ
ln.settings.storage
returns aStorageSettings
object (root vialn.settings.storage.root
)
Features.
β¨ Support different join types in
QuerySet.df()
PR @insavchuk
Use cases.
π First version of tiledbsoma guide PR Koncopd
π First version of wandb guide PR felix0097
Docs.
π Update hub screenshots PR @sunnyosun
2024-06-20 db 0.74ΒΆ
β¨ You can now distinguish model-like and dataset-like artifacts via a type
field in the Artifact
registry.
πΈ Leverage
artifact.params.add_values()
to annotate model-like artifacts like you leverageartifact.features.add_values()
to annotate dataset-like artifactsποΈ Add
type
field toArtifact
, allow linking model-like artifacts against params, validate params akin to validating features, enable features-based annotation with non-ulabels PR @falexwolf
2024-06-13 db 0.73.2ΒΆ
π Fix clashing reverse accessors for
.previous_runs
and.run
PR @falexwolf
2024-06-05 db 0.73.1ΒΆ
ποΈ Instantly synchronize instance schema with the hub PR @fredericenard
π Fix generation of
uid
for manual Transform constructor PR @falexwolfπ₯ Deleting
artifact.stage()
in favor ofartifact.cache()
(was deprecated in 0.70.0)
2024-05-29 db 0.73.0ΒΆ
Annotating & querying by features improved:
β¨ Support non-categorical feature values PR @falexwolf
β¨ Annotate dict-style with features & values PR @falexwolf
β¨ Query by features via
.features.filter(key=value)
PR @falexwolfποΈ Feature values decoupled from feature sets PR @falexwolf
Example:
# annotate dict-style (feature & category names get validated)
artifact.features.add_values({
"species": "setosa",
"scientist": ["Barbara McClintock", "Edgar Anderson"],
"instrument": "Leica IIIc Camera",
"temperature": 27.6,
"study": "Study 0: initial plant gathering",
"is_awesome": True
})
# get the dict back
artifact.features.get_values()
# query by feature
ln.Artifact.features.filter(is_awesome=True)
Various improvements:
π Additional non-breaking constraints in the core schema PR @falexwolf
πΈ Make
.upload_from()
,.download_to()
, and.view_tree()
more user friendly PR @falexwolf PR @KoncopdπΈ More intuitive version updating dialogue PR @falexwolf
π Actually add tracking run for entities beyond Artifact & Collection PR @falexwolf
πΈ
ln.track()
returnsrun
PR @falexwolfπΈ Better duplicate detection and search PR @falexwolf
πΈ Prettier
.describe()
PR @falexwolfπΈ More interactivity in
lamin save
PR @falexwolfπΈ
create
flag in.from_values()
PR @falexwolfπΈ Better ordering of fields in dataframe & record representations PR @falexwolf
π Improved API reference: docs now show relationship attributes PR @falexwolf
2024-05-19 db 0.72.1ΒΆ
β¬οΈ Update bionty PR @sunnyosun
π Deal with migration errors when keep-artifacts-local is true PR @falexwolf
2024-05-19 db 0.72.0ΒΆ
β¨ Extend managed access for AWS S3 to arbitrary paths PR @Koncopd @fredericenard
β¨ Extended data lineage tracking PR @falexwolf
Now store all creating runs and all updating runs for any entity, not just for
Artifact
&Collection
, e.g., runs can now haveCellType
record outputsCode is simpler through inheritance from two new base classes:
TracksRun
andTracksUpdates
β»οΈ Briefer and richer syntax for denoting feature types, renamed
Feature.type
toFeature.dtype
, e.g., for categorical features, a valid type can be:cat[ULabel|bionty.Drug]
PR @falexwolf⨠Support non-categorical metadata PR @falexwolf
Track non-categorical features:
int
,float
,bool
,datetime
, lists & dictionaries stored in aFeatureValue
registryTrack arbitrary typed parameters for runs through a
Param
registry analogous to theFeature
registry: this replaces the hard-to-validate, hard-to-migrate, and hard-to-queryjson
field ofRun
ποΈ Refactor link models PR PR @falexwolf
All annotation-related links are now stratified by
Feature
: what held forULabel
now also holdsCellType
and all otherBionty
registriesIndicate whether semantic keys were used during validation to enable warnings upon renames
Protect artifact annotations rather than cascade delete them
More consistent naming of link models, e.g.,
ulabels.artifact_links
instead ofulabels.artifactulabel_set
Dropped linking
Bionty
entities directly againstCollection
Pruned & squashed migrations for faster instance creation
2024-05-14 db 0.71.3ΒΆ
π¨ Enable transfer when schema donβt match PR @sunnyosun
β¨ Get artifacts through the CLI PR @falexwolf
β‘οΈ Improve the speed of describe PR @sunnyosun
β»οΈ Refactor search PR @falexwolf
β¨ Introduce bulk update PR @falexwolf
πΈ No need to pass organism if validating on ids PR @sunnyosun
2024-05-07 db 0.71.2ΒΆ
β¨ Enable passing parameters to
ln.track()
PR @falexwolf
2024-05-07 db 0.71.1ΒΆ
πΈ Upload source code of scripts upon
ln.finish()
and no longer uponln.track()
PR @falexwolfπ¨ Make
features.add_feature_set
public PR @sunnyosunπ¨ Use the same
uid
for the same feature set in transfer PR @sunnyosunπ¨ Upon upload switch to virtual key PR @falexwolf
β»οΈ Extend valid suffixes to composite suffixes PR @falexwolf
π₯ Remove little-used
artifact.view_tree()
PR @falexwolf
2024-05-01 db 0.71.0ΒΆ
β¨ Manage multiple storage locations with integrity PR @falexwolf
πΈ Proper progress bars for upload and download PR @Koncopd
πΈ Make save return self PR @falexwolf
2024-04-24 db 0.70.4ΒΆ
β¨ Allow passing path to
.from_anndata
PR @sunnyosunπΈ In
.setup.delete()
, check for data deletion & delete from hub PR @falexwolfβ‘οΈ Speed up
latest_version
PR @falexwolfπΈ Better user feedback on folder-like artifacts PR @falexwolf
2024-04-22 db 0.70.3ΒΆ
πΈ Update metadata like description upon re-running PR @falexwolf
π Fix detection of AnnData in zarr and h5ad, refactor directory upload PR @Koncopd
πΈ Raise error if transforms of type notebook or script are passed manually PR @falexwolf
2024-04-19 db 0.70.2ΒΆ
β»οΈ In Vitessce integration, separate
VitessceConfig
from its referenced artifacts PR @falexwolfπΈ In
ln.finish()
, remove flagi_saved_the_notebook
PR @falexwolf
2024-04-18 db 0.70.1ΒΆ
π Fix
public_source
in inspect PR @sunnyosun
2024-04-17 db 0.70.0ΒΆ
πΈ Update data source in case transform is re-run PR @falexwolf
πΈ Enable to label transforms via
transform.ulabels
PR @falexwolfπ Deprecate
stage()
in favor ofcache()
PR @falexwolf
2024-04-12 db 0.69.10ΒΆ
β¨ Add
.obsm
and.layers
toMappedCollection
and renamelabel_keys
toobs_keys
PR @KoncopdπΈ Eliminate kwargs PR @sunnyosun
β¨ Introduce
Annotate.from_mudata
PR @sunnyosun
2024-04-08 db 0.69.9ΒΆ
π Fix clashes for multiple processes PR @falexwolf
2024-04-04 db 0.69.8ΒΆ
2024-04-03 db 0.69.7ΒΆ
β¨ Add ability to upload arbitrary files or folders from CLI PR @falexwolf
π Fix anndata backed mode incompatibility with scipy 1.13.0 f
2024-04-02 db 0.69.6ΒΆ
ποΈ Temp fix region for non-hosted buckets PR @sunnyosun
2024-03-30 db 0.69.5ΒΆ
β»οΈ Improve Annotate API PR PR @sunnyosun @falexwolf
β¨ Introduce
Registry.get()
andlamin get
(replaceslamin stage
) PR @falexwolf
2024-03-30 db 0.69.4ΒΆ
β»οΈ Add Vitessce integration PR @falexwolf
β»οΈ Refactor collections PR @falexwolf
2024-03-28 db 0.69.3ΒΆ
β¨ Introduce annotation flow via
Annotate.from_df
andAnnotate.from_anndata
PR 1 2 3 @sunnyosun
2024-03-26 db 0.69.2ΒΆ
2024-03-18 db 0.69.1ΒΆ
β¨ To try out, add lamindb.validation
with the Validator
class PR @sunnyosun
2024-03-17 db 0.69.0ΒΆ
Main new features:
β¨ Integrate lamindb with git PR PR @falexwolf
β¨ Introduce
ln.finish()
, track run finish times asrun.finished_at
, renamerun.run_at
torun.started_at
, upload notebooks duringln.finish()
PR @falexwolfπΈ Upload script source code and environment during
ln.track()
PR @falexwolf
Other changes:
β¨ Allow including simple related fields in
.df()
PR @falexwolfπ Move transform settings into settings PR @falexwolf
β¨ Add
latest_version
filter forQuerySet
PR @falexwolfπ Rename
transform.short_name
totransform.key
PR @falexwolfβ»οΈ Add a JSON field to
Run
PR @falexwolf
2024-03-11 db 0.68.2ΒΆ
πΈ Move transform & run artifacts into cache before uploading PR @falexwolf
πΈ More sensible transform types PR @falexwolf
π Rename
lnschema_lamin1
towetlab
PR @falexwolf
2024-03-08 db 0.68.1ΒΆ
πΈ You can now use
ln.connect()
to connect to a LaminDB instance PR @falexwolfπΈ You can no longer delete data from non-default storage locations, as these might be tracked in other instances PR @sunnyosun
πΈ Enable transferring data from local instances to remote instances PR @sunnyosun
2024-03-01 db 0.68.0ΒΆ
πΈ Decouple features linking from Artifact construction PR 1 2 3 @sunnyosun.
# default constructor for PathLike
artifact = ln.Artifact("mysc.h5ad", description="raw data")
# from_ constructors for other types
artifact = ln.Artifact.from_anndata(mysc_adata, description="raw data") # no longer links features
artifact = artifact.save()
# high-level feature linking
artifact.features.add_from_anndata(var_field=bt.Gene.ensembl_gene_id)
artifact.features.add_from_df()
# low-level feature linking
meta = ln.Feature.from_values(mysc_adata.obs.columns, field="name")
genes = bt.Gene.from_values(mysc_adata.var.ensembl_gene_id, field="ensembl_gene_id")
artifact.features.add(genes, slot="obs")
artifact.features.add(genes, slot="var")
# labels linking (no change)
labels = ln.ULabel.from_values(adata.obs.donor, field=...)
ln.save(labels)
artifact.labels.add(labels)
πΈ Can now use
ln.track()
withoutlamin track
PR @falexwolfπ
lamin stage
respects new URL design PR @falexwolfπ Rename
.dev
to.core
PR @falexwolf
2024-02-02 db 0.67.3ΒΆ
πΈ Can now import
bionty
instead oflnschema-bionty
PR @sunnyosunβ»οΈ Use Click for CLI PR @chaichontat
π¨ Make
Collection.save()
ACID PR @falexwolf⨠Add
add_to_version_family
PR @sunnyosunπ Transfer
collection.artifacts
PR @sunnyosun
2024-01-14 db 0.67.2ΒΆ
β¨ Enable staging notebooks & code using the CLI PR @falexwolf
2024-01-12 db 0.67.1ΒΆ
π Fix idempotency of
collection.save()
PR @falexwolfπΈ Disallow bulk-delete for Artifact, Transform & Collection PR @falexwolf
πΈ Init transform versions at 1 PR @falexwolf
β¨ Load json and html files PR @falexwolf
2024-01-11 db 0.67.0ΒΆ
π Rename
.bionty
to.public
,.from_bionty
to.from_public
PR @sunnyosun
2024-01-09 db 0.66.1ΒΆ
π Fix id matching in view_lineage PR @sunnyosun
β»οΈ Incorporate edge cases in
inner
andouter
join inCollection.mapped
PR @Koncopdπ¨ Not create organism records when calling
.bionty()
PR @sunnyosun
2024-01-07 db 0.66.0ΒΆ
πΈ Add anonymous access (now works without login) PR1386 @falexwolf
π¨ Introduce ordered collections and simplify
.mapped()
PR1390 @falexwolfπ Re-write quickstart PR1387 @falexwolf
2024-01-05 db 0.65.1ΒΆ
π©Ή Prepare a potential migration of the hub to Django PR1385 @falexwolf
πΈ Various improvements PR1384 @falexwolf
π©Ή Track suffix of requirements.txt PR1383 @falexwolf
β¨ Add outer join and categories caching to Collection.mapped PR1380 @Koncopd
β»οΈ Except memory error PR1382 @falexwolf
2024-01-02 db 0.65.0ΒΆ
π Rename
Dataset
toCollection
PR1377 @falexwolf⨠Track run environment PR1368 @falexwolf
β¨ Allow transfer from private instances PR1370 @falexwolf
πΈ Speed up transfer and enable transfer parents PR1371 @sunnyosun
π¨ Version based on
stem_uid
instead ofinitial_version_id
and replace__lamindb_uid_prefix__
with__transform_stem_uid__
PR1369 PR1375 PR1373 @bpenteado @falexwolfπ¨ Name
.lndb
files by instance id PR1372 @falexwolf
biontyΒΆ
Name |
PR |
Developer |
Date |
Version |
---|---|---|---|---|
π Rename |
2024-01-12 |
0.36.0 |
||
π Rename |
||||
π Rename |
2024-07-26 |
0.44.0 |
||
π¨ Do not add obsolete terms due to ontology_id duplication |
2024-07-25 |
|||
β‘οΈ Speed up parents |
2024-07-22 |
|||
π Rename registry to record |
2024-07-17 |
|||
β»οΈ Consciously use class method |
2024-07-10 |
|||
π Fix clashing reverse accessors between |
2024-06-13 |
0.43.0 |
||
β»οΈ Reformulate data lineage, remove json field from run |
2024-05-19 |
0.42.0 |
||
β»οΈ Protect gene, protein, cell_marker & pathway in their FeatureSet relationships |
2024-05-18 |
|||
ποΈ Naming conventions for link tables, protecting deletion in link tables, maintaining integrity upon label & feature renames |
2024-05-18 |
|||
β»οΈ Account for migrations in lnschema_core |
2024-05-17 |
|||
π₯ Prune migrations |
2024-05-16 |
|||
ποΈ Spell out link tables with |
2024-05-16 |
|||
π₯ Remove linking Collection to all Bionty entities |
2024-05-15 |
|||
β¨ Add sources |
2024-05-14 |
|||
π¨ Fix passing arguments to from_public |
2024-05-13 |
|||
π Fix organism |
2024-05-08 |
|||
ποΈ Fix public_source in inspect |
2024-04-18 |
|||
π Fix syncing public sources |
2024-04-11 |
|||
β¨ Add PublicSource.set_as_currently_used |
2024-03-14 |
0.41.4 |
||
βοΈ Fix encoding |
2024-01-12 |
0.38.4 |
||
ποΈ Re-encode PublicSource |
2024-01-10 |
0.38.3 |
||
π Rename .bionty to .public |
2024-01-09 |
|||
πΈ Do not create organism when calling bionty |
2024-01-08 |
wetlabΒΆ
Name |
PR |
Developer |
Date |
Version |
---|---|---|---|---|
β»οΈ Model categoricals via simple |
2024-08-02 |
|||
β¨ Improved support for perturbations |
2024-07-29 |
|||
π Rename |
2024-03-08 |
0.27.0 |
nbprojectΒΆ
Name |
PR |
Developer |
Date |
Version |
---|---|---|---|---|
β¬οΈ Upgrade to pydantic v2 |
2024-07-23 |
|||
β‘οΈ Warn instead of raising the exception when ipylab is not installed |
2024-05-08 |
0.10.3 |
||
β»οΈ Make ipylab an optional dependency |
2024-05-06 |
|||
π Silence erroneous logging |
2024-02-27 |
0.10.1 |
||
πΈ Init version at 1 |
2024-01-11 |
0.10.0 |