###### Disease

lamindb provides access to the following public Disease ontologies
through bionty:

1. Mondo

2. Human Disease

Here we show how to access and search Disease ontologies to
standardize new data.

 import bionty as bt
 import pandas as pd

##### PublicOntology objects

Let us create a public ontology accessor with ".public" method, which
chooses a default public ontology source from "Source". It's a
PublicOntology object, which you can think about as a public registry:

 diseases = bt.Disease.public(organism="all")
 diseases

As for registries, you can export the ontology as a "DataFrame":

 df = diseases.to_dataframe()
 df.head()

Unlike registries, you can also export it as a Pronto object via
"public.ontology".

##### Look up terms

As for registries, terms can be looked up with auto-complete:

 lookup = diseases.lookup()

The "." accessor provides normalized terms (lower case, only contains
alphanumeric characters and underscores):

 lookup.alzheimer_disease

To look up the exact original strings, convert the lookup object to
dict and use the "[]" accessor:

 lookup_dict = lookup.dict()
 lookup_dict["Alzheimer disease"]

By default, the "name" field is used to generate lookup keys. You can
specify another field to look up:

 lookup = diseases.lookup(diseases.ontology_id)

 lookup.mondo_0004975

##### Search terms

Search behaves in the same way as it does for registries:

 diseases.search("parkinson disease").head(3)

By default, search also covers synonyms and all other fields
containing strings:

 diseases.search("paralysis agitans").head(3)

Search specific field (by default, search is done on all fields
containing strings):

 diseases.search(
 "progressive degenerative disorder of the central nervous system",
 field=diseases.definition,
 ).head()

##### Standardize Disease identifiers

Let us generate a "DataFrame" that stores a number of Disease
identifiers, some of which corrupted:

 df_orig = pd.DataFrame(
 index=[
 "supraglottis cancer",
 "alexia",
 "trigonitis",
 "paranasal sinus disorder",
 "This disease does not exist",
 ]
 )
 df_orig

We can check whether any of our values are validated against the
ontology reference:

 validated = diseases.validate(df_orig.index, diseases.name)
 df_orig.index[~validated]

##### Ontology source versions

For any given entity, we can choose from a number of versions:

 bt.Source.filter(entity="bionty.Disease").to_dataframe()

 # only lists the sources that are currently used
 bt.Source.filter(entity="bionty.Disease", currently_used=True).to_dataframe()

When instantiating a Bionty object, we can choose a source or version:

 source = bt.Source.filter(
 name="mondo", organism="all"
 ).first()
 diseases= bt.Disease.public(source=source)
 diseases

The currently used ontologies can be displayed using:

 bt.Source.filter(currently_used=True).to_dataframe()