This repo contains data models generated using LinkML for the BICAN knowledgebase project founded by the National Institute of Mental Health.
Here are the BICAN LinkML knowledgebase schemas and their statuses.
Model | Short Description | Release with Latest Updates | Status |
---|---|---|---|
Assertion Evidence Model | Types and relationships of assertions and evidences/ | 0.2.0 | under development |
Library Generation Model | Types and relationships of samples and digital data assets generated during processes that generate multimodal genomic data. | 0.2.0 | under development |
Anatomical Structure Model | Types and relationships of anatomical brain structures. | 0.1.0 | under development |
Genome Annotation Model | Types and relationships of an organism's annotated genome. | 0.2.0 | under development |
BICAN BioLink | BICAN subset of classes from the Biolink model. | 0.2.0 | under development |
CCN2 | 0.1.0 | deprecated | |
Figure1 | 0.1.0 | deprecated |
In this project we use the LinkML, the Linked Data Modeling Language, to define the data models. LinkML is a flexible modeling language that allows you to author schemas in YAML format, and it is designed to be both human-readable and machine-readable. LinkML framework provides also a set of tools to generate code in different languages, such as Python, JSON, and RDF, from the schema files.
All the LinkML schema files are stored in the linkml-schema
directory.
Some of the models are written directly in the YAML format, while others automatically generated from Google sheets using the LinkML tool schemasheets,
and the schema2model
tool from the bkbit
package.
You can find the specific information in linkml-schema/README.md.
The LinkML schema files are used to generate additional formats, such as JSON Schema and Pydantic models. All files are generated automatically using GitHub Actions workflow whenever the LinkML schema files are updated. You can see the specific workflow in the reusable workflow that is reused for all models.
Currently, we are supporting the following formats:
- Pydantic models: these models are used in the Brain Knowledge Base Interaction Toolkit (bkbit)
- JSON Schema
- JSON-LD Context
- ER Diagrams
All the schemas are automatically tested in the GitHub Actions workflow using LinkML validation tools (see test_lint.yaml for details) and Python API and pytest (see test_models.yaml for details).
In order to run the pytest test locally, you can use the following commands:
pip install -e .[test]
pytest -vs
These materials are provided under the Attribution International 4.0 (CC-BY-4.0) license, which is available at https://creativecommons.org/licenses/by/4.0/
An extensible brain knowledge base and toolset spanning modalities for multi-species data-driven cell types BICAN Knowledgebase National Institute of Mental Health Award # 1U24MH130918-01