Supervised modelling, sequence generation, image generation, array output, and survival analysis on genotype, tabular, sequence, image, array, and binary input data.
WARNING: This project is in alpha phase. Expect backwards incompatible changes and API changes between minor versions.
- Install
- Usage
- Use Cases
- Features
- Supported Inputs and Outputs
- Related Projects
- Citation
- Acknowledgements
## Install

```bash
pip install eir-dl
```
Important: The latest version of EIR requires Python 3.12. Using an older version of Python will install an outdated version of EIR, which will likely be incompatible with the current documentation and might contain bugs. Please ensure you are using Python 3.12.
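To make sure the installation runs under Python 3.12, one option is to create a dedicated virtual environment first. A minimal sketch using the standard `venv` module (the environment name is arbitrary):

```bash
# Create and activate a virtual environment pinned to Python 3.12
python3.12 -m venv eir-env
source eir-env/bin/activate

# Confirm the interpreter version before installing
python --version  # should print Python 3.12.x

pip install eir-dl
```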
Here's an example with Docker:

```bash
# Build the EIR image directly from the Dockerfile in the repository
docker build -t eir:latest https://raw.githubusercontent.com/arnor-sigurdsson/EIR/master/Dockerfile

# Start a container in the background, then open an interactive shell inside it
docker run -d --name eir_container eir:latest
docker exec -it eir_container bash
```
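Once inside the container, the EIR command-line tools should be on the PATH; for example (assuming the training entry point is named `eirtrain`, as in the documentation):

```bash
eirtrain --help
```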
## Usage

Please refer to the Documentation for examples and information.
## Use Cases

EIR allows for training and evaluating various deep-learning models directly from the command line. This can be useful for:
- Quick prototyping and iteration when doing supervised modelling or sequence generation on new datasets.
- Establishing baselines to compare against other methods.
- Fitting on data sources such as large-scale genomics, where DL implementations are not commonly available.
If you are an ML/DL researcher developing new models, EIR might not fit your use case. However, it can provide a quick baseline to compare against the methods you are developing, and some degree of customization is possible.
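As a rough illustration of the command-line workflow, a run is described with a few YAML configuration files and started with the `eirtrain` command. The sketch below is illustrative only: the exact configuration keys vary between versions, so treat the field names and paths as placeholders and consult the documentation for the current schema.

```yaml
# input.yaml -- one file per input modality (field names illustrative)
input_info:
  input_source: data/genotype_arrays/
  input_name: genotype
  input_type: omics
```

```yaml
# output.yaml -- defines the supervised target(s) (field names illustrative)
output_info:
  output_source: data/targets.csv
  output_name: phenotype
  output_type: tabular
output_type_info:
  target_cat_columns:
    - Origin
```

```bash
# Launch training; globals.yaml would hold run-wide settings such as
# the output folder and number of epochs
eirtrain \
  --global_configs globals.yaml \
  --input_configs input.yaml \
  --output_configs output.yaml
```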
## Features

- **General**
  - Train models directly from the command line through `.yaml` configuration files, as sketched above.
  - Training on genotype, tabular, sequence, image, array and binary input data, with various modality-specific settings available.
  - Seamless multi-modal training, e.g., combining text + image + tabular data, or any combination of the modalities above (see the command sketch after this list).
  - Train multiple feature extractors on the same data source, e.g., combining a vanilla transformer, Longformer and a pre-trained BERT variant for text classification.
  - Support for checkpointing and continued training, as well as pretraining and transferring parts of trained models to new tasks.
- **Supervised Learning**
  - Supports continuous (i.e., regression) and categorical (i.e., classification) targets.
  - Multi-task / multi-label prediction supported out-of-the-box.
  - Model explainability for genotype, tabular, sequence, image and array data built in.
  - Computes and graphs various evaluation metrics during training (e.g., RMSE, PCC and R2 for regression tasks; accuracy, ROC-AUC, etc. for classification tasks).
- **Sequence Generation**
  - Supports various sequence generation tasks, including basic sequence generation, sequence to sequence transformations, and image to sequence transformations. For more information, refer to the respective tutorials: sequence generation, sequence to sequence, image to sequence and tabular to sequence.
- **Image Generation**
  - Image generation is supported. For more information, refer to the respective tutorials: Building a Simple Image Autoencoder, Image Colorization and Super-Resolution, and Guided Diffusion for Image Generation.
- **Array Output**
  - Supports array output tasks, such as building simple autoencoders for tasks like MNIST Digit Generation.
- **Time Series**
  - Time series inputs and outputs are supported, demonstrated through Transformer-based Power Consumption Prediction and Stock Price Prediction Using Transformers, One-shot and Diffusion Models.
- **Survival Analysis**
  - Time-to-event prediction is supported as an output type, demonstrated through Patient Survival Prediction using Free Light Chain Data and Survival Analysis Using Cox Proportional Hazards Model.
- Many more settings and configurations (e.g., augmentation, regularization, optimizers) available.
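To make the multi-modal point above concrete, combining modalities amounts to passing one input configuration per modality in a single run. A hedged sketch (file names are placeholders; whether the flag accepts multiple files this way should be checked against the documentation's multi-modal tutorials):

```bash
# Train one model on text + image + tabular inputs together
eirtrain \
  --global_configs globals.yaml \
  --input_configs input_text.yaml input_image.yaml input_tabular.yaml \
  --output_configs output.yaml
```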
## Supported Inputs and Outputs

| Modality | Input | Output |
|----------|-------|--------|
| Genotype | x     | †      |
| Tabular  | x     | x      |
| Sequence | x     | x      |
| Image    | x     | x      |
| Array    | x     | x      |
| Binary   | x     |        |
| Survival | n/a   | x      |
† While not directly supported, genotypes can be treated as arrays. For example, see the MNIST Digit Generation tutorial.
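As a sketch of that workaround, per-sample genotype data stored as NumPy arrays could be declared as an array output (field names are illustrative and may differ between versions; check the array tutorials in the documentation for the current schema):

```yaml
# output_array.yaml -- treating per-sample genotype arrays as an array output
output_info:
  output_source: data/genotype_arrays/  # e.g., a folder of per-sample .npy files
  output_name: genotype
  output_type: array
```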
## Related Projects

- EIR-auto-GP: Automated genomic prediction (GP) using deep learning models with EIR.
## Citation

If you use EIR in a scientific publication, we would appreciate it if you used one of the following citations:
- Deep integrative models for large-scale human genomics
- Non-linear genetic regulation of the blood plasma proteome
- Improved prediction of blood biomarkers using deep learning
```bibtex
@article{10.1093/nar/gkad373,
  author  = {Sigurdsson, Arn{\'o}r I and Louloudis, Ioannis and Banasik, Karina and Westergaard, David and Winther, Ole and Lund, Ole and Ostrowski, Sisse Rye and Erikstrup, Christian and Pedersen, Ole Birger Vesterager and Nyegaard, Mette and DBDS Genomic Consortium and Brunak, S{\o}ren and Vilhj{\'a}lmsson, Bjarni J and Rasmussen, Simon},
  title   = {{Deep integrative models for large-scale human genomics}},
  journal = {Nucleic Acids Research},
  month   = {05},
  year    = {2023}
}

@article{sigurdsson2024non,
  title     = {Non-linear genetic regulation of the blood plasma proteome},
  author    = {Sigurdsson, Arnor I and Gr{\"a}f, Justus F and Yang, Zhiyu and Ravn, Kirstine and Meisner, Jonas and Thielemann, Roman and Webel, Henry and Smit, Roelof AJ and Niu, Lili and Mann, Matthias and others},
  journal   = {medRxiv},
  pages     = {2024--07},
  year      = {2024},
  publisher = {Cold Spring Harbor Laboratory Press}
}

@article{sigurdsson2022improved,
  author    = {Sigurdsson, Arnor Ingi and Ravn, Kirstine and Winther, Ole and Lund, Ole and Brunak, S{\o}ren and Vilhjalmsson, Bjarni J and Rasmussen, Simon},
  title     = {Improved prediction of blood biomarkers using deep learning},
  journal   = {medRxiv},
  pages     = {2022--10},
  year      = {2022},
  publisher = {Cold Spring Harbor Laboratory Press}
}
```
## Acknowledgements

Massive thanks to everyone publishing and developing the packages this project directly and indirectly depends on.