✨ allow more recent Python versions #67

enryH · 2024-05-31T09:38:37Z

…#66)

✨ test Snakemake workflow with more recent Python versions
✨ remove snakemake upper limit
🐛 bump plac version

related to : snakemake/snakemake#2276

📌 update actions, make artefacts unique

check if local windows error with pandas can be reproduced in action (corr)

🎨 dump counts for histograms

add for simulated missing values
remove duplication
🐛 do not omit last bin

🐛 assure that some values are set to NA

if all values are higher than the default threshold, the assertion on L17 is not met. Make sure some NAs (missing values) are set.

🎨 write out corr and prepare for pandas 2.0

see if this works also with pandas 1.5.3

🔧 test relaxing pandas restriction
🐛 drop batches with one sample for training DAE and VAE

for creating the latent representation, now a new DataLoader is needed.

✨ splitup large global environement

separate environment for PIMMS models and R based models
global environment should still work

🐛 test if adding jupyter is sufficient to install further packages in R sesssion

only execute one job at a time in retry to see errors better

🐛 fix sampling to make it compatible with python >=3.11
⬆️ remove pytorch upper dependency
✅ Test workflow v2 on Alzheimer dataset

once this passes, add ald analysis to website (for a reasonable subset of models)
maybe only showcase PIMMS models with a handful of other models

🐛 update path to execute run, speed-up

also remove two slowest models

🎨 hide code in rendered notebooks of workflow, sort imports

hide code cells for generated report
isort imports

✨ Functionality for plot source data (ALD study)

add some functionality required to collect source data for reporting on saved figures.

✨ Run differential analysis workflow in CI on Alzheimer data

several adaptions to slightly different design between ALD and Alzheimer data

🐛 specify folder_experiment from global space

... and not as wildcard

🎨 rename Snakefile_v2 to Snakefile_v2.smk

uncommon names should have a file ending specifying Snakefiles.

✨ skript to build website (execution)

execution should work, but subfolders need their own index.rst
need to adapt script for updating main index.rst

See if everything runs for now.

🐛 do not exclude diff analysis folder in conf.py
🎨🐛 make a strict hierarchy of headings per document

mapping titles in sphinx (cross-referencing) otherwise does not work

🎨 collapse code in published notebooks

for better inspection of generated report for example

🎨 annotate notebooks

add some comments and streamline cells.

✨ Test tutorial on colab
🐛🎨 format and check briefly colab workflow on dev branch
🎨 hide more inputs, downscale tutorial runner
📝 Update README

🐛 use larger image to test tutorial on colab

📝 update READMEs and add some hints to config files
📝🎨 save some adhoc script used during revisons, add and cleanup nb list
🐛 go back old config indentation (and model configuration)

rerun in codespace for inspection

🐛 fix issue having same model with 2 configurations

had to set model id ("model_key") as index

📝✨ Allow users to download large HeLa protein groups dataset easily

* ✨ test Snakemake workflow with more recent Python versions * ✨ remove snakemake upper limit * 🐛 bump plac version related to : snakemake/snakemake#2276 * 📌 update actions, make artefacts unique - check if local windows error with pandas can be reproduced in action (corr) * 🎨 dump counts for histograms - add for simulated missing values - remove duplication - 🐛 do not omit last bin * 🐛 assure that some values are set to NA if all values are higher than the default threshold, the assertion on L17 is not met. Make sure some NAs (missing values) are set. * 🎨 write out corr and prepare for pandas 2.0 - see if this works also with pandas 1.5.3 * 🔧 test relaxing pandas restriction * 🐛 drop batches with one sample for training DAE and VAE - for creating the latent representation, now a new DataLoader is needed. * ✨ splitup large global environement - separate environment for PIMMS models and R based models - global environment should still work * 🐛 test if adding jupyter is sufficient to install further packages in R sesssion - only execute one job at a time in retry to see errors better * 🐛 fix sampling to make it compatible with python >=3.11 * ⬆️ remove pytorch upper dependency * ✅ Test workflow v2 on Alzheimer dataset - once this passes, add ald analysis to website (for a reasonable subset of models) - maybe only showcase PIMMS models with a handful of other models * 🐛 update path to execute run, speed-up - also remove two slowest models * 🎨 hide code in rendered notebooks of workflow, sort imports - hide code cells for generated report - isort imports * ✨ Functionality for plot source data (ALD study) - add some functionality required to collect source data for reporting on saved figures. * ✨ Run differential analysis workflow in CI on Alzheimer data - several adaptions to slightly different design between ALD and Alzheimer data * 🐛 specify folder_experiment from global space - ... and not as wildcard * 🎨 rename Snakefile_v2 to Snakefile_v2.smk - uncommon names should have a file ending specifying Snakefiles. * ✨ skript to build website (execution) - execution should work, but subfolders need their own index.rst - need to adapt script for updating main index.rst See if everything runs for now. * 🐛 do not exclude diff analysis folder in conf.py * 🎨🐛 make a strict hierarchy of headings per document - mapping titles in sphinx (cross-referencing) otherwise does not work * 🎨 collapse code in published notebooks - for better inspection of generated report for example * 🎨 annotate notebooks add some comments and streamline cells. * ✨ Test tutorial on colab * 🐛🎨 format and check briefly colab workflow on dev branch * 🎨 hide more inputs, downscale tutorial runner * 📝 Update README - 🐛 use larger image to test tutorial on colab * 📝 update READMEs and add some hints to config files * 📝🎨 save some adhoc script used during revisons, add and cleanup nb list * 🐛 go back old config indentation (and model configuration) - rerun in codespace for inspection * 🐛 fix issue having same model with 2 configurations - had to set model id ("model_key") as index * 📝✨ Allow users to download large HeLa protein groups dataset easily

Henry Webel added 3 commits May 31, 2024 11:37

🎨 move plotting functionality to package

115e681

🎨 annotate some functions, remove tags

7fc0193

enryH merged commit 0874f5f into main May 31, 2024
51 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

✨ allow more recent Python versions #67

✨ allow more recent Python versions #67

enryH commented May 31, 2024

✨ allow more recent Python versions #67

✨ allow more recent Python versions #67

Conversation

enryH commented May 31, 2024