✨ allow more recent Python versions #67

Merged
merged 3 commits into from
May 31, 2024

@enryH enryH commented May 31, 2024

#66)

  • ✨ test Snakemake workflow with more recent Python versions

  • ✨ remove snakemake upper limit

  • 🐛 bump plac version

related to : snakemake/snakemake#2276

  • 📌 update actions, make artefacts unique
  • check if local windows error with pandas can be reproduced in action (corr)
  • 🎨 dump counts for histograms
  • add for simulated missing values
  • remove duplication
  • 🐛 do not omit last bin
  • 🐛 assure that some values are set to NA

if all values are higher than the default threshold, the assertion on L17 is not met. Make sure some NAs (missing values) are set.
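A minimal sketch of the guard described above, not the actual PIMMS code: values below a threshold are set to missing, and if none fall below it, the lowest values are masked instead so that a later "some NAs exist" assertion holds. The function name `mask_low_values`, the example threshold, and the fallback of masking the `n_min` smallest values are assumptions for illustration.

```python
import math


def mask_low_values(values, threshold, n_min=1):
    """Replace values below `threshold` with NaN (hypothetical helper).

    If no value is below the threshold, mask the `n_min` smallest values
    instead, so that at least some missing values are always produced.
    """
    masked = [math.nan if v < threshold else v for v in values]
    if not any(math.isnan(v) for v in masked):
        # Fallback (assumption): mask the n_min smallest values by position.
        order = sorted(range(len(values)), key=lambda i: values[i])
        for i in order[:n_min]:
            masked[i] = math.nan
    return masked


# All values exceed the threshold, yet a missing value is still introduced.
result = mask_low_values([25.0, 27.5, 26.1], threshold=20.0)
assert any(math.isnan(v) for v in result)
```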

  • 🎨 write out corr and prepare for pandas 2.0
  • see if this works also with pandas 1.5.3
  • 🔧 test relaxing pandas restriction

  • 🐛 drop batches with one sample for training DAE and VAE

  • for creating the latent representation, now a new DataLoader is needed.
  • ✨ split up large global environment
  • separate environment for PIMMS models and R based models
  • global environment should still work
  • 🐛 test if adding jupyter is sufficient to install further packages in R session
  • only execute one job at a time in retry to see errors better
  • 🐛 fix sampling to make it compatible with Python >=3.11
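
One common source of sampling breakage on Python 3.11 is that `random.sample` no longer accepts a set as its population (it raises `TypeError`; the implicit conversion was deprecated in Python 3.9). Whether this is the exact cause fixed in this commit is an assumption. The sketch below only shows the usual workaround of converting to an ordered sequence first; the helper name `sample_ids` is hypothetical.

```python
import random


def sample_ids(ids, k, seed=None):
    """Draw k distinct items from a set-like collection (hypothetical helper).

    Sorting converts the set into a sequence, which random.sample requires
    on Python >= 3.11, and also makes the draw reproducible for a fixed seed.
    """
    rng = random.Random(seed)
    return rng.sample(sorted(ids), k)


picked = sample_ids({"s1", "s2", "s3", "s4"}, k=2, seed=42)
assert len(picked) == 2 and set(picked) <= {"s1", "s2", "s3", "s4"}
```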

  • ⬆️ remove pytorch upper dependency

  • ✅ Test workflow v2 on Alzheimer dataset

  • once this passes, add ald analysis to website (for a reasonable subset of models)
  • maybe only showcase PIMMS models with a handful of other models
  • 🐛 update path to execute run, speed-up
  • also remove two slowest models
  • 🎨 hide code in rendered notebooks of workflow, sort imports
  • hide code cells for generated report
  • isort imports
  • ✨ Functionality for plot source data (ALD study)
  • add some functionality required to collect source data for reporting on saved figures.
  • ✨ Run differential analysis workflow in CI on Alzheimer data
  • several adaptations to the slightly different design between ALD and Alzheimer data
  • 🐛 specify folder_experiment from global space
  • ... and not as wildcard
  • 🎨 rename Snakefile_v2 to Snakefile_v2.smk
  • uncommon names should have a file ending specifying Snakefiles.
  • ✨ script to build website (execution)
  • execution should work, but subfolders need their own index.rst
  • need to adapt script for updating main index.rst

See if everything runs for now.

  • 🐛 do not exclude diff analysis folder in conf.py

  • 🎨🐛 make a strict hierarchy of headings per document

  • mapping titles in sphinx (cross-referencing) otherwise does not work
  • 🎨 collapse code in published notebooks
  • for better inspection of generated report for example
  • 🎨 annotate notebooks

add some comments and streamline cells.

  • ✨ Test tutorial on colab

  • 🐛🎨 format and briefly check colab workflow on dev branch

  • 🎨 hide more inputs, downscale tutorial runner

  • 📝 Update README

  • 🐛 use larger image to test tutorial on colab
  • 📝 update READMEs and add some hints to config files

  • 📝🎨 save some ad hoc scripts used during revisions, add and clean up nb list

  • 🐛 go back to old config indentation (and model configuration)

  • rerun in codespace for inspection
  • 🐛 fix issue having same model with 2 configurations
  • had to set model id ("model_key") as index
  • 📝✨ Allow users to download large HeLa protein groups dataset easily
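
To illustrate why the model id has to serve as the index when the same model appears with two configurations (the `model_key` fix listed above), here is a small sketch using plain dictionaries. The config layout, the field names other than `model_key`, and the helper `index_by_model_key` are assumptions, not the project's actual structures.

```python
def index_by_model_key(configs):
    """Index model configurations by their unique "model_key".

    Keying by the model name would silently overwrite one of two
    configurations of the same model; keying by model_key keeps both.
    """
    indexed = {}
    for cfg in configs:
        key = cfg["model_key"]
        if key in indexed:
            raise ValueError(f"duplicate model_key: {key}")
        indexed[key] = cfg
    return indexed


# Hypothetical example: one model type, two configurations.
configs = [
    {"model_key": "VAE_small", "model": "VAE", "latent_dim": 10},
    {"model_key": "VAE_large", "model": "VAE", "latent_dim": 50},
]
by_key = index_by_model_key(configs)
assert len(by_key) == 2  # both VAE configurations are kept
```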

Henry Webel added 3 commits May 31, 2024 11:37
@enryH enryH merged commit 0874f5f into main May 31, 2024
51 checks passed