-
Notifications
You must be signed in to change notification settings - Fork 13
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Updating profile & consensus docs #75
Updating profile & consensus docs #75
Conversation
I think I still need to add a part where I explain where the profiles and consensus can be found in the repo. That should go in the top level README |
@gwaygenomics
I have read this part of the |
Yes! We should clarify anything that isn't clear. It's just saying that the consensus signatures for DMSO create profiles per well instead of per treatment. Does that make sense? One might assume that we treat DMSO like any other perturbation when creating consensus profiles - that is, combine ALL replicates into a single signature. We don't do that. Instead, we collapse per well, thus creating ~24 or so (I don't remember the exact number) DMSO consensus profiles. |
@niranjchandrasekaran Can you review and merge this? @FloHu any last comments? |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
@michaelbornholdt I made a few inline comments. Also there is typo in the feature_select
box of the figure in the word "operations".
I am not approving or merging this PR yet as I want @gwaygenomics to take a look at your changes since he should be the one who decides what goes where in this repo :)
README.md
Outdated
@@ -19,10 +19,11 @@ The [Morphology Connectivity Hub](https://clue.io/morphology) is the primary sou | |||
We apply a unified, image-based profiling pipeline to all 136 384-well plates from `LINCS Pilot 1`, and all 135 384-well plates from `LKCP`. | |||
We use [pycytominer](https://github.com/cytomining/pycytominer) as the primary tool for image-based profiling. | |||
|
|||
We process and store profiles in the [profiles/](profiles/) directory. | |||
See [`profiles/README.md`](profiles/README.md) for more details and for instructions on how to reproduce the pipeline. | |||
We process and store profiles in the [profiles/](profiles/) directory. By 'profiles' we refer to the level 3 data, which contains the aggregate of each well. |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
We call everything from level 3-5 profiles. Also doesn't this folder contain level 4a and 4b profiles?
profiles/README.md
Outdated
|
||
Note here that we do not include the intermediate step of generating `.sqlite` files per plate using a tool called [cytominer-database](https://github.com/cytomining/cytominer-database). | ||
This repository and workflow begins after we applied cytominer-database. | ||
|
||
|
||
### Aggregation | ||
The [aggregation method](https://github.com/cytomining/pycytominer/blob/master/pycytominer/aggregate.py) is used twice in the workflow. Firstly, the median of all cells within a well is aggregated to one profiler per well. The aggregation method doesn't persist the metadata which is why this step is followed by an annotation step to add the MOA data and others. |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
I like the idea of thinking about creating the consensus profiles as another aggregation step. But given that you have a separate section for consensus below perhaps you could talk about aggregating 4b to consensus profiles in that section and not here.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
@michaelbornholdt - can you address this?
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
I have, there are two different sections now that explain aggregation and consensus.
profiles/README.md
Outdated
[Pycytominer](https://github.com/cytomining/pycytominer) is a code base built by @gwaygenomics and @niranjchandrasekaran. | ||
It allows easy processing CellProfiler data and contains all functions that were used to create the data in this repository. Below, we describe the different steps of the pipeline. Please check the pycytominer repo for more details. | ||
|
||
Part of the pipeline, from Level 3 to Level 4b, can be found in the [profile_cells](https://github.com/broadinstitute/lincs-cell-painting/blob/master/profiles/profile_cells.py) script and the final aggregation to the consensus data is found in this [notebook](https://github.com/broadinstitute/lincs-cell-painting/blob/master/consensus/build-consensus-signatures.ipynb). |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Do you think including the notebook that contains the code for sphering should also be mentioned here?
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
@michaelbornholdt - can you address this?
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
I have
I would still clarify what is the difference in the plate normalization and the normalization by spherization, i.e. which artifacts are corrected in the former and which ones in the latter and where they occur. For people from outside the imaging field, e.g. in some screen types people just normalize on a plate by plate basis and that takes care of the batch effects. Overall, great job, things are a lot clearer now. Thanks for the effort! |
Thanks @michaelbornholdt - I will provide my review once you address @niranjchandrasekaran's comments |
Co-authored-by: Niranj Chandrasekaran <[email protected]>
Co-authored-by: Niranj Chandrasekaran <[email protected]>
Co-authored-by: Niranj Chandrasekaran <[email protected]>
@gwaygenomics |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Looks great @michaelbornholdt - I made several suggested changes (note that you can commit these directly from the github GUI, so there is no need to change again on your end)
The three primary comments are:
- Make sure you make formatting consistent. This means proper spacing after headings and converting to one-sentence-per-line. I tried to adjust every instance, but it's likely i missed some.
- Try your best to edit this document to active (instead of passive) voice. For example, write like this: "Unlike the other normalizations, we spherize the data using the full batch (all plates)." and not like this: "Unlike the other normalizations, spherizing is done on the full batch (all plates)" It makes reading such a more pleasant experience :)
- Make your png overwrite the old png - no need for a new file name
Thanks again!
Co-authored-by: Greg Way <[email protected]>
Co-authored-by: Greg Way <[email protected]>
Co-authored-by: Greg Way <[email protected]>
Co-authored-by: Greg Way <[email protected]>
Co-authored-by: Greg Way <[email protected]>
Co-authored-by: Greg Way <[email protected]>
Co-authored-by: Greg Way <[email protected]>
Co-authored-by: Gregory Way <[email protected]>
Co-authored-by: Gregory Way <[email protected]>
Co-authored-by: Gregory Way <[email protected]>
@niranjchandrasekaran @gwaygenomics |
Why do the suggestions of Grag still appear in the other tab? |
batch commit to clear my suggestions
@michaelbornholdt - it also looks like you need to add back your new-and-improved workflow figure: |
The newest version is there!? Or what am I missing? |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Just dropping in to say that this figure was really helpful to me – I had forgotten the order of operations leading up to spherizing and this cleared it up. Thanks again @michaelbornholdt ! 🥇 |
Related to issue #73