Update Benchmarking Notebook & Refactor Training Pipeline #12
Add this suggestion to a batch that can be applied as a single commit.
This suggestion is invalid because no changes were made to the code.
Suggestions cannot be applied while the pull request is closed.
Suggestions cannot be applied while viewing a subset of changes.
Only one suggestion per line can be applied in a batch.
Add this suggestion to a batch that can be applied as a single commit.
Applying suggestions on deleted lines is not supported.
You must change the existing code in this line in order to create a valid suggestion.
Outdated suggestions cannot be applied.
This suggestion has been applied or marked resolved.
Suggestions cannot be applied from pending reviews.
Suggestions cannot be applied on multi-line comments.
Suggestions cannot be applied while the pull request is queued to merge.
Suggestion cannot be applied right now. Please check back later.
As a token of gratitude for the great work, some updates to help people reading & running the code 🤗.
Below a list of all the changes. In addition to the committed changes, there were some potential changes that i could imagine improving the experimental setup. These are left as unchecked boxes. I separated changes to the training pipeline and the benchmarking notebook into 2 branches, just in case.
Training Pipeline
Benchmarking Notebook
Sanity Check
To make sure, refactoring the training pipeline didn't mess up the semantics, i re-ran the experiments on the Pets dataset. Here are the learning curves i got:
To save compute, i refrained from retraining on ImageNette and re-used the weights i got from back then. In both cases, evaluating Insertion, Deletion & Faithfulness did reproduce the results of the paper for vit_base_patch16_224.
Please let me know if you like it or have additional suggestions.