
Update Benchmarking Notebook & Refactor Training Pipeline #12

Open · JensRahnfeld wants to merge 10 commits into master
Conversation

@JensRahnfeld commented Jul 19, 2024

As a token of gratitude for the great work, here are some updates to help people read & run the code 🤗.

Below is a list of all the changes. In addition to the committed changes, there are some potential changes that I could imagine improving the experimental setup; these are left as unchecked boxes. I separated the changes to the training pipeline and the benchmarking notebook into two branches, just in case.

Training Pipeline

  • Load the Pets dataset through the fastai API, removing the need to specify the dataset location manually (see the sketch after this list)
  • Remove code duplication by merging the classifier, surrogate & explainer backbone initialization & loading into a shared base module
  • Fix checkpointing
  • Add conda environment.yaml for all the 🐍 enjoyers
  • Set default precision to 32-bit (this fixed the NaNs I encountered due to numerical issues on ImageNette; also sketched after this list)
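
For reference, a minimal sketch of what the fastai-based loading and the 32-bit precision setting could look like. The exact paths and trainer arguments in this PR may differ, so treat the names below as assumptions rather than the committed code:

```python
# Sketch only: download the Oxford-IIIT Pets dataset through fastai so that no
# dataset location has to be specified manually.
from fastai.data.external import untar_data, URLs

pets_path = untar_data(URLs.PETS)   # cached under ~/.fastai/data by default
images_dir = pets_path / "images"   # the images live in the "images" subfolder

# Sketch only: pin training to full 32-bit precision in PyTorch Lightning,
# which is one way to avoid the NaNs mentioned above.
import pytorch_lightning as pl

trainer = pl.Trainer(precision=32)
```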

Benchmarking Notebook

  • Fix import (there seems to have been a rename: vitmedical.modules.explainer -> vit_shapley.modules.explainer)
  • Fix checkpointing upon LRP initialization
  • cd into the parent directory (vit-shapley) only upon first cell execution (otherwise repeated runs keep cd'ing "../"; see the sketch after this list)
  • Automatically create result folders for experiments
  • Clean up unused code / cells
  • Evaluate Attention Rollout with Residuals (currently set to False, which degrades the performance metrics)
  • Sample RISE masks using generate_mask instead of a binomial distribution, as is done for the surrogate model (also sketched after this list)
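
Two of the notebook fixes are simple enough to sketch inline; directory names and paths below are assumptions, not the PR's literal code:

```python
import os

# Only cd into the repo root on the first execution of the cell, so re-running
# the cell doesn't keep walking up "../" past vit-shapley.
if os.path.basename(os.getcwd()) != "vit-shapley":
    os.chdir("..")

# Create the experiment's result directory up front instead of failing later;
# exist_ok makes re-runs harmless.
result_dir = os.path.join("results", "benchmark")  # hypothetical path
os.makedirs(result_dir, exist_ok=True)
```

And a rough sketch of the difference between binomial mask sampling and a cardinality-based sampler in the spirit of generate_mask; the actual helper in the repo may have a different signature, so this only illustrates the idea:

```python
import torch

num_masks, num_players = 8, 196  # e.g. the 14x14 patch grid of vit_base_patch16_224

# Binomial sampling: each patch is kept independently with probability p.
p = 0.5
masks_binomial = torch.bernoulli(torch.full((num_masks, num_players), p))

# Cardinality-based sampling (the generate_mask idea): first draw how many
# patches to keep, then pick exactly that many at random.
def generate_mask_sketch(num_masks: int, num_players: int) -> torch.Tensor:
    masks = torch.zeros(num_masks, num_players)
    for i in range(num_masks):
        k = torch.randint(1, num_players + 1, (1,)).item()
        keep = torch.randperm(num_players)[:k]
        masks[i, keep] = 1.0
    return masks

masks_cardinality = generate_mask_sketch(num_masks, num_players)
```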

Sanity Check

To make sure refactoring the training pipeline didn't change the semantics, I re-ran the experiments on the Pets dataset. Here are the learning curves I got:

[three learning curve plots from the Pets re-runs]

To save compute, I refrained from retraining on ImageNette and re-used the weights from my earlier runs. In both cases, evaluating Insertion, Deletion & Faithfulness reproduced the results of the paper for vit_base_patch16_224.

Please let me know if you like it or have additional suggestions.

- create experiment's result directories automatically
- import explainer from existing module
- dynamic state dict loading when initializing LRP
- add option to specify number of masks per rise batch
…e corresponding cell was executed regardless of dataset, leading to an exception when running it with pets
@JensRahnfeld mentioned this pull request Jul 22, 2024