SAMPa: Sharpness-aware Minimization Parallelized

This is the official code for SAMPa: Sharpness-aware Minimization Parallelized, accepted at NeurIPS 2024.

SAMPa introduces a fully parallelized version of sharpness-aware minimization (SAM) by allowing the two gradient computations to occur simultaneously:

$$
\begin{aligned}
\widetilde{x}_t &= x_t + \rho \frac{\nabla f(y_t)}{\lVert \nabla f(y_t) \rVert} \\
y_{t+1} &= x_t - \eta_t \nabla f(y_t) \\
x_{t+1} &= x_t - \eta_t (1-\lambda) \nabla f (\widetilde{x}_t) - \eta_t \lambda \nabla f(y_{t+1})
\end{aligned}
$$

where the gradients $\nabla f(\widetilde{x}_t)$ and $\nabla f(y_{t+1})$ are computed in parallel, significantly improving efficiency.
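
To make the update concrete, here is a minimal single-process sketch of one SAMPa step on a toy quadratic objective. The `grad_f`, `eta`, `rho`, and `lam` values are illustrative rather than taken from `train.py`, and the two final gradient evaluations run sequentially here where the actual implementation runs them in parallel:

```python
import torch

def grad_f(x):
    # Toy objective f(x) = ||x||^2 / 2, so that grad f(x) = x.
    return x.clone()

def sampa_step(x, y, eta=0.1, rho=0.1, lam=0.5):
    g_y = grad_f(y)                       # grad f(y_t)
    x_tilde = x + rho * g_y / g_y.norm()  # perturbed point (assumes a nonzero gradient)
    y_next = x - eta * g_y                # y_{t+1}
    # The two gradients below are mutually independent; train.py evaluates
    # them in parallel on two GPUs, while this sketch runs them in sequence.
    g_tilde = grad_f(x_tilde)             # grad f(x_tilde_t)
    g_y_next = grad_f(y_next)             # grad f(y_{t+1})
    x_next = x - eta * (1 - lam) * g_tilde - eta * lam * g_y_next
    return x_next, y_next

x = torch.tensor([2.0, -1.0])
y = x.clone()  # initialize y_0 = x_0
for _ in range(100):
    x, y = sampa_step(x, y)
print(x)  # approaches the minimizer at the origin, up to an O(eta * rho) neighborhood
```

The key point is that $\nabla f(\widetilde{x}_t)$ and $\nabla f(y_{t+1})$ do not depend on each other, which is exactly what makes the parallelization possible.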

SAMPa is one of the most efficient SAM variants: because its two gradient computations run simultaneously, each step costs roughly the wall-clock time of a single gradient evaluation.

Setup

```sh
conda create -n sampa python=3.8
conda activate sampa

# On GPU
conda install pytorch torchvision torchaudio pytorch-cuda=12.1 -c pytorch -c nvidia

pip install -r requirements.txt
```

Usage

This code implements SAMPa by parallelizing the two gradient computations across two GPUs. Specifically, in train.py, the process with global_rank 0 computes $\nabla f (\widetilde{x}_t)$ while the process with global_rank 1 computes $\nabla f(y_{t+1})$.
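
For intuition, here is a hedged sketch (not the repo's actual `train.py` logic) of how the two gradients can be split across two ranks with `torch.distributed`: each rank computes its pre-scaled gradient, and a single `all_reduce` produces the weighted sum used in the $x$-update. The toy `grad_f` and the `lam` value are assumptions for illustration.

```python
import torch
import torch.distributed as dist

def grad_f(x):
    # Toy stand-in for a real loss gradient: f(x) = ||x||^2 / 2.
    return x.clone()

def parallel_combined_grad(x_tilde, y_next, lam=0.5):
    # Each rank computes only its own gradient, pre-scaled by its weight:
    # rank 0 handles grad f(x_tilde_t), rank 1 handles grad f(y_{t+1}).
    if dist.get_rank() == 0:
        g = (1 - lam) * grad_f(x_tilde)
    else:
        g = lam * grad_f(y_next)
    # Summing across ranks yields, on every rank, the combination used in
    # the x-update: (1 - lam) * grad f(x_tilde_t) + lam * grad f(y_{t+1}).
    dist.all_reduce(g, op=dist.ReduceOp.SUM)
    return g

if __name__ == "__main__":
    dist.init_process_group("gloo")  # "nccl" when each rank owns a GPU
    g = parallel_combined_grad(torch.tensor([1.0, 2.0]), torch.tensor([0.5, 1.0]))
    if dist.get_rank() == 0:
        print(g)
    dist.destroy_process_group()
```

Launched with, e.g., torchrun --nproc_per_node=2, both processes hold the same combined gradient after the all_reduce, so the $x$-update can be applied identically on each rank.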

To train ResNet-56 on CIFAR-10 using SAMPa, use the following command:

```sh
CUDA_VISIBLE_DEVICES=0,1 python train.py --model resnet56 --dataset cifar10 --rho 0.1 --epochs 200
```
