The Senseiver

Implementation of "The Senseiver: attention-based global field reconstruction from sparse observations" in Pytorch. This model provides an easy and efficient way to train a data driven to create a mapping between sensor measurements an a global field, the model operations are decoupled from the size of the domain which allows the model to be trained with very large n-D arrays. The application shown in our paper considered simulations, but the method is general, and should be applicable to any other application.

Architecture

Parameters

Training parameters

data_name: str. Name of the dataset to be used for training.
num_sensors: int. number of sensors to train with
gpu_device: int. GPU to train on. MultiGPU support coming soon.
training_frames: int. Number of frames (time steps) to train the model with.
seed: int. If specified, it uses a seed to pick up sensors (if locations not specified) and frames.
consecutive_train: bool. Whether to use consecutive frames to train or chosen at random.
batch_frames: int. Number of frames per batch.
batch_pixels: int. Number of pixels per batch.
lr: float. Learning rate
accum_grads: int. Number of batches to accumulate to perform an optimizer step.

Model parameters

space_bands: int. Number of sine-cosine frequencies
enc_preproc_ch: int. Size of the linear layer that processes the inputs (sensor value+positons)
num_latents: int. Sequence size of the Q_in array.
enc_num_latent_channels: int. Channel dimension of the Q_in array.
num_layers: int. Number of model layers (depth).
num_cross_attention_heads: int. Number of processsing attention heads.
num_self_attention_layers_per_block: int. Number of self processing layers in each block.
dec_preproc_ch: int. Size of the linear layer that processes the latent space sent to the decoder. This can act as a bolttleneck and reduce significantly the number of parameters.
dec_num_latent_channels: int. Number of channels in the decoder.

IO

load_model_num: int. Load a model to test or to re-train on. Our library saves the models sequencially starting with 1.
test: bool. To test the model in the entire dataset.

Usage

python train.py --gpu 0 --data cylinder --num_sensors 8 --training_frames 50 --cons False --seed 123 --enc_preproc 16 --dec_num_latent_channels 16 --enc_num_latent_channels 16 --num_latents 256 --dec_preproc_ch 16 --test False

for num_frames in 100, 250, 500, 1000, 2000, 3000, 4000, 5000, 6000, 7000, 8000, 9000, 10000; do
    python train.py --gpu 1 --data pipe --num_sensors 6144 --cons False --seed 123 --enc_preproc 32 --dec_num_latent_channels 32 --enc_num_latent_channels 32 --num_latents 64 --dec_preproc_ch 32 --lr 1e-3 —training_frames $num_frames || break
done

Results

Dynamic Sensor Placement Enhances Generalization

We introduce a novel training strategy for the Senseiver model, which reconstructs a field by optimizing sensor positions through backpropagation. This enhancement significantly improves the model's ability to efficiently explore and understand the spatial domain. We developed a fully end-to-end differentiable framework called the differentiable walk strategy. This approach allows sensor locations to be trained alongside the parameters of the attention-based neural network. The differentiable walk strategy exploits optimally the training portion of the dataset, enabling sensors to dynamically adjust their positions during training. This results in enhanced spatial awareness and improved field reconstruction performance. Our algorithm includes corrections to prevent sensors from moving into invalid domain areas.

Usage:

python train_diff_walk.py --gpu 0 --data cylinder --num_sensors 8 --training_frames 50 --cons False --seed 123 --enc_preproc 16 --dec_num_latent_channels 16 --enc_num_latent_channels 16 --num_latents 256 --dec_preproc_ch 16 --test False

Code acknowledgements

We are grateful to the developers of the many software packages used throughout this project including, but not limited, to PyTorch, Numpy, Vedo, Matplotlib, PyTorch Lightning, and krasserm's implementation of the perceiver models.

Citations

If you use our code for your own research, we would be grateful if you cite our publications NMI, MLST

@article{Senseiver,
title = "Development of the Senseiver for efficient field reconstruction from sparse observations",
author = "Santos, Javier E and Fox, Zachary R and Mohan, Arvind and O’Malley, Daniel and Viswanathan, Hari and Lubbers, Nicholas",
journal = "Nat Mach Intell",
year = "2023",
doi = "https://link.springer.com/article/10.1007/s11242-021-01617-y",
url = "https://www.nature.com/articles/s42256-023-00746-x"
}

@article{DynamicSensorPlacement,
title = "Journey over Destination: Dynamic Sensor Placement Enhances Generalization",
author = "Marcato, Agnese and Guiltinan, Eric and Viswanathan, Hari and O'Malley, Daniel and Lubbers, Nicholas and E. Santos, Javier",
journal = "Machine Learning: Science and Technology",
year= "2024",
doi = "https://iopscience.iop.org/article/10.1088/2632-2153/ad4e06"
}

Data availability

The data is available at 10.5281/zenodo.8290039.

Name		Name	Last commit message	Last commit date
Latest commit History 18 Commits
Data		Data
.gitignore		.gitignore
LICENSE.md		LICENSE.md
README.md		README.md
arq.png		arq.png
dataloaders.py		dataloaders.py
datasets.py		datasets.py
diff_walk.png		diff_walk.png
functions_diff_walk.py		functions_diff_walk.py
model.py		model.py
network_light.py		network_light.py
plot.py		plot.py
positional.py		positional.py
results.png		results.png
s_parser.py		s_parser.py
sensor_loc.py		sensor_loc.py
train.py		train.py
train_diff_walk.py		train_diff_walk.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

The Senseiver

Architecture

Parameters

Training parameters

Model parameters

IO

Usage

Results

Dynamic Sensor Placement Enhances Generalization

Code acknowledgements

Citations

Data availability

About

Releases 1

Packages

Contributors 3

Languages

License

OrchardLANL/Senseiver

Folders and files

Latest commit

History

Repository files navigation

The Senseiver

Architecture

Parameters

Training parameters

Model parameters

IO

Usage

Results

Dynamic Sensor Placement Enhances Generalization

Code acknowledgements

Citations

Data availability

About

Resources

License

Stars

Watchers

Forks

Releases 1

Packages 0

Contributors 3

Languages

Packages