graphcast hub #977

julienchastang · 2024-11-14T22:06:30Z

ana-v-espinoza · 2024-12-20T17:30:49Z

jupyter-images/fall-2024/tm/Dockerfile

@@ -1,7 +1,7 @@
 # Heavily borrowed from docker-stacks/minimal-notebook/
 # https://github.com/jupyter/docker-stacks/blob/main/minimal-notebook/Dockerfile

-ARG BASE_CONTAINER=jupyter/minimal-notebook
+ARG BASE_CONTAINER=jupyter/tensorflow-notebook


Why the switch from minimal to the tf image?

It is the only way I can get the GPU to work.

What are you trying? The image that's live right now uses the minimal-notebook and thomas and I can access the GPU just fine.

How did you get that to work beyond the usual, i.e., the newly defined environment.yml and jupyterhub_gpu.yaml. Did you have to install anything special CUDA/GPU-wise?

My understanding is that the current running environment was modified "in-place" for faster experimentation so not everything may have been captured in the Dockerfile, environment.yml, etc. I guess that is what I am asking.

You can find the most recent Dockerfile/environment combo in the "ana" tmux session on the docker-gpu machine. Your most recent commit has the correct environment file, however using the minimal-notebook as the base still works.

To get the GPU to work there is nothing more needed past the "normal" things--i.e. requesting the additional resource via jupyterhub_gpu.yaml.

I've found that the important parts of getting these things to work are making sure the packages you install (via conda or pip) are compatible with that shown when doing an nvidia-smi on JS2.

For example, in this case I tell pip to look for torch and associated packages from the cuda 12.1 index with this line:
- --extra-index-url https://download.pytorch.org/whl/cu121

We can get together to discuss this in-person after the holidays?

julienchastang force-pushed the tm24f branch 4 times, most recently from a95faef to b35747e Compare November 15, 2024 17:41

graphcast hub

137c70e

julienchastang force-pushed the tm24f branch from b35747e to 137c70e Compare November 15, 2024 18:11

ana-v-espinoza and others added 7 commits November 20, 2024 16:23

Add some niceties; conda-->mamba

5bdaf33

Modify resource limits

16dd45a

additional niceties

1b34df1

Revert to standard env before changing

95df6f6

Env: ai-models-graphcast and deps

a82fafd

Add graphcast dep

cf9feb6

earth2mip

7364b79

ana-v-espinoza reviewed Dec 20, 2024

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

graphcast hub #977

graphcast hub #977

julienchastang commented Nov 14, 2024

ana-v-espinoza Dec 20, 2024

julienchastang Dec 20, 2024

ana-v-espinoza Dec 20, 2024

julienchastang Dec 20, 2024

julienchastang Dec 20, 2024

ana-v-espinoza Dec 23, 2024

graphcast hub #977

Are you sure you want to change the base?

graphcast hub #977

Conversation

julienchastang commented Nov 14, 2024

ana-v-espinoza Dec 20, 2024

Choose a reason for hiding this comment

julienchastang Dec 20, 2024

Choose a reason for hiding this comment

ana-v-espinoza Dec 20, 2024

Choose a reason for hiding this comment

julienchastang Dec 20, 2024

Choose a reason for hiding this comment

julienchastang Dec 20, 2024

Choose a reason for hiding this comment

ana-v-espinoza Dec 23, 2024

Choose a reason for hiding this comment