Update CUDA to 11.8 #123

vanHavel · 2023-11-27T16:40:13Z

Hi,

first of all thank you for providing this docker image, it is very useful.

I have created a fork where I upgraded CUDA to 11.8 to use with newer versions of Tensorflow. The related PR is here: #122

There are a few small caveats listed in the description of the PR. Nevertheless, I hope it will be a good start for folks looking to use the image with newer CUDA.

benz0li · 2023-11-29T17:05:54Z

@vanHavel You may be interested in b-data's/my GPU accelerated JupyterLab docker stacks.

(currently) Based on nvidia/cuda:11.8.0-cudnn8-devel-ubuntu22.04; including code-server – aka VS Code in the browser.

mathematicalmichael · 2023-12-07T17:46:01Z

@benz0li

thanks for the links. I took a look and have a question for you:

❗ Always mount the user's entire home directory.
Mounting a subfolder prevents the container from starting.[1](https://github.com/b-data/jupyterlab-python-docker-stack/blob/main/CUDA.md#user-content-fn-1-fca8fa3aa93e9d6a945115ed9b64e882)

this is different than how the (vanilla) jupyter docker-stacks images work, as they default to mounting ~/work as a persistent storage with a docker volume (for things like jupyterhub).
So I'm wondering, why this design change? From my perspective, it makes user-experience sense in that things like dotfiles and conda envs will persist... but from the other hand, part of the benefit of the ephemeral compute is that you / your users can totally "mess up" things and just start fresh with a new container. A lot of the ways in which one can "mess up" are directly related to ~/.local, ~/.conda and the like.
So yes, it's convenient to not lose settings such as ~/.ssh and VSCode's settings ~/.vscode, but this tracking of state can lead to one being unable to recover a working compute environment.

It's been years since I've thought about this decision, but many years ago I was managing a jupyterhub for hundreds of students and this discussion about the persistence of ~/work vs ~/ was a topic of discussion then, (as many students were having their first python experience on this, we were designing for robustness and making some convenience compromises).

So I was wondering if something's changed or if you had some thoughts on this.

+1 for the vscode inclusion, I do that as well for my images which have their base images taken from here.

benz0li · 2023-12-07T18:38:13Z

@benz0li

thanks for the links. I took a look and have a question for you:
❗ Always mount the user's entire home directory.
Mounting a subfolder prevents the container from starting.[1](https://github.com/b-data/jupyterlab-python-docker-stack/blob/main/CUDA.md#user-content-fn-1-fca8fa3aa93e9d6a945115ed9b64e882)
this is different than how the (vanilla) jupyter docker-stacks images work, as they default to mounting ~/work as a persistent storage with a docker volume (for things like jupyterhub). So I'm wondering, why this design change?

See jupyter/docker-stacks#1478.

From my perspective, it makes user-experience sense in that things like dotfiles and conda envs will persist...

There is no Conda in b-data's/my images.

but from the other hand, part of the benefit of the ephemeral compute is that you / your users can totally "mess up" things and just start fresh with a new container.

You can start fresh with a new container with b-data's/my images, too.

A lot of the ways in which one can "mess up" are directly related to ~/.local, ~/.conda and the like. So yes, it's convenient to not lose settings such as ~/.ssh and VSCode's settings ~/.vscode, but this tracking of state can lead to one being unable to recover a working compute environment.

True. (If a user messes up, the JupyterHub admin must step in)

It's been years since I've thought about this decision, but many years ago I was managing a jupyterhub for hundreds of students and this discussion about the persistence of ~/work vs ~/ was a topic of discussion then, (as many students were having their first python experience on this, we were designing for robustness and making some convenience compromises).

IMHO users should have all the freedom in their home directory. E.g. in the case of b-data's/my images even installing Miniconda or Micromamba at user level – persistently.

So I was wondering if something's changed or if you had some thoughts on this.

For my thoughts, see b-data/jupyterlab-python-docker-stack#1 (comment).

There are startup hooks in place. Especially /usr/local/bin/before-notebook.d/10-init.sh which allows mounting the same home directory with all of b-data's JupyterLab docker stacks – repeatedly.
🔬 Demo environment: https://demo.jupyter.b-data.ch. Login with GitHub account.
ℹ️ See Notes for all the differences to the (vanilla) jupyter docker-stacks images.
👉 I.e. tweaks, settings, etc. that can be applied at user-level for customisation.

By allowing users to persistently install Python packages at user level b-data's/my docker stacks do not require separate images for simply installing python packages like TensorFlow or PyTorch.

b-data's/my images also support Docker/Podman in rootless mode. I have opened a pull request to "backport" this feature to the (vanilla) jupyter docker-stacks images: jupyter/docker-stacks#2039

benz0li · 2023-12-07T18:45:00Z

Furthermore there are no GPU accelerated (vanilla) jupyter docker-stacks images and this repository misses some essential features:

Struggling to switch users and maintain full cuda support #108 (comment)
Error when running your current image with host drivers cuda 12.1 : "Could not load dynamic library 'libnvinfer.so.7'" #117 (comment)
PyTorch 2 needs CUDA 11.7+ #119 (comment)

Our dear colleagues from the Rocker project have a different problem:

Nvidia devel tools get removed from rocker/cuda-based images rocker-org/rocker-versioned2#736

benz0li · 2023-12-07T18:59:39Z

+1 for the vscode inclusion, I do that as well for my images which have their base images taken from here.

Not VS Code but code-server – aka VS Code in the browser plus some additional features.
ℹ️ There are b-data's/my Data Science Dev Containers for use with 'VS Code'/Codespaces.

mathematicalmichael · 2023-12-09T19:01:36Z

@benz0li thank you for the detailed response. I'm reading through the links you provided to issue discussions and it's clear you've put a lot of thought and work into the design changes. The approach to optionally bind-mounting home / having a population script if empty... is not something I considered.

You solved for one of the most painful UX: the state preservation of ~/

(and yes, my mistake, I did mean code-server).

I think you've aimed at a "one image for everything" design, whereas yes, I believe the design approach before was that users run multiple servers in jupyterhub for their different images, which provides possibly "too much" isolation / a lot of disk consumption on the host OS running docker.

And the fact you got it working with rootless docker... wow, that gives me a lot of trust in your technical abilities. Props. That's quite a painful exercise (I haven't tried it here but have migrated other images before opting out of podman entirely).

Props also for the busy script during tmux/screen. Good idea.

So I think you've made some great choices. As you mentioned, the tradeoff for more statefulness comes at the potential need for more admin-interference. So the question becomes: are you deploying for a fleet of inexperienced users (jupyterhub got its start as a product for Berkeley's students), or power-users? For GPU-enabled images, I think the answer leans far more towards the latter.

I appreciate you taking the time to explain all that.
Would you please be so kind as to whitelist me for your demo server? I'm going to spend some time playing around with your set up on my servers as well, but I'm really pressed for time lately.

One random personal style question:
Why the switch to zsh? Would it be easy to default to bash instead?
I'm one of those still-defaults-my-mac-to-bash people, mostly because of its presence on random servers I need to configure, and zsh out of the box does things like ruin pip install package[options] syntax.

benz0li · 2023-12-09T21:11:17Z

are you deploying for a fleet of inexperienced users (jupyterhub got its start as a product for Berkeley's students), or power-users?

My images are intended for power users. A user [of b-data's/my images] should have more than just basic Linux knowledge.

Why the switch to zsh?

I simply like Zsh; further enhanced with

Framework: Oh My Zsh
Theme: Powerlevel10k
Font: MesloLGS NF

Would it be easy to default to bash instead?

Try starting the image with -e SHELL=/usr/bin/bash.

benz0li · 2023-12-09T21:20:56Z

Would you please be so kind as to whitelist me for your demo server?

Done. I have whitelisted your account (@mathematicalmichael) for https://demo.cuda.jupyter.b-data.ch.

(Anyone with a GitHub account may log in at https://demo.jupyter.b-data.ch)

Sometimes it does not start at first try. Simply try again...

benz0li · 2023-12-09T21:24:33Z

I'm one of those still-defaults-my-mac-to-bash people, mostly because of its presence on random servers I need to configure, and zsh out of the box does things like ruin pip install package[options] syntax.

@mathematicalmichael Can you give an example that does not work with my JupyterLab docker stacks?

mathematicalmichael · 2023-12-09T22:27:22Z

@benz0li thank you!

with respect to the zsh question, that was just memory from when macOS switched the default shell, and I found that without further configuring it, the [] characters were being interpreted by zsh instead of pip

e.g., in your jupyterhub, pip install hiplot[dev] fails; I have to use quotes: pip install 'hiplot[dev]'. This is practically the only thing I remember about zsh when I first tried it.. that optional python dependencies would fail (and as a package developer, I often rely on these), and rather than learn how to configure a new shell, I stuck to bash.

thanks for the instruction on how to override the default config, and that does make sense that you're targeting power-users. The original docker-stacks (in my impression) do not necessarily assume the users are comfortable with linux, and I believe that's why ~/ was not persistent (as annoying as that is to a power-user).

benz0li · 2023-12-09T22:46:30Z

I found that without further configuring it, the [] characters were being interpreted by zsh instead of pip

@mathematicalmichael

zsh uses square brackets for globbing / pattern matching.

[...]

If you want to disable globbing for the pip command permanently, you can do so by adding this to your ~/.zshrc:
alias pip='noglob pip'

– https://stackoverflow.com/a/30539963

benz0li · 2023-12-09T22:48:49Z

@mathematicalmichael

[...]

To get the Bash behavior in Zsh, add this to your ~/.zshrc file:
unsetopt NOMATCH
[...]

– https://superuser.com/a/1606090

ChristophSchranz · 2023-12-16T07:58:02Z

Hi @vanHavel ,
thanks for your issue and PR. It was merged into the main branch (see #124 that builds upon your PR #122).

ChristophSchranz · 2023-12-16T08:04:00Z

Also thanks to @benz0li for the detailed explanations!

benz0li · 2023-12-17T19:40:53Z

Why the switch to zsh?

Addendum: I could not get bash working properly with screen/tmux, i.e. PATH was not updated consistently; and PATH was updated differently in JupyterLab and code-server.

This is due to whether the shell is a 'Login shell' or not. Because

With JupyterHub
- In JupyterLab Terminal: The shell is a 'Login shell'
- In code-server Terminal: The shell is not a 'Login shell'
Without JupyterHub
- Both: The shell is not a 'Login shell'

👉 Using zsh, PATH is updated consistently for all configurations.

@mathematicalmichael With b-data/jupyterlab-r-docker-stack@5e2a258...6080796, bash now also updates PATH consistently for all configurations.

mathematicalmichael · 2023-12-20T18:34:25Z

@benz0li I think this is bc zsh just reads ~/.zshrc regardless of interactivity, but bash will choose between ~/.bashrc (non-login), ~/.bash_profile / ~/.profile (interactive shell).

Thanks for digging into that, wasn't aware of how Jupyterhub impacts any of that.

benz0li · 2024-03-02T10:41:56Z

@mathematicalmichael ℹ️ I found a way to enable bind mounting a subfolder of the home directory for arbitrary $NB_USERs and thus resolve b-data/jupyterlab-python-docker-stack#1.

Users can now choose whether to (bind) mount the entire home directory or just a subfolder within it.

ChristophSchranz closed this as completed Dec 16, 2023

benz0li mentioned this issue Jun 4, 2024

cuda-nvcc missing again pangeo-data/pangeo-docker-images#438

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Update CUDA to 11.8 #123

Update CUDA to 11.8 #123

vanHavel commented Nov 27, 2023

benz0li commented Nov 29, 2023 •

edited

Loading

mathematicalmichael commented Dec 7, 2023 •

edited

Loading

benz0li commented Dec 7, 2023 •

edited

Loading

benz0li commented Dec 7, 2023 •

edited

Loading

benz0li commented Dec 7, 2023 •

edited

Loading

mathematicalmichael commented Dec 9, 2023 •

edited

Loading

benz0li commented Dec 9, 2023 •

edited

Loading

benz0li commented Dec 9, 2023 •

edited

Loading

benz0li commented Dec 9, 2023

mathematicalmichael commented Dec 9, 2023

benz0li commented Dec 9, 2023

benz0li commented Dec 9, 2023

ChristophSchranz commented Dec 16, 2023

ChristophSchranz commented Dec 16, 2023

benz0li commented Dec 17, 2023

mathematicalmichael commented Dec 20, 2023

benz0li commented Mar 2, 2024

Update CUDA to 11.8 #123

Update CUDA to 11.8 #123

Comments

vanHavel commented Nov 27, 2023

benz0li commented Nov 29, 2023 • edited Loading

mathematicalmichael commented Dec 7, 2023 • edited Loading

benz0li commented Dec 7, 2023 • edited Loading

benz0li commented Dec 7, 2023 • edited Loading

benz0li commented Dec 7, 2023 • edited Loading

mathematicalmichael commented Dec 9, 2023 • edited Loading

benz0li commented Dec 9, 2023 • edited Loading

benz0li commented Dec 9, 2023 • edited Loading

benz0li commented Dec 9, 2023

mathematicalmichael commented Dec 9, 2023

benz0li commented Dec 9, 2023

benz0li commented Dec 9, 2023

ChristophSchranz commented Dec 16, 2023

ChristophSchranz commented Dec 16, 2023

benz0li commented Dec 17, 2023

mathematicalmichael commented Dec 20, 2023

benz0li commented Mar 2, 2024

benz0li commented Nov 29, 2023 •

edited

Loading

mathematicalmichael commented Dec 7, 2023 •

edited

Loading

benz0li commented Dec 7, 2023 •

edited

Loading

benz0li commented Dec 7, 2023 •

edited

Loading

benz0li commented Dec 7, 2023 •

edited

Loading

mathematicalmichael commented Dec 9, 2023 •

edited

Loading

benz0li commented Dec 9, 2023 •

edited

Loading

benz0li commented Dec 9, 2023 •

edited

Loading