
WIP: Feature: fork kernel #410

Open · wants to merge 7 commits into main

Conversation

maartenbreddels
Contributor

Issue

For dashboarding, it is useful to have a kernel with its final state ready instantly. If, for instance, it takes ~5 minutes to execute a notebook, that does not make for a good user experience.

Solution

Execute a notebook, and after it is done, consider it a template, and give each user a fork() of this template process.

Implementation

The kernel gets a new message type, simply called fork. Combining fork() with asyncio/tornado's ioloop is a no-go; it is essentially unsupported. To avoid this, we stop the current ioloop and use the ioloop object to communicate the fork request up the call stack. The kernel app then checks whether a fork was requested and, if so, forks:

  • The parent resumes the ioloop.
  • The child cleans up the ioloop and starts listening on new ports.
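The fork dance above can be sketched as follows. This is a minimal illustration, not the PR's code: it assumes the ioloop has already been stopped, forks outside any running loop, and has the child signal readiness over a pipe at the point where it would rebind its ZMQ sockets.

```python
import os

def fork_kernel_sketch():
    """Minimal sketch of the parent/child split described above."""
    read_fd, write_fd = os.pipe()
    pid = os.fork()
    if pid == 0:
        # Child: inherits a copy-on-write snapshot of the parent's state.
        # Here it would clean up the inherited ioloop and bind new ports.
        os.close(read_fd)
        os.write(write_fd, b"child-ready")
        os._exit(0)
    # Parent: here it would resume its ioloop and reply with the fork id.
    os.close(write_fd)
    ack = os.read(read_fd, 32)
    os.close(read_fd)
    os.waitpid(pid, 0)
    return pid, ack
```

The key constraint this encodes is that no event loop may be running on either side of the fork() call itself.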

Usage (although it does not work yet)

It also needs this PR: jupyter/jupyter_client#441

Shell1:

$ python -m ipykernel_launcher -f conn.json --debug

Shell2:

$ python fork_kernel.py ./conn.json # this will print the result of the fork
DEBUG:traitlets:Loading connection file ./conn.json
DEBUG:traitlets:connecting shell channel to tcp://127.0.0.1:56573
DEBUG:traitlets:Connecting to: tcp://127.0.0.1:56573
DEBUG:traitlets:connecting iopub channel to tcp://127.0.0.1:56574
DEBUG:traitlets:Connecting to: tcp://127.0.0.1:56574
DEBUG:traitlets:connecting stdin channel to tcp://127.0.0.1:56575
DEBUG:traitlets:Connecting to: tcp://127.0.0.1:56575
DEBUG:traitlets:connecting heartbeat channel to tcp://127.0.0.1:56577
{'header': {'msg_id': '81e4a9b2-155b535e6541ce736c126d47', 'msg_type': 'execute_reply', 'username': 'maartenbreddels', 'session': 'b5b03194-ba9276d2bbc5a711d5e5d44c', 'date': datetime.datetime(2019, 5, 28, 21, 39, 2, 604627, tzinfo=tzutc()), 'version': '5.3'}, 'msg_id': '81e4a9b2-155b535e6541ce736c126d47', 'msg_type': 'execute_reply', 'parent_header': {'msg_id': '73914843-0da368d574d8c114b8dfd531', 'msg_type': 'fork', 'username': 'maartenbreddels', 'session': '9c61f20c-d8a4e43f945eaf915031a3d4', 'date': datetime.datetime(2019, 5, 28, 21, 39, 2, 596944, tzinfo=tzutc()), 'version': '5.3'}, 'metadata': {'status': 'ok'}, 'content': {'status': 'ok', 'fork_id': 1042}, 'buffers': []}
$ python fork_kernel.py  /Users/maartenbreddels/Library/Jupyter/runtime/conn_fork.json
DEBUG:traitlets:Loading connection file /Users/maartenbreddels/Library/Jupyter/runtime/conn_fork.json
DEBUG:traitlets:connecting shell channel to tcp://127.0.0.1:54452
DEBUG:traitlets:Connecting to: tcp://127.0.0.1:54452
DEBUG:traitlets:connecting iopub channel to tcp://127.0.0.1:54455
DEBUG:traitlets:Connecting to: tcp://127.0.0.1:54455
DEBUG:traitlets:connecting stdin channel to tcp://127.0.0.1:54453
DEBUG:traitlets:Connecting to: tcp://127.0.0.1:54453
DEBUG:traitlets:connecting heartbeat channel to tcp://127.0.0.1:54457
^CTraceback (most recent call last):
  File "fork_kernel.py", line 12, in <module>
    client.wait_for_ready(100)
  File "/Users/maartenbreddels/src/jupyter/jupyter_client/jupyter_client/blocking/client.py", line 111, in wait_for_ready
    msg = self.shell_channel.get_msg(block=True, timeout=1)
  File "/Users/maartenbreddels/src/jupyter/jupyter_client/jupyter_client/blocking/channels.py", line 50, in get_msg
    ready = self.socket.poll(timeout)
  File "/Users/maartenbreddels/miniconda3/envs/kernel_fork/lib/python3.7/site-packages/zmq/sugar/socket.py", line 697, in poll
    evts = dict(p.poll(timeout))
  File "/Users/maartenbreddels/miniconda3/envs/kernel_fork/lib/python3.7/site-packages/zmq/sugar/poll.py", line 99, in poll
    return zmq_poll(self.sockets, timeout=timeout)
  File "zmq/backend/cython/_poll.pyx", line 123, in zmq.backend.cython._poll.zmq_poll
  File "zmq/backend/cython/checkrc.pxd", line 12, in zmq.backend.cython.checkrc._check_rc
KeyboardInterrupt

The last command never produces any output, so I Ctrl-C'ed it; the resulting stack trace is included above.

Problem

It doesn't work 😄. The forked child process doesn't seem to be reachable via zmq, and no debug info is printed, so I'm giving up here and hope that someone else has good ideas or wants to pick this up.

Side uses

Forking the kernel could also be used for 'undoing' the execution of a cell/line, by forking after each execute(..). Note that since fork uses copy-on-write, this will be fairly cheap in resource usage.
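The undo-by-fork idea can be sketched like this. The function name and pipe-based handoff are hypothetical, purely for illustration: the child keeps a copy-on-write snapshot of the state taken before the "cell" runs, and the parent can read that snapshot back to undo.

```python
import os
import pickle

def execute_with_undo(state, mutate):
    """Fork before mutating; recover the pre-mutation state from the child."""
    read_fd, write_fd = os.pipe()
    pid = os.fork()
    if pid == 0:
        # Child: its copy of `state` is untouched by the parent's mutation.
        os.close(read_fd)
        os.write(write_fd, pickle.dumps(state))
        os._exit(0)
    os.close(write_fd)
    mutate(state)  # the parent executes the "cell"
    snapshot = pickle.loads(os.read(read_fd, 65536))  # "undo" data
    os.close(read_fd)
    os.waitpid(pid, 0)
    return state, snapshot
```

A real implementation would keep the forked child alive and switch clients over to it instead of shipping pickled state, but the copy-on-write cheapness is the same.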

@LucianaMarques

Hey @maartenbreddels ! I'm interested in helping you with this.

I don't quite get what the strategy is here... Decreasing run time can indeed be done with more than one process, but you did mention this:

Execute a notebook, and after it is done, consider it a template, and give each user a fork() of this template process.

If you are waiting for it to be done, then we are waiting for the whole run time, right? Could you please be more specific about this idea?

It does sound interesting, though! :)

@edisongustavo
Contributor

Hello @maartenbreddels, I'm very interested in this feature.

Is there a way I can help, so we can implement this together?

@maartenbreddels
Contributor Author

I currently don't have much time to work on this, but I'm happy for you to take it over a bit, or test it out.

@edisongustavo
Contributor

edisongustavo commented Aug 25, 2019

Hey @maartenbreddels, I believe I found out why the child kernel does not receive the messages.

That's due to the check_pid flag of Session in the send() method:

class Session:
    def send(...):
        ...
        if self.check_pid and not os.getpid() == self.pid:
            get_logger().warning("WARNING: attempted to send message from fork\n%s",
                msg
            )
            return        

See here
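A toy reimplementation of that guard makes the failure mode concrete. The attribute names mirror jupyter_client's Session, but the class body is a sketch, not the real code: the recorded pid no longer matches after fork, so send() drops every message without raising.

```python
import os

class ToySession:
    """Toy stand-in for the pid guard in jupyter_client's Session.send()."""
    def __init__(self):
        self.check_pid = True
        self.pid = os.getpid()  # recorded at construction, i.e. pre-fork

    def send(self, msg):
        # In a forked child os.getpid() != self.pid, so the message is
        # silently discarded unless check_pid is disabled (or pid updated).
        if self.check_pid and os.getpid() != self.pid:
            return None
        return msg

session = ToySession()
session.pid -= 1           # simulate running in a forked child
dropped = session.send("hello")   # discarded: guard fires
session.check_pid = False  # the workaround described in this comment
sent = session.send("hello")
```

An alternative to disabling the check would be to update session.pid to the child's pid right after the fork.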

If I set check_pid = False, then the following works (this is a modification of your fork_kernel.py script):

import sys
import logging

from jupyter_client import BlockingKernelClient

logging.basicConfig()

connection_file = sys.argv[1]

client = BlockingKernelClient()
client.log.setLevel('DEBUG')
client.load_connection_file(connection_file)
client.start_channels()
client.wait_for_ready(100)

obj = client.fork()
msg = client.shell_channel.get_msg(timeout=100)

print("=========================================")
print(" Client after fork")
print("=========================================")

client_after_fork = BlockingKernelClient()
client_after_fork.log.setLevel('DEBUG')
client_after_fork.load_connection_info(msg["content"]["conn"])
client_after_fork.start_channels()
client_after_fork.wait_for_ready(100)

client_after_fork.execute(code="foo = 5+5", user_expressions={"my-user-expresssion": "foo"})
msg = client_after_fork.get_shell_msg(timeout=100)
print("Value of last execution is {}".format(msg["content"]["user_expressions"]["my-user-expresssion"]["data"]["text/plain"]))

client_after_fork.execute(code="bar = 1+2", user_expressions={"my-user-expresssion": "foo,bar"})
msg = client_after_fork.get_shell_msg(timeout=100)
print("Value of last execution is {}".format(msg["content"]["user_expressions"]["my-user-expresssion"]["data"]["text/plain"]))

Will print to the console:

DEBUG:traitlets:Loading connection file conn.json
DEBUG:traitlets:connecting shell channel to tcp://127.0.0.1:56573
DEBUG:traitlets:Connecting to: tcp://127.0.0.1:56573
DEBUG:traitlets:connecting iopub channel to tcp://127.0.0.1:56574
DEBUG:traitlets:Connecting to: tcp://127.0.0.1:56574
DEBUG:traitlets:connecting stdin channel to tcp://127.0.0.1:56575
DEBUG:traitlets:Connecting to: tcp://127.0.0.1:56575
DEBUG:traitlets:connecting heartbeat channel to tcp://127.0.0.1:56577
DEBUG:traitlets:connecting shell channel to tcp://127.0.0.1:48419
DEBUG:traitlets:Connecting to: tcp://127.0.0.1:48419
DEBUG:traitlets:connecting iopub channel to tcp://127.0.0.1:52073
DEBUG:traitlets:Connecting to: tcp://127.0.0.1:52073
DEBUG:traitlets:connecting stdin channel to tcp://127.0.0.1:50093
DEBUG:traitlets:Connecting to: tcp://127.0.0.1:50093
DEBUG:traitlets:connecting heartbeat channel to tcp://127.0.0.1:60527
=========================================
 Client after fork
=========================================
Value of last execution is 10
Value of last execution is (10, 3)

@edisongustavo
Contributor

edisongustavo commented Aug 25, 2019

I've also tried connecting to a "forked" kernel and it does work!

@maartenbreddels
Contributor Author

maartenbreddels commented Aug 25, 2019 via email

@edisongustavo
Contributor

edisongustavo commented Aug 25, 2019

Testing shutdown()

I was testing whether the shutdown() message would work, and it doesn't out of the box.

I found out that the shell variable must also be updated, so I modified the block where you reset the kernel to this:

# NOTE: we actually start a new kernel, but once this works
# we can actually think about reusing the kernel object
self.kernel_class.clear_instance()
self.kernel.shell_class.clear_instance()

Then shutdown() does work correctly.
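The fix works because both the kernel and its interactive shell are cached as per-process singletons; after fork the child must clear both caches so fresh instances get built. A toy version of the singleton pattern involved (this is not the actual traitlets SingletonConfigurable code):

```python
class ToySingleton:
    """Toy stand-in for a traitlets-style per-process singleton."""
    _instance = None

    @classmethod
    def instance(cls):
        # Return the cached instance, creating it on first use.
        if cls._instance is None:
            cls._instance = cls()
        return cls._instance

    @classmethod
    def clear_instance(cls):
        # Forget the cached instance so the next instance() call builds a
        # fresh one -- what the child must do for the kernel AND the shell.
        cls._instance = None
```

Clearing only the kernel's cache leaves the old shell object wired to pre-fork state, which is why both clear_instance() calls are needed.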

@edisongustavo
Contributor

I'm now working on getting the kernel object reused. I've forked the repo here: https://github.com/edisongustavo/ipykernel/. I believe it will take some time, since I can only work on this in my free time.

But I'm getting there!

edisongustavo and others added 7 commits September 2, 2019 22:24
When subprocesses are spawned with `stdout = open('/dev/null')`, PyCharm is not able to attach to them.

This is especially painful when debugging the tests in test_kernel(), where the `kernel()` methods spawn kernels in subprocesses if required.
This is the directory created by PyCharm