Skip to content

Commit

Permalink
Backport: gh-82300: Add track parameter to multiprocessing.shared_memory
Browse files Browse the repository at this point in the history
Summary:
Fix resource tracker destroys shared memory segments when other processes should still have valid access

Upstream issue: python/cpython#82300
Upstream PR: python/cpython@81ee026

Reviewed By: itamaro

Differential Revision: D58048029

fbshipit-source-id: 087e18caea2a106a1d4d30eec102fcf2c919cb7b
  • Loading branch information
Huy Le authored and facebook-github-bot committed Jun 2, 2024
1 parent 7f4c505 commit 7908605
Show file tree
Hide file tree
Showing 3 changed files with 106 additions and 23 deletions.
52 changes: 36 additions & 16 deletions Doc/library/multiprocessing.shared_memory.rst
Original file line number Diff line number Diff line change
Expand Up @@ -36,7 +36,7 @@ or other communications requiring the serialization/deserialization and
copying of data.


.. class:: SharedMemory(name=None, create=False, size=0)
.. class:: SharedMemory(name=None, create=False, size=0, *, track=True)

Create an instance of the :class:`!SharedMemory` class for either
creating a new shared memory block or attaching to an existing shared
Expand Down Expand Up @@ -70,26 +70,46 @@ copying of data.
When attaching to an existing shared memory block,
the *size* parameter is ignored.

:param bool track:
When enabled, registers the shared memory block with a resource
tracker process on platforms where the OS does not do this automatically.
The resource tracker ensures proper cleanup of the shared memory even
if all other processes with access to the memory exit without doing so.
Python processes created from a common ancestor using :mod:`multiprocessing`
facilities share a single resource tracker process, and the lifetime of
shared memory segments is handled automatically among these processes.
Python processes created in any other way will receive their own
resource tracker when accessing shared memory with *track* enabled.
This will cause the shared memory to be deleted by the resource tracker
of the first process that terminates.
To avoid this issue, users of :mod:`subprocess` or standalone Python
processes should set *track* to ``False`` when there is already another
process in place that does the bookkeeping.
*track* is ignored on Windows, which has its own tracking and
automatically deletes shared memory when all handles to it have been closed.

.. versionchanged:: 3.13 Added *track* parameter.

.. method:: close()

Close access to the shared memory from this instance. In order to
ensure proper cleanup of resources, all instances should call
:meth:`close` once the instance is no longer needed. Note that calling
:meth:`!close` does not cause the shared memory block itself to be
destroyed.
Closes the file descriptor/handle to the shared memory from this
instance. :meth:`close()` should be called once access to the shared
memory block from this instance is no longer needed. Depending
on operating system, the underlying memory may or may not be freed
even if all handles to it have been closed. To ensure proper cleanup,
use the :meth:`unlink()` method.

.. method:: unlink()

Request that the underlying shared memory block be destroyed. In
order to ensure proper cleanup of resources, :meth:`unlink` should be
called once (and only once) across all processes which have need
for the shared memory block. After requesting its destruction, a
shared memory block may or may not be immediately destroyed and
this behavior may differ across platforms. Attempts to access data
inside the shared memory block after :meth:`!unlink` has been called may
result in memory access errors. Note: the last process relinquishing
its hold on a shared memory block may call :meth:`!unlink` and
:meth:`close` in either order.
Deletes the underlying shared memory block. This should be called only
once per shared memory block regardless of the number of handles to it,
even in other processes.
:meth:`unlink()` and :meth:`close()` can be called in any order, but
trying to access data inside a shared memory block after :meth:`unlink()`
may result in memory access errors, depending on platform.

This method has no effect on Windows, where the only way to delete a
shared memory block is to close all handles.

.. attribute:: buf

Expand Down
24 changes: 17 additions & 7 deletions Lib/multiprocessing/shared_memory.py
Original file line number Diff line number Diff line change
Expand Up @@ -71,8 +71,9 @@ class SharedMemory:
_flags = os.O_RDWR
_mode = 0o600
_prepend_leading_slash = True if _USE_POSIX else False
_track = True

def __init__(self, name=None, create=False, size=0):
def __init__(self, name=None, create=False, size=0, *, track=True):
if not size >= 0:
raise ValueError("'size' must be a positive integer")
if create:
Expand All @@ -82,6 +83,7 @@ def __init__(self, name=None, create=False, size=0):
if name is None and not self._flags & os.O_EXCL:
raise ValueError("'name' can only be None if create=True")

self._track = track
if _USE_POSIX:

# POSIX Shared Memory
Expand Down Expand Up @@ -116,8 +118,8 @@ def __init__(self, name=None, create=False, size=0):
except OSError:
self.unlink()
raise

resource_tracker.register(self._name, "shared_memory")
if self._track:
resource_tracker.register(self._name, "shared_memory")

else:

Expand Down Expand Up @@ -236,12 +238,20 @@ def close(self):
def unlink(self):
"""Requests that the underlying shared memory block be destroyed.
In order to ensure proper cleanup of resources, unlink should be
called once (and only once) across all processes which have access
to the shared memory block."""
Unlink should be called once (and only once) across all handles
which have access to the shared memory block, even if these
handles belong to different processes. Closing and unlinking may
happen in any order, but trying to access data inside a shared
memory block after unlinking may result in memory errors,
depending on platform.
This method has no effect on Windows, where the only way to
delete a shared memory block is to close all handles."""

if _USE_POSIX and self._name:
_posixshmem.shm_unlink(self._name)
resource_tracker.unregister(self._name, "shared_memory")
if self._track:
resource_tracker.unregister(self._name, "shared_memory")


_encoding = "utf8"
Expand Down
53 changes: 53 additions & 0 deletions Lib/test/_test_multiprocessing.py
Original file line number Diff line number Diff line change
Expand Up @@ -4462,6 +4462,59 @@ def test_shared_memory_cleaned_after_process_termination(self):
"resource_tracker: There appear to be 1 leaked "
"shared_memory objects to clean up at shutdown", err)

@unittest.skipIf(os.name != "posix", "resource_tracker is posix only")
def test_shared_memory_untracking(self):
# gh-82300: When a separate Python process accesses shared memory
# with track=False, it must not cause the memory to be deleted
# when terminating.
cmd = '''if 1:
import sys
from multiprocessing.shared_memory import SharedMemory
mem = SharedMemory(create=False, name=sys.argv[1], track=False)
mem.close()
'''
mem = shared_memory.SharedMemory(create=True, size=10)
# The resource tracker shares pipes with the subprocess, and so
# err existing means that the tracker process has terminated now.
try:
rc, out, err = test.support.script_helper.assert_python_ok("-c", cmd, mem.name)
self.assertNotIn(b"resource_tracker", err)
self.assertEqual(rc, 0)
mem2 = shared_memory.SharedMemory(create=False, name=mem.name)
mem2.close()
finally:
try:
mem.unlink()
except OSError:
pass
mem.close()

@unittest.skipIf(os.name != "posix", "resource_tracker is posix only")
def test_shared_memory_tracking(self):
# gh-82300: When a separate Python process accesses shared memory
# with track=True, it must cause the memory to be deleted when
# terminating.
cmd = '''if 1:
import sys
from multiprocessing.shared_memory import SharedMemory
mem = SharedMemory(create=False, name=sys.argv[1], track=True)
mem.close()
'''
mem = shared_memory.SharedMemory(create=True, size=10)
try:
rc, out, err = test.support.script_helper.assert_python_ok("-c", cmd, mem.name)
self.assertEqual(rc, 0)
self.assertIn(
b"resource_tracker: There appear to be 1 leaked "
b"shared_memory objects to clean up at shutdown", err)
finally:
try:
mem.unlink()
except OSError:
pass
resource_tracker.unregister(mem._name, "shared_memory")
mem.close()

#
# Test to verify that `Finalize` works.
#
Expand Down

0 comments on commit 7908605

Please sign in to comment.