Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Add advisory file locking API #111

Open
wants to merge 21 commits into
base: main
Choose a base branch
from

Conversation

milseman
Copy link
Contributor

When trying to resuscitate fcntl support, file locks came up again. Here I try to carve off flock support.

Some questions:

  • Should it be named advisoryLock?
  • Should unlock exist or should we have a 3-way enum for shared, exclusive, unlock?

@milseman milseman force-pushed the files_of_a_feather_advisory_lock_together branch from 6a4dcd0 to eb5dd59 Compare October 12, 2022 16:13
@milseman
Copy link
Contributor Author

We'll probably want to use OFD locks.

@@ -203,6 +224,10 @@ final class FileOperationsTest: XCTestCase {
XCTAssertEqual(readBytesAfterTruncation, Array("ab".utf8))
}
}

func testFlock() throws {
// TODO: We need multiple processes in order to test blocking behavior
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

It's been many years since I wrote a flock(2) wrapper, but in those tests I ended up shelling out to flock(1) as an interop test on Linux. I guess we won't be able to do anything like that in this project?

What do you have in mind for this test?

We could potentially at least check the lock-lock, lock-unlock, and lock-upgrade flows?

Given that two independent file descriptors for the same underlying file can still block each other there are some tests we can write in a single process.

Copy link
Contributor Author

@milseman milseman Oct 13, 2022

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I think we should pursue OFD locks which would be easier to check. Two separate open calls in the same process should have different lock sets. When we get fork support we can also test some behavior there.

///
/// A shared lock may be upgraded to an exclusive lock, and vice versa, simply by specifying the appropriate
/// lock type; this results in the previous lock being released and the new lock
/// applied (possibly after other processes have gained and released the lock).
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I appreciate this is as specific as man 2 flock gets and its implicit that the upgrade only works on the same file descriptor I suppose, i.e. if a process has two independent file descriptors to the same underlying file, they can still block each other.

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I believe flock(2) would be associated between process/file, while OFD locks would be file-description associated (but not descriptor, i.e. dup would share locks).

Copy link
Contributor

@simonjbeaumont simonjbeaumont Oct 14, 2022

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

We may already be on the same page here, but I believe that, with flock(2), using dup(2) will allow the new fd to upgrade a lock, but using open(2) will not.

Here's a toy program that accepts NEW_FD_METHOD as an environment variable and attempts to upgrade a shared lock to an exclusive one using a new file descriptor pointing to the same file as was already locked:

https://gist.github.com/simonjbeaumont/7bd12f1f97e162734b6fec847d3a969f

Running it with both options and dup is fine, open fails:

clang -o flock-test flock-test.c && NEW_FD_METHOD=dup ./flock-testclang -o flock-test flock-test.c && NEW_FD_METHOD=open ./flock-test
error obtaining exclusive lock with new fd 4: Resource temporarily unavailable

@simonjbeaumont
Copy link
Contributor

  • Should unlock exist or should we have a 3-way enum for shared, exclusive, unlock?

In a previous life I wrote a flock(2) wrapper which took a 3-way enum and a non-blocking flag. However, I on reflection, I think the API you have now, i.e. a separate unlock, is more reflective of usage since IIUC the LOCK_UN cannot block?

I guess the question is: how faithful/thin a wrapper do you want this to be over flock(2) vs. how much would you like to abstract away the bit flags?

@milseman
Copy link
Contributor Author

I am definitely going to use OFD locks for the higher level API. This means that there is a getLock() which returns read/write/none. We need the 3-way enum anyways to answer the API, and unlock() can be a simple wrapper around lock(.none).

Since it's easier/better for the developer to call unlock(), we can choose names that are better geared towards getLock(). That is we can call F_UNLCK .none since that is what the constant is used for.

@milseman
Copy link
Contributor Author

Ok, this has been switched over entirely to OFD locks.

Questions still open:

  • Is getConflictingLock() the right API, or should it be something like canLock(_:FileLock.Kind)?
  • For docs, can/should we link somewhere for OFD locks? Can we reduce duplication?
  • Are we ready to require 5.7?
    • otherwise embrace the overloads for some RangeExpression
    • speaking of... do we need to not use some?
  • Is non-blocking/non-waiting the default?
  • Need to get availability hooked up

@milseman
Copy link
Contributor Author

Also note that getConflictingLock has no atomicity guarantee between it and actually trying to set a lock, nor is there atomicity inside it's algorithm trying to find the strongest lock (though I weakly-believe that races within the algorithm could be benign).

An alternative is to drop the API and have lock() return a Bool indicating success. I.e. we handle EAGAIN in this top-level API under similar-ish reasoning to handling EINTR. EAGAIN / EWOULDBLOCK serves a somewhat analogous purpose: errno happens to be the vehicle used for fine-grained communication regarding the API, but it's not intrinsically used for "errors".

We could craft some philosophy around this as well which could extend to e.g. read under EWOULDBLOCK.

@numist
Copy link
Member

numist commented Nov 16, 2022

I feel like the API here may still be more faithful to the history of file locking APIs than that history deserves.

In mutexing APIs it's common to have a try method that returns Bool immediately and a lock method that blocks until success1; with that pair you would resolve your atomicity problems and gain the opportunity to use F_SETLKWTIMEOUT as an implementation detail in the blocking method (on platforms that support it) to get priority donation on contention.

Footnotes

  1. Also—more opinionatedly—lock state checking functions that always crash if the lock is not in the expected state.

@milseman
Copy link
Contributor Author

milseman commented Feb 7, 2023

I like the idea of os_unfair_lock_assert_not_owner, though terminating the process is a little hostile to testing. In Swift, we'd probably want something to compose with assert precondition. E.g. precondition(try myFD.ownedLock == .write) or something. We would have documentation recommending that it be used for these kind of purposes rather than program logic as it wouldn't provide atomicity with other code. I wonder if there's a name that would hint a little more strongly at this (we can also discuss that during API review).

For the lock operation, it seems there are 3 variants:

  • block-til-lock
  • lock-no-block
  • block-til-lock-unless-timeout (Darwin only)

When you say using F_SETLKWTIMEOUT as an implementation detail, do you mean that semantics of the operation would differ across platforms? I'd lean a little more closely to having a separate API (or a boolean parameter), but I'm curious your thoughts here.

For example, we could have:

extension FileDescriptor {
    var ownedLock: throws FileLock.Kind?

    func tryLock(...) throws -> Bool
    func lock(...) throws
    func unlock(...) throws

#if !os(Linux)
    func tryLock(..., blockUntilTimeout:...) throws -> Bool
#endif    
}

Though I'm sure we'll iterate on the shape and names of those API during review. E.g. I'm not clear if the timeout taking lock should be a variant of tryLock (which matches the API shape and use site) or lock.

@milseman
Copy link
Contributor Author

This has the API as lock/tryLock/unlock. Alternate names could be lockOrWait/lock/unlock, etc. There is no way to do ownedLock with the current syscalls, unfortunately.

This is ready for review and close to mergeable. We will need to go through some steps before the release:

  1. We need commented out availability annotations
  2. We'll want to do some API review for names
  3. We'll want to test wait and timeouts (@lorentey ideas on how?)
  4. We need to figure out if unlocking without waiting is always expected to work

@milseman milseman force-pushed the files_of_a_feather_advisory_lock_together branch from 2a64ac6 to a1cbeeb Compare March 8, 2023 00:12
@milseman
Copy link
Contributor Author

milseman commented Mar 8, 2023

Early draft of API doc: https://gist.github.com/milseman/fbf1ec12ac9a6635d830d9457fec0776

@milseman milseman force-pushed the files_of_a_feather_advisory_lock_together branch from 384b14d to a15d9a7 Compare March 8, 2023 00:50
@milseman
Copy link
Contributor Author

milseman commented Mar 8, 2023

@swift-ci please test

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

3 participants