RFC: Globally Locked Cargo #1781

Byron · 2015-07-04T13:23:36Z

This PR is the successor of 'Concurrent Cargo' and uses a single lock to assure multiple cargo invocations will not interfere with each other in an undefined fashion.

Implementation Details

The file-lock crate provides the actual lock implementation. It allows to try obtaining a lock (non-blocking) or to wait until the lock was obtained. All this is done using standard operating system facilities which are available on both windows and posix-compatible systems.

The lock-file is persistent and is currently placed in ${CARGO_HOME}/.global-lock. Please note it s not comparable to the locking mechanism git uses, as the latter doesn't support blocking until a lock is obtained.

Using the cargo configuration and the key build.lock-kind, it is possible to change the default from nowait to wait and thus cause Cargo processes to wait for each other.

Open Questions

In order of perceived relevance:

How can I implement a test-case which doesn't suffer from races ?
- The preliminary test only works occasionally, and for some reason the spawned background process starts after the foreground process and thus fails to obtain the lock. Adding sleep_ms seems like the wrong thing to do.
Should the lock better be obtained in cargo::process(...) ?
Which name would you prefer for the lock file ?
Which configuration key would you prefer ? The current one seems not to be too fitting.

Testing

Manual tests indicate the system works as expected.

$ git clone https://github.com/Byron/google-apis-rs
$ cd google-apis-rs
# Build 4 targets with plenty of dependencies in parallel. Depending on the `build.jobs` variable 
# in `.cargo.config`, there is more or less intra-process contention.
$ CARGO_HOME=$PWD/cargo_home make -j4 groupsmigration1-cli-cargo discovery1-cli-cargo translate2-cli-cargo audit1-cli-cargo ARGS=build

The .cargo/config contained in the aforementioned repository is used to configure cargo. Currently it looks like this:

[build]
target-dir = "target"
lock-kind = "wait"

Work still to be done

Tests for both, blocking and nonblocking, lock options
Support for the windows platform.
- The latter is achieved through improvements to the file-lock crate and should be transparent to cargo.

It was previously used in the `Concurrent Cargo` [PR](https://goo.gl/pcUvVH).

The lock is obtained in a section shared by all cargo sub-command invocations. By default, failure to obtain a lock will result in gracefully aborting the operation. However, it is possible to configure it to wait until a lock can be obtained.

However, it suffers from a race condition that make it succeed only occasionally. The lock code was moved into cargo's main execute function, which should be better as the lock will be obtained even sooner after cargo comes up. Removed some special-case code which made the lock work better in a multi-threaded environment, which now is not needed anymore due to the changed lock granularity. This also allows to remove `errno` crate.

rust-highfive · 2015-07-04T13:23:46Z

r? @alexcrichton

(rust_highfive has picked a reviewer for you, use r? to override)

alexcrichton · 2015-07-06T15:39:19Z

Cargo.lock

@@ -93,6 +94,26 @@ dependencies = [
 ]

 [[package]]
+name = "errno"


I think this lockfile may need to be regenerated.

Regenerating it does not remove the errno entry - it just updates dependencies to the latest patch-levels. I am not sure if this is what you intended.

Oh right, duh! Forgot this was a dependency of file-lock.

That being said I believe that this crate doesn't provide anything beyond io::Error::last_os_error() and io::Error::raw_os_error(), so I'd prefer if the file-lock crate were adjusted to not have this dependency.

alexcrichton · 2015-07-06T15:48:17Z

I think that to start off with we can just pick a reasonable default (either wait or don't wait), and in the case of waiting I also think that a message needs to be printed to the effect of "we're now waiting for some other Cargo to exit"

Byron · 2015-07-06T16:19:11Z

[...] and in the case of waiting I also think that a message needs to be printed to the effect of "we're now waiting for some other Cargo to exit"

Unfortunately this doesn't work, as we don't know that we are waiting in case we do so. We can only tell we didn't get a lock if we tried to get a lock first, then print "we are waiting" and then wait on the lock. However, that would clearly be a race.

So far I have the impression you favour to not make the handling configurable, which to my mind would restrict the usefulness of the implementation. After all, both behaviours, wait and nowait have their benefits, and the file-lock crate can handle both.

Maybe I just misunderstood you in this regard.

alexcrichton · 2015-07-07T16:12:57Z

However, that would clearly be a race.

Could you elaborate on the race here? I'm not quite sure what the problem would be in this case.

So far I have the impression you favour to not make the handling configurable, which to my mind would restrict the usefulness of the implementation.

Yeah I think it's fine to perhaps expand into this in the future, but I'd prefer to get some experience with the current implementation before getting more ambitious, and I figure that waiting-by-default is a reasonable way to start out.

Byron · 2015-07-07T19:52:25Z

Thanks for the replies, I believe I know enough now to prepare everything for a next review. I will let you know once I think I addressed all issues.

Good luck with getting a window CI environment ready ! I wonder why it's not possible to build cargo along with rustc, which certainly is already tested on all supported platforms. After all, cargo is bundled with rustc and expected to run equally well on all supported platforms.

Could you elaborate on the race here? I'm not quite sure what the problem would be in this case.

A non-waiting (try) lock followed by blocking lock in case try-lock failed would cause cargo to print "waiting for other process". That other process could, while we are printing this for example, drop the lock, and we get the lock without blocking at all. Two sys-calls in short succession are always a race, to my mind. One could argue that we are talking about a fraction of a second, and even if there is a race, the worst thing that could happen is a message printed for no real reason.

alexcrichton · 2015-07-07T23:36:21Z

That other process could, while we are printing this for example, drop the lock, and we get the lock without blocking at all.

This doesn't sound like a race condition, just something that can happen? There's no harm in doing this and if Cargo continues quickly then even better!

* removed access to configuration, and will instead default to blocking semantics. This caused a change in the API of the `CargoLock` type, that generally reduced complexity.

That way, all error related code can be found in one module, and isn't sprinkled all over the place.

* just fail if the directory creation fails. This could happen if there is a race, but such a race would only possibly occour if cargo has never been invoked before on a particular CARGO_HOME * use try! + chain_error + human instead of more complex error handling

alexcrichton · 2015-07-08T22:40:57Z

Ah and I've now set up AppVeyor CI for Cargo so when this PR is rebased I think it'll trigger a new CI build on Windows.

bors · 2015-07-29T17:44:17Z

☔ The latest upstream changes (presumably #1860) made this pull request unmergeable. Please resolve the merge conflicts.

bors · 2015-08-04T23:03:11Z

☔ The latest upstream changes (presumably #1830) made this pull request unmergeable. Please resolve the merge conflicts.

bors · 2015-08-17T04:47:16Z

☔ The latest upstream changes (presumably #1885) made this pull request unmergeable. Please resolve the merge conflicts.

bors · 2015-10-06T22:09:08Z

☔ The latest upstream changes (presumably #2022) made this pull request unmergeable. Please resolve the merge conflicts.

bors · 2015-10-21T21:20:10Z

☔ The latest upstream changes (presumably #2061) made this pull request unmergeable. Please resolve the merge conflicts.

It builds, some tests fail though

bors · 2015-11-19T19:12:54Z

☔ The latest upstream changes (presumably #2154) made this pull request unmergeable. Please resolve the merge conflicts.

Byron · 2015-12-07T15:49:07Z

Thanks for the time invested into this PR. However, I have to face the bitter truth that I won't be the one finishing this.

The latest attempt to bring it up-to-date with master revealed that libc has moved or renamed constants used by the file-lock crate, which now would need additional work - something inevitable considering the required windows port.

Therefore I believe it's best to close the issue to prevent it from being dragged along into 2016.

Byron added 3 commits July 4, 2015 11:44

Added CargoLock implementation

170b961

It was previously used in the `Concurrent Cargo` [PR](https://goo.gl/pcUvVH).

Lock every primary cargo command

b4df291

The lock is obtained in a section shared by all cargo sub-command invocations. By default, failure to obtain a lock will result in gracefully aborting the operation. However, it is possible to configure it to wait until a lock can be obtained.

rust-highfive assigned alexcrichton Jul 4, 2015

Byron mentioned this pull request Jul 5, 2015

Concurrent Builds Byron/google-apis-rs#122

Closed

alexcrichton reviewed Jul 6, 2015
View reviewed changes

Byron added 3 commits July 8, 2015 10:11

CargoLock LockKind is now specified per lock call

9c2124e

* removed access to configuration, and will instead default to blocking semantics. This caused a change in the API of the `CargoLock` type, that generally reduced complexity.

Moved LockError related code to errors.rs

b900af9

That way, all error related code can be found in one module, and isn't sprinkled all over the place.

Byron mentioned this pull request Jul 21, 2015

Run rustc/cargo in the background to avoid blocking UI oschwald/SublimeLinter-contrib-rustc#17

Closed

Merge branch 'master' into global-lock

4e98db0

Merge remote-tracking branch 'origin/master' into global-lock

4c4da88

Merge remote-tracking branch 'origin/master' into global-lock

367749f

Merge remote-tracking branch 'origin/master' into global-lock

7461f8b

Merge remote-tracking branch 'origin/master' into global-lock

e83b567

It builds, some tests fail though

Byron closed this Dec 7, 2015

alexcrichton mentioned this pull request Mar 15, 2016

Fix running Cargo concurrently #2486

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

RFC: Globally Locked Cargo #1781

RFC: Globally Locked Cargo #1781

Byron commented Jul 4, 2015

rust-highfive commented Jul 4, 2015

alexcrichton Jul 6, 2015

Byron Jul 8, 2015

alexcrichton Jul 8, 2015

alexcrichton commented Jul 6, 2015

Byron commented Jul 6, 2015

alexcrichton commented Jul 7, 2015

Byron commented Jul 7, 2015

alexcrichton commented Jul 7, 2015

alexcrichton commented Jul 8, 2015

bors commented Jul 29, 2015

bors commented Aug 4, 2015

bors commented Aug 17, 2015

bors commented Oct 6, 2015

bors commented Oct 21, 2015

bors commented Nov 19, 2015

Byron commented Dec 7, 2015

RFC: Globally Locked Cargo #1781

RFC: Globally Locked Cargo #1781

Conversation

Byron commented Jul 4, 2015

Implementation Details

Open Questions

Testing

Work still to be done

rust-highfive commented Jul 4, 2015

alexcrichton Jul 6, 2015

Choose a reason for hiding this comment

Byron Jul 8, 2015

Choose a reason for hiding this comment

alexcrichton Jul 8, 2015

Choose a reason for hiding this comment

alexcrichton commented Jul 6, 2015

Byron commented Jul 6, 2015

alexcrichton commented Jul 7, 2015

Byron commented Jul 7, 2015

alexcrichton commented Jul 7, 2015

alexcrichton commented Jul 8, 2015

bors commented Jul 29, 2015

bors commented Aug 4, 2015

bors commented Aug 17, 2015

bors commented Oct 6, 2015

bors commented Oct 21, 2015

bors commented Nov 19, 2015

Byron commented Dec 7, 2015