Make write safer #1416

JakeSiFive · 2023-09-12T18:57:42Z

Recently we had a very small mistake (swapping the arguments of write) cause total workspace data loss. This is unacceptable for wake. This change makes write safer via the following measures:

write will no longer allow "." or "" as a write location
write will only unlink files, and not do a deep unlink. This means that two different build options might conflict in wake but I think that's bad style anyway
write will not write outside of the root workspace. This will break some rare use cases but they can be replaced with a bespoke job instead
write will not overwrite a source file

It would be nice to add an additional check that write does not overwrite anything previously hashed but this is a bit tricky to do and thanks to the fact that we don't deep unlink anything now, I think this should be fine for now as data loss is quite limited and it would be unusual for the content to 1) be a valid file path and 2) point to something the user intended to keep.

ag-eitilt · 2023-09-12T19:25:41Z

share/wake/lib/system/io.wake

+    require False = matches `\.\..*` path
+    else failWithError "Attempt to write outside of the workspace"


This probably needs to be run through a relativizer of some sort (I don't think it is yet, but I might have missed something) to catch subdir/../... It's also filtering out things like ..inRoot which are most definitely bad names but which are still in the workspace.

All paths leading into this function simplify the path before passing it to the implementation

Great, thanks!

ag-eitilt · 2023-09-12T19:27:46Z

share/wake/lib/system/job.wake

+            require Pair (Pass inFile) _ =
                write specFilePath (prettyJSON json)
                | rmap getPathName
-            else Pair (Fail (makeError "Failed to 'write {specFilePath}'.")) ""
+                | addErrorContext "Failed to 'write {specFilePath}: '"
+                | (Pair _ "")


What's the Pair for? It seems to be unconditionally added with no meaningful data.

The return type of this function is Pair (Result ...) String because its a runner pre function. You can see that previously the same Pair was being constructed in the else branch. I just made the error message nicer while working around the return type being a Pair

Ah, now that I'm seeing the mixed lines, it looks like something to fit the else case.

V-FEXrt · 2023-09-12T19:58:56Z

share/wake/lib/system/io.wake

+    else failWithError "Attempt to write to an absolute path"
+
+    # Source files should never be deleted so we check for this case
+    def scan dir regexp = prim "sources"


Calling prim "sources" for every call to write is expensive no? Do we cache the result somewhere?

Wake reads the full list at the start and then does a linear regex scan through it. The linear scan could be improved to be a Tree in this case since we don't need a regex but this is the same cost we incur when we source a file today so I think its fine? I'll test it on a larger build to see what the effect is.

mmjconolly · 2023-09-14T17:39:29Z

We should try to get this into a wake release

1. write will no longer allow "." or "" as a write location 2. write will only unlink files, and not do a deep unlink. This means that two different build options might conflict in wake but I think that's bad style anyway 3. write will not write outside of the root workspace. This will break some rare use cases but they can be replaced with a bespoke job instead 4. write will not overwrite a source file

ag-eitilt · 2023-09-19T18:43:02Z

src/runtime/string.cpp

+      error +=
+          " is a directory and cannot be overwritten. If this is intentional please manually "
+          "delete this directory";
+      size_t len = std::min(error.size(), max_error);
+      String *out = String::claim(runtime.heap, error.c_str(), len);


This is running into max_error problems, and really doesn't look good when it does: Fail (Error "src is a directory and cannot be overwritten. If this is intentional please manually delete this direct" Nil)

oh crap, thanks for finding and debugging that! We should increase max error.

I think there's always going to be some problem if we simply bump the value. Using an unrealistically low max_error to stand in for a longer path than I want to type: Fail (Error "very/long/path/to/some/fi" Nil). We need to trim the filepath (with some indication that it's been trimmed) without risking the error message itself.

I probably won't be prioritizing that level of quality fix but I'd love to see that sort of thing go in. We can add in the the size of the path instead I think. Paths on Linux are not allowed to be any longer than 4096 so we can set it to something like 100 + std::max(path.size(), 4096).

JakeSiFive added 2 commits September 11, 2023 13:06

stashing

4ad70ea

Make write safer

baea979

JakeSiFive requested review from mmjconolly, V-FEXrt and ag-eitilt September 12, 2023 18:57

JakeSiFive added 2 commits September 12, 2023 11:58

Whoops, got my eviction code mixed up in this

602b214

whoops, missed the Cargo file as well

a754059

ag-eitilt reviewed Sep 12, 2023

View reviewed changes

ag-eitilt approved these changes Sep 12, 2023

View reviewed changes

V-FEXrt reviewed Sep 12, 2023

View reviewed changes

V-FEXrt approved these changes Sep 12, 2023

View reviewed changes

JakeSiFive merged commit 117fb96 into master Sep 13, 2023
12 checks passed

JakeSiFive deleted the make_write_safer branch September 13, 2023 19:45

ag-eitilt reviewed Sep 19, 2023

View reviewed changes

ag-eitilt mentioned this pull request Sep 20, 2023

Refactor installAs for out-of-workspace copies #1422

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Make write safer #1416

Make write safer #1416

JakeSiFive commented Sep 12, 2023

ag-eitilt Sep 12, 2023

JakeSiFive Sep 12, 2023

ag-eitilt Sep 12, 2023

ag-eitilt Sep 12, 2023

JakeSiFive Sep 12, 2023

ag-eitilt Sep 12, 2023

V-FEXrt Sep 12, 2023

JakeSiFive Sep 12, 2023

mmjconolly commented Sep 14, 2023

ag-eitilt Sep 19, 2023

JakeSiFive Sep 19, 2023

ag-eitilt Sep 19, 2023

JakeSiFive Sep 19, 2023

		require False = matches `\.\..*` path
		else failWithError "Attempt to write outside of the workspace"

Make write safer #1416

Make write safer #1416

Conversation

JakeSiFive commented Sep 12, 2023

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

mmjconolly commented Sep 14, 2023

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment