RFC: Avoid some tracebacks during populate #785

dwlehman · 2019-07-08T19:25:31Z

Having fixed big issues with active LVM on partial VGs and with unusable dmraid arrays, the most-reported failure during blivet populate involves partitioned disks that have been cloned by users without adjusting the disklabel UUIDs to make them unique. In the past I have generally favored a "you must fix this problem before proceeding" approach, but now I'm thinking more about being robust and not crashing unless it's really necessary.

I'm mostly curious what people think of "Allow duplicate UUIDs...", "Unset partition UUIDs...", and "General protection against...". The first two of these are tested, both via unit tests and a "real" test using examples/list_devices.py in a vm. The third ("General protection...") probably needs more work.

vojtechtrefny

Generally looks like a good improvement to me. Definitely needs some testing probably also from Anaconda.

vojtechtrefny · 2019-07-09T10:56:03Z

blivet/devicetree.py

-            result = six.next((d for d in devices if d.uuid == uuid or d.format.uuid == uuid), None)
+            matches = iter(d for d in devices if d.uuid == uuid or d.format.uuid == uuid)
+            result = six.next(matches, None)
+            extra = six.next(matches, None)


I think a simple list comprehension and checking if len(matches) == 1 would be better here. It would also allow printing more than two duplicate devices.

Agreed. Done.

vojtechtrefny · 2019-07-09T10:57:29Z

blivet/devicetree.py

@@ -160,6 +160,12 @@ def _add_device(self, newdev, new=True):
            dev = self.get_device_by_uuid(newdev.uuid, incomplete=True, hidden=True)


Git comment in the commit message.

vojtechtrefny · 2019-07-09T12:12:51Z

blivet/devicetree.py

+                # pretend it has no UUID. The only negative effect is on event handling,
+                # which is disabled by default.
+                newdev.uuid = None
+                dev.uuid = None


I don't like the idea of "removing" the UUIDs. We already have two ways how to make the reset "stable" in this situation -- special exception that can be caught and used to ignore one of the disks and the flag. And now we are adding a special case for partitions that just "silently" removes the UUID?. I think the flag should be enough and Anaconda should set it by default.

I agree that seems like a sketchy thing to do. I dropped that commit.

There is a new flag to control this behavior: allow_inconsistent_config. When False, DuplicateUUIDError will be raised when trying to add any subsequent device whose UUID matches one already in the tree. When True, the duplicate UUIDs will be allowed to coexist until/unless a call to DeviceTree.get_device_by_uuid is made with the duplicate UUID as its argument. In this case, DuplicateUUIDError will be raised.

The goal here is to absolutely minimize tracebacks that occur when populating the devicetree. There are mechanisms in place that will limit the supported functionality for managing such devices, but there will be no tracebacks until/unless the user tries to manage them in ways that are precluded by the detection issues previously encountered.

vpodzime

Looks good to me.

vpodzime · 2020-03-08T21:25:23Z

blivet/populator/populator.py

-            device = helper_class(self, info).run()
+            try:
+                device = helper_class(self, info).run()
+            except DeviceTreeError as e:


What could be really important here is that every run() method should make sure to gather as much information as possible before throwing the exception. Otherwise the information about the problematic device may be very incomplete.

vojtechtrefny · 2024-10-18T14:21:53Z

Obsoleted by #1306

vojtechtrefny reviewed Jul 9, 2019

View reviewed changes

dwlehman force-pushed the robust-populate branch 3 times, most recently from 473229f to 6583721 Compare July 10, 2019 16:39

dwlehman added 5 commits July 31, 2019 17:51

Fix name resolution for md member partitions. (#1699173)

b4e9cee

Base UnusableConfigurationError on DeviceTreeError.

9a8a8ad

Remove logging of expected/unremarkable tracebacks.

af9cd39

dwlehman force-pushed the robust-populate branch from 6583721 to de62be7 Compare July 31, 2019 21:52

vojtechtrefny mentioned this pull request Mar 5, 2020

Fix name resolution for md member partitions. (#1798792) #827

Merged

vpodzime approved these changes Mar 8, 2020

View reviewed changes

vojtechtrefny mentioned this pull request Jan 6, 2021

unable to get udev info for sdd #919

Open

vojtechtrefny closed this Oct 18, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

RFC: Avoid some tracebacks during populate #785

RFC: Avoid some tracebacks during populate #785

dwlehman commented Jul 8, 2019 •

edited

Loading

vojtechtrefny left a comment

vojtechtrefny Jul 9, 2019

dwlehman Jul 9, 2019

vojtechtrefny Jul 9, 2019

vojtechtrefny Jul 9, 2019

dwlehman Jul 9, 2019

vpodzime left a comment

vpodzime Mar 8, 2020

vojtechtrefny commented Oct 18, 2024

		@@ -160,6 +160,12 @@ def _add_device(self, newdev, new=True):
		dev = self.get_device_by_uuid(newdev.uuid, incomplete=True, hidden=True)

RFC: Avoid some tracebacks during populate #785

RFC: Avoid some tracebacks during populate #785

Conversation

dwlehman commented Jul 8, 2019 • edited Loading

vojtechtrefny left a comment

Choose a reason for hiding this comment

vojtechtrefny Jul 9, 2019

Choose a reason for hiding this comment

dwlehman Jul 9, 2019

Choose a reason for hiding this comment

vojtechtrefny Jul 9, 2019

Choose a reason for hiding this comment

vojtechtrefny Jul 9, 2019

Choose a reason for hiding this comment

dwlehman Jul 9, 2019

Choose a reason for hiding this comment

vpodzime left a comment

Choose a reason for hiding this comment

vpodzime Mar 8, 2020

Choose a reason for hiding this comment

vojtechtrefny commented Oct 18, 2024

dwlehman commented Jul 8, 2019 •

edited

Loading