fs/ufs: change default locking protocol - v4.1.x #12760

edgargabriel · 2024-08-14T20:17:27Z

By default, the fs/ufs component disabled all file locking before read/write operations (except on NFS file systems). This was based on the assumption that the OS itself performs the required locking and hence we don't have to add to it.

This assumption is incorrect when using data sieving. In data sieving, the code 'ignores' small gaps when we write to a file and instead performs a read-modify-write sequence itself for performance reasons. The problem, however, is that even within a collective operation not all aggregators might want to use data sieving. Hence, enabling locking just for the data-sieving routines is insufficient; all processes have to perform the locking. Therefore, our two options are: a) disable write data-sieving by default, or b) enable range-locking by default.
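To make the read-modify-write pattern concrete, here is a minimal sketch of a data-sieving write guarded by a POSIX byte-range lock. It is not the actual Open MPI code; `struct piece`, `range_lock`, and `sieve_write` are hypothetical names introduced for illustration, and it assumes advisory `fcntl` locks are the locking mechanism.

```c
#define _XOPEN_SOURCE 700
#include <fcntl.h>
#include <stdlib.h>
#include <string.h>
#include <sys/types.h>
#include <unistd.h>

/* Hypothetical descriptor for one small piece of user data. */
struct piece {
    off_t       offset;   /* absolute file offset of the piece */
    size_t      len;      /* number of bytes in the piece */
    const void *data;     /* bytes to place at that offset */
};

/* Set (F_WRLCK) or clear (F_UNLCK) an advisory lock on [start, start + len). */
static int range_lock(int fd, short type, off_t start, off_t len)
{
    struct flock fl;
    memset(&fl, 0, sizeof(fl));
    fl.l_type   = type;
    fl.l_whence = SEEK_SET;
    fl.l_start  = start;
    fl.l_len    = len;
    return fcntl(fd, F_SETLKW, &fl);   /* blocks until the range is free */
}

/* Write several small pieces as one read-modify-write over [start, start + span).
 * Bytes in the gaps are read and written back unchanged, which is why any
 * other writer touching the gaps must also take the lock. */
int sieve_write(int fd, off_t start, size_t span,
                const struct piece *pieces, size_t npieces)
{
    char *buf;
    size_t done, i;
    int rc = -1;

    if (span == 0) return 0;
    buf = calloc(1, span);             /* zero-filled: bytes past EOF become zeros */
    if (buf == NULL) return -1;
    if (range_lock(fd, F_WRLCK, start, (off_t)span) != 0) {
        free(buf);
        return -1;
    }

    /* Read: fetch the current contents of the span, gaps included. */
    for (done = 0; done < span; ) {
        ssize_t r = pread(fd, buf + done, span - done, start + (off_t)done);
        if (r < 0) goto out;
        if (r == 0) break;             /* end of file */
        done += (size_t)r;
    }

    /* Modify: overlay the user data, rejecting pieces outside the span. */
    for (i = 0; i < npieces; i++) {
        if (pieces[i].offset < start) goto out;
        size_t rel = (size_t)(pieces[i].offset - start);
        if (rel > span || pieces[i].len > span - rel) goto out;
        memcpy(buf + rel, pieces[i].data, pieces[i].len);
    }

    /* Write: put the whole span back in one contiguous pass. */
    for (done = 0; done < span; ) {
        ssize_t w = pwrite(fd, buf + done, span - done, start + (off_t)done);
        if (w <= 0) goto out;
        done += (size_t)w;
    }
    rc = 0;

out:
    range_lock(fd, F_UNLCK, start, (off_t)span);
    free(buf);
    return rc;
}
```

Because the lock is advisory, it only protects the gap bytes if a process that writes into a gap directly (without data sieving) also locks its range first. Otherwise its update can land between the read and the write-back above and be overwritten with stale data.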

After some testing, I think enabling range-locking by default is the safer and better approach. It doesn't seem to show any significant performance impact on my test systems.

Note that on Lustre file systems we can keep the default at no-locking as far as I can see, since the collective algorithm used by Lustre is unlikely to produce this pattern. I did, however, add an MCA parameter that allows us to control the locking algorithm used by the Lustre component as well, in case we need to change that for a particular use case or platform.
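For anyone who needs to override the Lustre default from inside an application, a rough sketch follows. It assumes the parameter's full name is `fs_lustre_lock_algorithm` (derived from the usual framework_component_variable naming, not stated in this PR), that values 0 to 3 are the accepted range as listed in the parameter's help string, and that the `OMPI_MCA_` environment prefix is read during `MPI_Init`. The helper name `request_lustre_lock_algorithm` is hypothetical.

```c
#define _POSIX_C_SOURCE 200809L
#include <stdio.h>
#include <stdlib.h>

/* Ask for a specific locking algorithm in the Lustre fs component.
 * Assumption: must be called before MPI_Init so the MCA system sees it.
 * Returns 0 on success, -1 if the value is outside 0..3 or setenv fails. */
int request_lustre_lock_algorithm(int algorithm)
{
    char value[16];

    if (algorithm < 0 || algorithm > 3) {
        return -1;
    }
    snprintf(value, sizeof(value), "%d", algorithm);
    /* Assumed variable name: OMPI_MCA_ prefix plus the full parameter name. */
    return setenv("OMPI_MCA_fs_lustre_lock_algorithm", value, 1);
}
```

Whether the parameter is present in a given build can be checked with `ompi_info` before relying on it.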

Fixes Issue #12718

Signed-off-by: Edgar Gabriel [email protected]
(cherry picked from commit c697f28)

jsquyres · 2024-09-24T19:21:43Z

ompi/mca/fs/lustre/fs_lustre_component.c

+    (void) mca_base_component_var_register(&mca_fs_lustre_component.fsm_version,
+                                           "lock_algorithm", "Locking algorithm used by the fs ufs component. "
+                                           " 0: auto (default), 1: skip locking, 2: always lock entire file, "
+                                           "3: lock only specific ranges",


If this MCA parameter only accepts specific values, is there any reason not to use the enum MCA type? (such that the MCA base will take care of ensuring that only allowable values are set)

@jsquyres you are technically correct that we could use the enum MCA type; this is more a matter of what I am used to doing vs. what is possible. That being said, the same PR has already been merged to 5.0.x as well; do we really want to go back and change that too? We could still fix it in the main branch for future releases.
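As a self-contained illustration of what enum-style validation buys over a plain integer, the sketch below accepts only the four values from the help string, either by number or by name, and rejects everything else. It deliberately does not use the MCA base API; `lock_choices`, `parse_lock_algorithm`, and the symbolic names ("skip", "entire_file", "ranges") are hypothetical and chosen only for this example.

```c
#include <stdlib.h>
#include <string.h>

/* One allowed setting: numeric value plus a hypothetical symbolic name. */
struct lock_choice {
    int         value;
    const char *name;
};

/* Values 0..3 mirror the help string in the diff; the names are made up. */
static const struct lock_choice lock_choices[] = {
    {0, "auto"},
    {1, "skip"},
    {2, "entire_file"},
    {3, "ranges"},
};

/* Parse a user-supplied setting given as a number ("3") or a name ("ranges").
 * Returns 0 and stores the value in *out if it is allowed, -1 otherwise. */
int parse_lock_algorithm(const char *text, int *out)
{
    size_t n = sizeof(lock_choices) / sizeof(lock_choices[0]);
    char *end = NULL;
    long v = strtol(text, &end, 10);
    int numeric = (end != text && *end == '\0');   /* whole string was a number */
    size_t i;

    for (i = 0; i < n; i++) {
        if ((numeric && v == lock_choices[i].value) ||
            (!numeric && strcmp(text, lock_choices[i].name) == 0)) {
            *out = lock_choices[i].value;
            return 0;
        }
    }
    return -1;   /* not in the allowed set */
}
```

With a plain integer parameter, a setting such as 7 would be accepted at registration time and would have to be caught (or silently ignored) later by the component; with a closed set of choices it is rejected where the value is parsed.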

edgargabriel requested a review from lrbison August 14, 2024 20:17

github-actions bot added this to the v4.1.7 milestone Aug 14, 2024

github-actions bot added the Target: v4.1.x label Aug 14, 2024

edgargabriel changed the title ~~fs/ufs: change default locking protocol~~ fs/ufs: change default locking protocol - v4.1.x Aug 14, 2024

lrbison approved these changes Aug 14, 2024

jsquyres reviewed Sep 24, 2024

bwbarrett merged commit 1d02355 into open-mpi:v4.1.x Sep 30, 2024
12 checks passed

edgargabriel Sep 25, 2024