Proposal: Lazy statepoint loading #238

bdice · 2019-10-31T22:54:08Z

Feature description

I am experimenting with refactoring statepoint loading to be lazy. Currently, project.open_job() is called on every job during a flow run command, which causes the statepoints to be loaded. However, the statepoint information isn't even used in many cases. This may be a case where signac is executing unnecessary I/O (which is costly for large projects like the ones I'm working with).

Proposed solution

The property job.statepoint currently just returns job._statepoint, meaning that the internal state is effectively identical to the publicly-accessible state. I am investigating making the property job.statepoint load lazily, with job._statepoint = None until the statepoint is requested.

I see a HUGE performance boost (from 30 seconds to <0.5 seconds for a simple operation on 3000 jobs) since the I/O is dropped dramatically, but some tests fail because job statepoint corruption (which raises a JobsCorruptedError) is not detected at the time of project.open_job(). I think that lazy loading is generally safe except in cases where the statepoint hash and job id don't match (corrupted jobs).

I just wanted to open this for discussion - I am not sure where we stand on approaches to handling corrupted data. Specifically, I want to know whether we think that validating hash(statepoint) == id (and thus the I/O cost of a statepoint load) is always necessary when opening a job by id.

Additional context

Maybe investigate async as an alternative...? Still expensive but maybe less so.

The text was updated successfully, but these errors were encountered:

bdice · 2020-01-26T18:31:35Z

We discussed this offline (I can't remember who was a part of the discussion). Our conclusion was that loading a job by id should not require validating that hash(statepoint) == id. A user (or I/O error) may corrupt the data space by breaking that invariant, and the user will still be allowed to open the job by id. In the proposed lazy-load of statepoints in #239, job corruption will be checked on accessing job.statepoint. (Note that this may not reflect the current design in that PR.) @glotzerlab/signac-committers Was this a conclusion you recall and/or agree with?

csadorf · 2020-01-27T09:57:24Z

I am ok with lazy loading and only validating the state point metadata when it is accessed.

b-butler · 2020-01-27T15:29:24Z

@bdice I don't remember being a part of the conversation, but I agree with the approach.

vyasr · 2020-01-27T18:21:54Z

@bdice and I did discuss and agree on this approach. I need to post my pseudo-prototype for #249 along with a class diagram so that we can progress on that front. IMO that serves as the cleanest path forward for implementing lazy loading by isolating the logic for synchronization and ensuring that a clear set of invariants are well-defined and well-tested for the different possible use-cases of nesting and buffering.

bdice · 2020-01-27T18:32:28Z

@vyasr I think (but am not sure) that my intended implementation of lazy loading can be clearly separated from your work on #249. I may need to work on #239 a little more to be sure, but I believe my implementation will rely more heavily on the Project class's caching and opening of jobs than the specifics of how job state points / job documents are synchronized. You should feel free to go ahead and post your "pseudo-prototype" (😉) in the meantime.

bdice mentioned this issue Oct 31, 2019

Lazy statepoint loading #239

Merged

12 tasks

mikemhenry added the enhancement New feature or request label Nov 18, 2019

vyasr mentioned this issue Nov 26, 2019

Proposal: Unify dict classes and improve buffering and synchronization #249

Closed

bdice linked a pull request Feb 27, 2020 that will close this issue

Lazy statepoint loading #239

Merged

12 tasks

bdice added proposal GSoC Google Summer of Code labels Feb 27, 2020

bdice changed the title ~~Lazy statepoint loading / corruption checks~~ Proposal: Lazy statepoint loading Feb 27, 2020

bdice added this to the v2.0.0 milestone Mar 16, 2020

bdice closed this as completed in #239 Dec 30, 2020

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Proposal: Lazy statepoint loading #238

Proposal: Lazy statepoint loading #238

bdice commented Oct 31, 2019 •

edited

Loading

bdice commented Jan 26, 2020 •

edited

Loading

csadorf commented Jan 27, 2020

b-butler commented Jan 27, 2020

vyasr commented Jan 27, 2020

bdice commented Jan 27, 2020

Proposal: Lazy statepoint loading #238

Proposal: Lazy statepoint loading #238

Comments

bdice commented Oct 31, 2019 • edited Loading

Feature description

Proposed solution

Additional context

bdice commented Jan 26, 2020 • edited Loading

csadorf commented Jan 27, 2020

b-butler commented Jan 27, 2020

vyasr commented Jan 27, 2020

bdice commented Jan 27, 2020

bdice commented Oct 31, 2019 •

edited

Loading

bdice commented Jan 26, 2020 •

edited

Loading