Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Support batch archiving #7493

Closed
qqmyers opened this issue Dec 23, 2020 · 0 comments · Fixed by #8610
Closed

Support batch archiving #7493

qqmyers opened this issue Dec 23, 2020 · 0 comments · Fixed by #8610
Assignees
Labels
TDL of interest to the Texas Digital Library

Comments

@qqmyers
Copy link
Member

qqmyers commented Dec 23, 2020

Currently, the only ways to trigger archiving leveraging the OA-ORE and Bag outputs is to configure a post-publication workflow or to use the API call that can archive a single specified datset version. To support batch archiving, e.g. of all dataset versions not yet archived (e.g. one published since the last archiving run), an API to find/archive all new dataset versions would be useful.

TDL has created such an admin API call that will query to find unarchived dataset versions and, optionally to start a thread to archive them. The thread is asynchronous, creates Bags sequentially (to avoid having multiple versions taking processing power, memory and temporary disk space), and logs successes and failures. The API has three query parameters that can be used in combination:

  • listonly - true: retrieves the list of unarchived versions but does not attempt to archive any
  • latestonly - true: only lists/processes the most recently published version of a given dataset
  • limit - maximum number of versions to attempt to archive

PR to follow.

@qqmyers qqmyers self-assigned this Apr 12, 2022
@qqmyers qqmyers added the TDL of interest to the Texas Digital Library label Apr 12, 2022
pdurbin added a commit that referenced this issue Apr 14, 2022
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
TDL of interest to the Texas Digital Library
Projects
None yet
1 participant