-
Notifications
You must be signed in to change notification settings - Fork 50
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
[ENH] Force-delete for bad builds #482
Comments
I've experienced something similar. Builds will get stuck for days and any new builds will be listed as "Queued" while they are stuck. Has there been any progress on a fix for this? An example build that has been getting stuck regularly is attached. We are using conda-store v0.4.14 |
Hi @wroddenMSS while I don't have a long-term fix, I can share a possible workaround. My hypothesis is that the database is in a bad-state somehow and we need to manually delete the tainted records; to do this, you will need access to the underlying postgresql database. On you local machine that means you can connect to it via In either case, next you will need the database credentials to login. Locally this is a config.json/config.yaml or similar. On Nebari you can get it these secrets with this command:
From there you can
Once connected to the database, you can try and delete the build that is causing trouble; here's more detail on the conda-store database. |
So a few things having builds lingering in the BUILDING state doesn't mean that the build is still building. It is due to the worker being interupted and not updating the database when the build fails. I've merged a PR which is in main to mark these as FAILED. #530 which is the proper fix for this. Once it is in the FAILED state it is possible to delete that build and a force delete is not necessary. |
With the upcoming release of conda-store the following issues should help with this issue: |
Occasionally builds wind up in a bad state and need to be deleted. At the moment, there is no convenient way of doing this other than deleting the build from the DB directly (AFAIK).
As an example, build 155 has been running for a few weeks now:
The text was updated successfully, but these errors were encountered: