Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

No mechanism for Orphaned Editions to be removed from Solr once resolved #893

Closed
hornc opened this issue Mar 19, 2018 · 3 comments
Closed
Assignees
Labels
Module: Merging Record merging Module: Solr Issues related to the configuration or use of the Solr subsystem. [managed]

Comments

@hornc
Copy link
Collaborator

hornc commented Mar 19, 2018

Orphaned editions (ones without Works) are currently indexed in Solr as /works/<edition_ID> e.g /works/OL20454633M

This is a workaround to allow them to appear in search results

# When an edition does not contain a works list, create a fake work and index it.

When a work is added by a move or other edit, there is nothing that notices the change and removes the old /works/<edition_ID> from the index, so the fake work will continue to appear in search results.

/works/<edition_ID> needs a Delete update sent to Solr.

Currently the only way to remove them is to manually add the /works/<edition_ID> keys to admin/solr interface.

Suggestion:
When the solr updater script sees an edition edit in the Infogami logs, in the changeset / changes fields, it should send a delete request to Solr for the /works/<edition_ID> This will produce some delete requests for non-existent keys, but this does not cause a problem for solr. The alternative would be to query solr to find whether a '/works/` key exists for that edition, and delete if so. My thinking is that sending a delete anyway will put less load on solr but have the same end result.

@hornc hornc added Module: Solr Issues related to the configuration or use of the Solr subsystem. [managed] Module: Merging Record merging labels Mar 19, 2018
@hornc hornc changed the title No mechanism whereby Orphaned Editions are removed form Solr once resolved No mechanism for Orphaned Editions to be removed form Solr once resolved Mar 19, 2018
@hornc hornc changed the title No mechanism for Orphaned Editions to be removed form Solr once resolved No mechanism for Orphaned Editions to be removed from Solr once resolved Mar 19, 2018
@hornc hornc self-assigned this Apr 3, 2018
@hornc hornc added the blocker label Apr 3, 2018
@hornc
Copy link
Collaborator Author

hornc commented Apr 3, 2018

Added the "blocker" tag because bulk or manual resolving orphaned editions is not much use if search results and author work lists are going to show them indefinitely.

@LeadSongDog
Copy link

LeadSongDog commented Apr 3, 2018

These fake works are particularly egregious in that an edition without all three of an author, a publisher, and a publication place seems to create a new fake work even in the attempt to link that edition to an existing known work.

@hornc
Copy link
Collaborator Author

hornc commented Oct 23, 2018

Resolved in cd67d7b

@hornc hornc closed this as completed Oct 23, 2018
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
Module: Merging Record merging Module: Solr Issues related to the configuration or use of the Solr subsystem. [managed]
Projects
None yet
Development

No branches or pull requests

2 participants