Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

TED recipe by topic fails with HTTP read timed out in extract_info_from_video_page #162

Closed
benoit74 opened this issue Feb 29, 2024 · 6 comments · Fixed by #198
Closed

TED recipe by topic fails with HTTP read timed out in extract_info_from_video_page #162

benoit74 opened this issue Feb 29, 2024 · 6 comments · Fixed by #198
Assignees
Milestone

Comments

@benoit74
Copy link
Collaborator

While many TED recipe by topic are working, some are systematically failing (at least twice in a row) with an HTTP read timeout in the extract_info_from_video_page operations:

Looks like a given recipe is not failing always on the same video (but is it because video order is random?)

@benoit74 benoit74 added this to the 2.1.1 milestone Feb 29, 2024
@benoit74
Copy link
Collaborator Author

benoit74 commented Mar 1, 2024

@benoit74
Copy link
Collaborator Author

benoit74 commented Mar 8, 2024

Not reproduced locally.

A common point on all these tasks is that they all ran on pixelmemory, except https://farm.openzim.org/pipeline/4058f2dc-e460-498a-991f-e72c5a12c680 which ran on athena18. Maybe just a coincidence, but I will request them again on another worker and we will see.

@benoit74
Copy link
Collaborator Author

benoit74 commented Mar 8, 2024

Task https://farm.openzim.org/pipeline/ce6d0579-a246-4098-ac78-d35d78cec00e/debug is already past the point where it previously failed. This hence looks like an issue specific to some workers? I will confirm by starting again a small recipe on pixelmemory

@benoit74
Copy link
Collaborator Author

Seems to be transient issues because https://farm.openzim.org/pipeline/70613e69-5e39-462e-8c3a-1a2354ddbfba from https://farm.openzim.org/recipes/ted_topic_history now succeeded on pixelmemory.

Closing this issue for "not reproduced" for now. I've added a section to the FAQ.

@benoit74 benoit74 closed this as not planned Won't fix, can't repro, duplicate, stale Mar 11, 2024
@benoit74 benoit74 modified the milestones: 2.1.1, 3.0.0 Mar 19, 2024
@benoit74
Copy link
Collaborator Author

Issue is still happening, always on pixelmemory worker, but mostly randomly.

We need to look into how to make this more robust / identify the underlying root cause which happens only on pixelmemory worker.

@benoit74 benoit74 reopened this Apr 28, 2024
@benoit74 benoit74 modified the milestones: 3.0.0, 3.1.0 Apr 28, 2024
@benoit74 benoit74 self-assigned this May 14, 2024
@benoit74 benoit74 modified the milestones: 3.1.0, 3.0.1 May 14, 2024
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging a pull request may close this issue.

1 participant