Update scraping to support checking high frequency articles before completing entire job #12

carlgieringer · 2017-11-15T21:57:24Z

The current run_continuously.py may run for two hours to scrape all files, which is well over the expectation of scraping new articles every 5 minutes. This would probably best be handled by some distributed or at least multi-thread/process solution.

The text was updated successfully, but these errors were encountered:

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Update scraping to support checking high frequency articles before completing entire job #12

Update scraping to support checking high frequency articles before completing entire job #12

carlgieringer commented Nov 15, 2017

Update scraping to support checking high frequency articles before completing entire job #12

Update scraping to support checking high frequency articles before completing entire job #12

Comments

carlgieringer commented Nov 15, 2017