yandex-q

Scripts that were used to scrape and process data from Yandex.Q. The resulting dataset can be found here.
Some scripts are messy, but they get the job done.

Scripts used

parse_questions_search.py - collects questions by searching for every 4-letter combination, because each search returns at most 1000 items
parse_question_ids.py - collects question ids through the question recommendation endpoint
get_ids.py - extracts ids from the questions retrieved by search
parse_qa.py - fetches the full question info for every collected id
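
The 4-letter search strategy used by parse_questions_search.py can be sketched roughly as below. This is only an illustration, not the code from the repository: the function name `four_letter_queries` and the alphabet (lowercase Russian letters without "ё") are assumptions, and the actual script may use a different character set or send the queries in a different order.

```python
from itertools import product
from typing import Iterator

# Assumption: lowercase Russian alphabet without "ё" (32 letters).
# The real script may search over a different set of characters.
ALPHABET = "абвгдежзийклмнопрстуфхцчшщъыьэюя"


def four_letter_queries(alphabet: str = ALPHABET, length: int = 4) -> Iterator[str]:
    """Yield every string of `length` letters drawn from `alphabet`.

    Each yielded string is meant to be used as one search query, so that
    no single query has to return more results than the per-search limit.
    """
    for letters in product(alphabet, repeat=length):
        yield "".join(letters)


if __name__ == "__main__":
    # Print the first few queries and the total number of queries to send.
    queries = four_letter_queries()
    for _ in range(3):
        print(next(queries))
    print(len(ALPHABET) ** 4, "queries in total")
```

Splitting the search space this way trades one large request for many small ones, so results from different queries will overlap and the collected ids need to be deduplicated afterwards.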
