This repository hosts a small subset of the Microsoft Knowledgebase metadata. The data in the data.json
contains the following:
- Date of the KB publication
- KB UUID
- KB Slug
- KB URL
The list of KB IDs in the database is scraped from the URLs in discovery.txt
. The primary usecase of the dataset is to provide a KB:DATE
mapping to other projects.
Discovery notes. Need to check the sitemaps more thoroughly before i automate this
See https://learn.microsoft.com/_sitemaps/sitemapindex.xml
curl --silent https://learn.microsoft.com/_sitemaps/officeupdates_en-us_1.xml | yq -p xml -o c '.urlset.url[]|.loc' >> discovery.txt
curl --silent https://learn.microsoft.com/_sitemaps/security-updates_en-us_1.xml | yq -p xml -o c '.urlset.url[]|.loc' >> discovery.txt
Data and code in this repository is licensed under Creative Commons Zero v1.0 Universal.