A chrome extension save webpage snapshot as html 一个将网页渲染结果保存为 HTML 代码的 chrome 插件
When you need a large number of webpages to do some AI thing, it must help a lot!
- inline all same origin external styles
- grab the rendered webpage code
- remove all scripts
- remove all iframe srcs
- normalize all urls