Concurrent web scraper with primitive cache

How to run

A prerequisite is to have Go 1.21+ installed
In root repo dir, run:

go run .

Or, using docker, run:

docker build -t scraper .
docker run -d -p 1337:1337 scraper

Above commands will build and run the container, binding your host's port 1337 to the same port on the container.

Sites sourced from urls in main.go will get scraped, with the retrieved words displayed on localhost:1337/metrics

As of writing this README, the scraper runs an unholy amount of Goroutines.
The scraper does not follow redirects, only finds href elements,
It does not use ETag and If-None-Match headers, nor does it use If-Modified-Since (the cache isn't persistent and there is no way to invalidate it)

Name		Name	Last commit message	Last commit date
Latest commit History 23 Commits
.idea		.idea
cache		cache
cache_test		cache_test
data		data
scraper		scraper
Dockerfile		Dockerfile
README.md		README.md
go.mod		go.mod
main.go		main.go
metrics.go		metrics.go