enfsolar.com Scraper

Given a list of URLs, as a .csv file, the program extracts company contact information from each individual URL.

Installation

pip install -r requirements.txt

or

peotry install

Using with selenium

requires having hombrew installed, how to install: https://brew.sh

homebrew is a macOS package manager

then

brew install cask

Installation

brew cask install firefox, or just download it https://www.mozilla.org/en-US/firefox/new/
brew install geckodriver, install the geckodriver for selenium

Running the program

Scrape & download:

python main.py --scrape --csv-file /path/to/csv/file
Parse & write report:

python main.py --extract

Params

--scrape, download HTML content of each URL

--headless, do no show the selenium browser window, when scraping

--extract, extracts contact information from each file

--csv-file, path to the input CSV file

Name		Name	Last commit message	Last commit date
Latest commit History 12 Commits
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
main.py		main.py
poetry.lock		poetry.lock
pyproject.toml		pyproject.toml
requirements.txt		requirements.txt
utils.py		utils.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

enfsolar.com Scraper

Installation

Using with selenium

Installation

Running the program

Params

About

Releases

Packages

Languages

License

xbogdan/enfsolar.com-scraper

Folders and files

Latest commit

History

Repository files navigation

enfsolar.com Scraper

Installation

Using with selenium

Installation

Running the program

Params

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages