foi-tracker-scraper

This project scrapes links to PDF files from the OIC website and downloads the files to your local machine.

Prerequisites

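Create a virtual environment:

python -m venv env

Activate it (source env/bin/activate on Linux or macOS, env\Scripts\activate on Windows), then install the dependencies:

pip install -r requirements.txt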

The following command runs the spider defined in scraper.py and saves the output to pdf_files.json. The -O flag overwrites the file if it already exists.

scrapy runspider -O pdf_files.json scraper.py
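
To check what the spider wrote, a short script can load pdf_files.json and summarize it. This is only a sketch: Scrapy's JSON feed export writes a single array of item objects, but the field names depend on what scraper.py yields, so the helper below (summarize_feed, a hypothetical name) just reports whatever keys it finds.

```python
import json
from pathlib import Path


def summarize_feed(path):
    """Return (item_count, sorted_field_names) for a Scrapy JSON feed."""
    # A .json feed export is one JSON array of item objects.
    items = json.loads(Path(path).read_text(encoding="utf-8"))
    # Collect every key that appears in at least one item.
    fields = sorted({key for item in items for key in item})
    return len(items), fields


if __name__ == "__main__":
    feed = Path("pdf_files.json")
    if feed.exists():
        count, fields = summarize_feed(feed)
        print(f"{count} items, fields: {fields}")
    else:
        print("pdf_files.json not found; run the spider first")
```

An item count of zero usually means the spider's selectors did not match anything on the page.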

After running the spider, you can download the PDFs using the download-pdf.py script.

python download-pdf.py
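
For readers who want a picture of the download step, here is a minimal sketch of what such a script could look like. It is not the contents of download-pdf.py. It assumes each item in pdf_files.json stores its link under a url key and that files are saved into the pdfs directory; both are assumptions, as are the helper names filename_for and download_all.

```python
import json
from pathlib import Path, PurePosixPath
from urllib.parse import unquote, urlparse
from urllib.request import urlopen

URL_FIELD = "url"  # assumption: key that holds the PDF link in each item


def filename_for(url):
    """Derive a local file name from the last path segment of the URL."""
    name = PurePosixPath(unquote(urlparse(url).path)).name
    return name or "unnamed.pdf"


def download_all(feed_path="pdf_files.json", output_dir="pdfs"):
    """Fetch every linked PDF not already on disk; return how many were fetched."""
    feed = Path(feed_path)
    if not feed.exists():
        return 0  # nothing to do until the spider has produced its output
    items = json.loads(feed.read_text(encoding="utf-8"))
    out = Path(output_dir)
    out.mkdir(parents=True, exist_ok=True)
    fetched = 0
    for item in items:
        url = item.get(URL_FIELD)
        if not url:
            continue  # skip items without a link
        target = out / filename_for(url)
        if target.exists():
            continue  # already downloaded on an earlier run
        with urlopen(url, timeout=30) as response:
            target.write_bytes(response.read())
        fetched += 1
    return fetched


if __name__ == "__main__":
    print(f"Downloaded {download_all()} file(s)")
```

Skipping files that already exist makes the script safe to rerun after an interrupted download, at the cost of not refreshing a PDF that has changed on the server.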
