wevisdemo/foi-tracker-scraper

foi-tracker-scraper

This project scrapes PDF files from the OIC website and downloads them to your local machine.

Prerequisites

  • Python 3.x
  • pip (Python package installer)

Setup

  1. Create a Python virtual environment

python -m venv env

  2. Activate the virtual environment

    • On Windows:
    .\env\Scripts\activate
    • On macOS and Linux:
    source env/bin/activate

  3. Install the required dependencies

pip install -r requirements.txt

Running the Scraper

  1. Run the spider crawler

This command runs the spider defined in scraper.py and writes its output to pdf_files.json. The -O flag overwrites the file if it already exists.

scrapy runspider -O pdf_files.json scraper.py
  2. Download the PDFs

After running the spider, you can download the PDFs using the download-pdf.py script.

python download-pdf.py
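
For readers who want to see what the download step involves, here is a minimal sketch using only the Python standard library. It is not the code in download-pdf.py. It assumes pdf_files.json is a JSON array of objects (the format Scrapy writes for a .json output file) and that each object stores its link under a field named "url". That field name, the pdfs output folder, and the function names are hypothetical and chosen only for illustration.

```python
import json
import urllib.request
from pathlib import Path
from urllib.parse import unquote, urlparse


def load_pdf_urls(json_path, url_field="url"):
    """Return the links listed in the spider output.

    Assumes a JSON array of objects; `url_field` is a hypothetical
    field name, so adjust it to match what scraper.py actually emits.
    """
    with open(json_path, encoding="utf-8") as f:
        items = json.load(f)
    # Skip items that have no link under the assumed field.
    return [item[url_field] for item in items if item.get(url_field)]


def filename_from_url(url):
    """Use the last path segment of the URL as the local file name."""
    name = Path(unquote(urlparse(url).path)).name
    return name or "download.pdf"


def download_all(urls, out_dir="pdfs"):
    """Download each URL into out_dir and return the paths that were saved.

    Files that already exist are skipped, so the function can be re-run
    after an interrupted download.
    """
    out = Path(out_dir)
    out.mkdir(parents=True, exist_ok=True)
    saved = []
    for url in urls:
        target = out / filename_from_url(url)
        if target.exists():
            continue
        try:
            with urllib.request.urlopen(url, timeout=30) as resp:
                target.write_bytes(resp.read())
        except OSError as exc:  # URLError and timeouts are OSError subclasses
            print(f"failed: {url} ({exc})")
            continue
        saved.append(target)
    return saved
```

Under these assumptions, calling download_all(load_pdf_urls("pdf_files.json")) would save every listed PDF into a pdfs folder next to the script and report any link that could not be fetched.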
