- Download the access logs and parse them to retrieve the actual access counts for the files being hosted on S3
- Generate a CSV with fields: file, date, downloads
$ virtualenv pyenv --no-site-packages
$ source pyenv/bin/activate
$ pip install -r requirements.txt
$ python sync.py [the flags]
$ get a coffee (if initial_import)
$ python parse.py --folder [the folder name where the log files got downloaded to]