Skip to content

Generate daily download stats for files hosted on S3 from the logs

License

Notifications You must be signed in to change notification settings

ogdch/s3-logs-analyzer

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

9 Commits
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

S3-Logs-Analyzer

  • Download the access logs and parse them to retrieve the actual access counts for the files being hosted on S3
  • Generate a CSV with fields: file, date, downloads

Setup

$ virtualenv pyenv --no-site-packages
$ source pyenv/bin/activate
$ pip install -r requirements.txt

Download S3 log files (only downloads non existing ones)

$ python sync.py [the flags]
$ get a coffee (if initial_import)

Parse

$ python parse.py --folder [the folder name where the log files got downloaded to]

About

Generate daily download stats for files hosted on S3 from the logs

Resources

License

Stars

Watchers

Forks

Packages

No packages published

Contributors 4

  •  
  •  
  •  
  •  

Languages