This project is now complete, and we are working on sorting and finalizing the data. Thank you to everyone who contributed!
Worker for the Save Community Captions project: Archiving unpublished YouTube community contributions.
Lost? Click here to learn what this is all about!
See how much has been archived so far.
- https://atdash.meo.ws/d/attv2/archive-team-tracker-charts-v2?orgId=1&var-project=ext-yt-communitycontribs
- https://tracker.archiveteam.org/ext-yt-communitycontribs/
To run these tools, you will need to supply session cookies (SSID, HSID, SID); see the tutorial for more details.
A wrapper repo is available for free and easy deployment and environment configuration, as well as automatic updates every 24-27.6 hours. Deploy up to 5 instances of it to a free Heroku account (total maximum monthly runtime: 550 hours), with no credit card verification required, by clicking the button below.
After completing the above setup steps, simply run
python3 worker.py
Stable Docker Image:
docker pull fusl/ytcc-archive
Run:
docker container run --restart=unless-stopped --network=host -d --tmpfs /grab/out --name=grab_ext-yt-communitycontribs -e HSID=XXX -e SID=XXX -e SSID=XXX -e TRACKER_USERNAME=Fusl -e PYTHONUNBUFFERED=1 fusl/ytcc-archive
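The container receives the session cookies through the HSID, SID and SSID environment variables, so a quick pre-flight check can catch a missing value before the worker starts. The sketch below is only an illustration and not part of this repository: the helper name missing_vars is hypothetical, and treating TRACKER_USERNAME as required is an assumption based on the command above.

```python
import os

# Variable names taken from the `docker container run` command above.
# Assumption: all four must be set and non-empty for the worker to run.
REQUIRED_VARS = ("HSID", "SID", "SSID", "TRACKER_USERNAME")


def missing_vars(environ=os.environ):
    """Return the names of required variables that are unset or empty."""
    return [name for name in REQUIRED_VARS if not environ.get(name)]


# Report the result for the current environment without aborting.
missing = missing_vars()
print("Missing:", ", ".join(missing) if missing else "none")
```

Passing a plain dictionary instead of os.environ makes the check easy to try out without touching the real environment.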
This feature requires an older version of export.py. Get this file, Python 3, and the requests module (pip install requests). Then, simply run python3 exporter.py followed by a list of space-separated YouTube video IDs, and all community-contributed captions and titles/descriptions in all languages will be exported.
Simply run python3 discovery.py followed by a list of space-separated YouTube video IDs, and a list of discovered video, channel, and playlist IDs will be printed, along with whether caption contributions are enabled.
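When there are many video IDs to process, it may help to split them across several invocations. The following is a minimal sketch of that idea, assuming only the command-line form described above (script name followed by space-separated IDs). The helper build_commands and the default batch size of 50 are hypothetical choices for illustration, and the IDs shown are placeholders; nothing here reflects a limit imposed by the scripts themselves.

```python
def build_commands(script, video_ids, batch_size=50):
    """Split video_ids into batches and return one argv list per batch."""
    if batch_size < 1:
        raise ValueError("batch_size must be at least 1")
    # Drop duplicate IDs while keeping their original order.
    ids = list(dict.fromkeys(video_ids))
    return [
        ["python3", script, *ids[i:i + batch_size]]
        for i in range(0, len(ids), batch_size)
    ]


# Placeholder IDs; replace them with real YouTube video IDs.
for argv in build_commands("discovery.py", ["VIDEO_ID_1", "VIDEO_ID_2"]):
    # Print the command line; to execute it instead, one could use
    # subprocess.run(argv, check=True) from the standard library.
    print(" ".join(argv))
```

The same helper can build command lines for exporter.py by passing that script name instead.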