This is an OODT RADIX application that uses Apache Solr, Apache Tika and Apache OODT to ingest 10s of millions of files (images,but could be extended to other files) in place, and to extract metadata and OCR information from those files/images using Tika and Tesseract OCR.
See the wiki for more information on installing and running ImageCat:
You can clone the wiki by running
git clone https://github.com/chrismattmann/imagecat.wiki.git
Send them to Chris A. Mattmann.