Hello,
I am a beginner and need your help. I use Tika Server to extract metadata and text with the OCR strategy set to auto. On native PDF documents there is no problem, as the processing time is low, but on scanned PDF files (hundreds of pages) I hit the request timeout, whether I call the server through Python or curl.
Is there a way to configure the tika-config.yml file so that OCR processes all the pages with strategy auto?
Thanks in advance.