I need to integrate Tesseract OCR with Tika to convert scanned images (for example, pages inside a PDF) to text.
A TesseractOCRParser is already available in Tika,
but I can't find any documented method for invoking it.
When I try to build Tika with the path to my tesseract-ocr installation configured, I get the following error:
Results :
Failed tests: testNoConfig(org.apache.tika.parser.ocr.TesseractOCRConfigTest): Invalid default tesseractPath value expected:<[]> but was:<[/home/serendio/tesseract-ocr/]>
Tests run: 569, Failures: 1, Errors: 0, Skipped: 7
Can anyone help me out?
Or is there any other way to resolve this problem?
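
One other way I can think of, as a rough sketch only, is to skip the Tika build and call the tesseract binary directly from Java. This assumes tesseract is installed, that the scanned page has already been exported as an image file (tesseract does not read PDFs directly), and that the binary sits in the directory passed in, or on the PATH when that directory is empty. The class and method names (TesseractCli, buildCommand, ocr) are made up for illustration and are not part of Tika. It needs Java 9 or later for readAllBytes.

```java
import java.io.IOException;
import java.nio.charset.StandardCharsets;
import java.nio.file.Paths;
import java.util.Arrays;
import java.util.List;

// Hypothetical helper, not part of Tika: runs the tesseract command-line tool.
class TesseractCli {

    // Builds "<tesseractDir>/tesseract <image> stdout".
    // An empty tesseractDir means "look the binary up on the PATH".
    static List<String> buildCommand(String tesseractDir, String imagePath) {
        String exe = tesseractDir.isEmpty()
                ? "tesseract"
                : Paths.get(tesseractDir, "tesseract").toString();
        // "stdout" as the output base makes tesseract print the text
        // to standard output instead of writing a .txt file.
        return Arrays.asList(exe, imagePath, "stdout");
    }

    // Runs tesseract on one image and returns the recognized text.
    static String ocr(String tesseractDir, String imagePath)
            throws IOException, InterruptedException {
        ProcessBuilder pb = new ProcessBuilder(buildCommand(tesseractDir, imagePath));
        // Pass tesseract's diagnostics through to our own stderr so they
        // neither mix with the recognized text nor fill up an unread pipe.
        pb.redirectError(ProcessBuilder.Redirect.INHERIT);
        Process p = pb.start();
        String text = new String(p.getInputStream().readAllBytes(), StandardCharsets.UTF_8);
        int exit = p.waitFor();
        if (exit != 0) {
            throw new IOException("tesseract exited with code " + exit);
        }
        return text;
    }

    public static void main(String[] args) throws Exception {
        if (args.length < 1) {
            System.err.println("usage: java TesseractCli <image-file> [tesseract-dir]");
            return;
        }
        String dir = args.length > 1 ? args[1] : "";
        System.out.println(ocr(dir, args[0]));
    }
}
```

With this I would run something like "java TesseractCli page.png /home/serendio/tesseract-ocr/", where page.png is a placeholder for one exported page. It does not fix the failing TesseractOCRConfigTest, it only avoids changing Tika's default tesseractPath.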