I am currently working on a project that uses Tess4j, a Java wrapper for the Tesseract OCR engine. While working on it I have come across many websites stating that Tesseract works best on images of at least 300 DPI (dots per inch).
My question is why DPI is mentioned so often for images. I understand that when you scan a document, you want to scan it at 300 DPI or more. I just cannot figure out why this is relevant for pictures taken with a camera. As far as I know, DPI is a property of the printer: for the same image, a higher DPI gives a smaller print, but with greater quality.
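To make my understanding concrete, here is a minimal sketch of the arithmetic as I see it: DPI only converts between pixel counts and physical inches. The class name, method names, and the 3000-pixel width are hypothetical and chosen purely for illustration; nothing here comes from Tess4j or Tesseract.

```java
// Illustrative only: DPI as a conversion factor between pixels and inches.
class DpiSketch {

    // Physical size in inches of a run of pixels printed at the given DPI.
    static double printedInches(int pixels, double dpi) {
        return pixels / dpi;
    }

    // Number of pixels needed to cover a physical length at the given DPI.
    static long pixelsFor(double inches, double dpi) {
        return Math.round(inches * dpi);
    }

    public static void main(String[] args) {
        int widthPx = 3000; // hypothetical camera image width in pixels

        // Same pixels, different DPI tag: only the implied physical size changes.
        System.out.println("At 72 DPI:  " + printedInches(widthPx, 72) + " in");
        System.out.println("At 300 DPI: " + printedInches(widthPx, 300) + " in");

        // Scanning an 8.5 inch wide page at each DPI yields different pixel counts.
        System.out.println("8.5 in at 72 DPI:  " + pixelsFor(8.5, 72) + " px");
        System.out.println("8.5 in at 300 DPI: " + pixelsFor(8.5, 300) + " px");
    }
}
```

By this reasoning, changing only the DPI value of a camera picture leaves its pixels untouched, which is why the different OCR results surprise me.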
Now, if DPI has nothing to do with these images, then I am wondering why the results of my program differ when I change the DPI property of the images between 72 and 300. Is there a preprocessing step in Tesseract that I am unaware of?