Binarisation of image for OCR

Question

I want to turn scanned images into black and white images, the goal is to reduce the file size before the images are transferred over the internet for OCR.

The normal binarisation/ black and white images created by scanners/ general image editing software produces undesirable results.

Lots of random black pixels are left behind which are really just noise from binarisation, this causes the OCR to try and recognise characters where there are none, or insert full stops, colons etc after characters.

What can I use in OpenCV to binarise an image, keep lines, characters & dark areas solid, and, reduce pixel noise in white areas?

I've toyed with cvThreshold and cvAdaptiveThreshold but results have not been great yet.

As an example, check out this original image and the desired result.

Your example appears to be trinary, I see at least one shade of gray in addition to the black and white. — Mark Ransom
@MarkRansom When I went back and looked at the images in IrfanView, I thought you were right and I must have saved the B&W image wrong. However, when looking at the images in Gimp, the pixels are just B&W. What are you using to view the image? In my case I trust gimp over IrfanView. — Michael
I was looking at it in Chrome. Today in Firefox it looks OK, don't know what happened. — Mark Ransom

praks411 praks411 · Accepted Answer · 2013-05-15T23:09:59

You can try this however you still need to adjust some parameters.

#define ALPHA_SCALE 2
#define THRESHOLD_VAL 40
#define MAX_VAL_FOR_THRESHOLD 250
#define PIXEL_MISMATCH_COUNT 10 //9, 7
Mat current_frame_t2;        

     IplImage *img = cvLoadImage("Original.tiff", CV_LOAD_IMAGE_UNCHANGED );
     cvNamedWindow("My_Win", CV_WINDOW_AUTOSIZE);
    // namedWindow("My_Win", 1);
     cvShowImage("My_Win", img);
      cvWaitKey(10);
     Mat current_frame_t1(img);
     cvtColor(current_frame_t1, current_frame_t2, CV_RGB2GRAY);
    current_frame_t1.release();
    imshow("My_Win", current_frame_t2);
     cvWaitKey(10);
     equalizeHist(current_frame_t2, current_frame_t1);
    current_frame_t2.release();
    convertScaleAbs(current_frame_t1, current_frame_t2,ALPHA_SCALE);

    threshold(current_frame_t2, current_frame_t1, THRESHOLD_VAL, MAX_VAL_FOR_THRESHOLD, CV_THRESH_BINARY);
    medianBlur(current_frame_t1,current_frame_t2,1); 
    imshow("My_Win", current_frame_t2);
    imwrite("outimg.tiff", current_frame_t2),
    cvWaitKey(0);

Binarisation of image for OCR

2 Answers