1
votes

I used Azure Cognitive Vision API to extract the text from a cheque image. But, it is not correctly extracting the text from cheque. Example MICR code having characters like " || are incorrectly read into some other digits. Is there any way we can work on to improve the accuracy or set some context to specifically extract text from cheque images?

Sample Image used for Vision API: enter image description here

Output of Vision API:

MDBS
Date
2 16
NOT NEG
AS PAYES
D
D
M
M
Y
Y
Pay
Hexagon metrology Asia Pacific
or Bearer
SECURA 3011678/2 06/19
Singapore Siahd hundred and niudy- S$ 896<
Dollars
Sit only
2M PRECISION ENGRG
DBS Bank Ltd
PTE LTD
Cheque No.
Bank/Branch Code
Account No.
Please sign above this line
1;91 308 2081717 00540 0549001996
1
please add more details about the sample image and output that you are getting. - Ram-msft
Ram, I added the sample image I used and its output produced by Vision API. I blackened some text due to security concerns. - atulsri

1 Answers

1
votes

This custom tags is currently not available. However, our engineers are working to bring this functionality to Computer Vision. You can use the custom vision to detect.

Azure Form Recognizer does a fantastic job in creating a viable solution with just five sample documents. It performs end-to-end Optical Character Recognition (OCR) on handwritten as well as digital documents with an amazing accuracy score and in just three seconds.

If you are extracting only text, tables and selection marks from documents you should use layout, if you also need to extract key value pairs you can train a custom model or use a pre-built (Invoice, Receipts, Business Cards). Layout results (text, tables and selection marks) are included in all the Analyze outputs (custom and pre-built) in the readResults (text) and pageResults (tables) of the JSON output.

• Layout – extract text, tables selection marks no training required

• Pre-built – Invoice, Receipts, Business Cards – extract values of interest from these type of documents

• Custom – Extract key value pairs trained on your own documents

Here is the snapshot of output text computer vision API. enter image description here.

Please follow the Computer Vision API Frequently Asked Questions that could help.