Home » Courses » Improving Text Recognition capabilities using Transformer
Paid Project
,
Project
,
Improving Text Recognition capabilities using Transformer
In this project, we’ll improve CRNN model using the Beam Search technique for text generation. Then we’ll improve the text recognition capabilities using a State-of-the-art transformer model.  We’ll use TrOCR model from Microsft for building the OCR model. For that we’ll use the base model and make inferences on some images using that, and then we’ll fine-tune that TrOCR base model on the IAM dataset using HuggingFace tool. On completion of the training of the model, we’ll use TrOCR_hadwritting model (which is fine-tuned on the IAM lines dataset).