You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
In this project, we implement a lightweight model SVTR in pytorch for handwritten-ocr task. In terms of steps, we trained a pretrained model with a large of generated data. Then, we fintuned model with real handwritten data.
There are still many hard code here, sorry for that.
Environment
To create virtual environment:
vitualenv venv
pip install -r requiments.txt
Data
To generate data, you can use some handwritten fonts and the text corpus to generate with my repo OCR-Vietnamese-Text-Generator (or the original repo). I also provided some address corpus files in data/corpus for your reference.
Note: data format should be in 2 type (a folder contains all images and a folder contains all label text files)
I had a mistake when building dictionary that duplicates 2 symbols. I dont have the resource to retrain model, so comment warning in python3.10/site-packages/pyctcdecode/alphabet.py to pass the duplicate check. Please use utils/vi_dict_fix.txt to avoid this mistake.