TrOCR is a transformer-based model for state-of-the-art optical character recognition (OCR) on both printed and handwritten text.
It takes an end-to-end approach to text recognition, using pre-trained image transformer and text transformer models for both image understanding and wordpiece-level text generation.
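To make the encoder/decoder split concrete, here is a toy sketch of the generation loop: an encoder turns the image into features, and a decoder emits one wordpiece at a time until it produces an end token. Everything in it is an assumption for illustration only: `encode_image`, `toy_decoder_step`, the `<s>`/`</s>` tokens, and the `##` continuation marker (borrowed from BERT-style WordPiece) are hypothetical stand-ins, not the model's actual components or tokenizer.

```python
from typing import Callable, List, Sequence

BOS, EOS = "<s>", "</s>"  # hypothetical start/end tokens


def encode_image(pixels: Sequence[Sequence[float]]) -> List[float]:
    """Stand-in for the image encoder: one feature per row (mean intensity)."""
    return [sum(row) / len(row) for row in pixels]


def toy_decoder_step(features: List[float], prefix: List[str]) -> str:
    """Stand-in for the text decoder: replays a fixed wordpiece script, then EOS.

    A real decoder would attend over `features` and `prefix` to score the
    whole vocabulary; this stub ignores `features` entirely.
    """
    script = ["hel", "##lo", "wor", "##ld"]
    produced = len(prefix) - 1  # number of pieces emitted so far (excluding BOS)
    return script[produced] if produced < len(script) else EOS


def greedy_decode(
    features: List[float],
    step: Callable[[List[float], List[str]], str],
    max_len: int = 16,
) -> List[str]:
    """Generate wordpieces one at a time until EOS or max_len is reached."""
    tokens = [BOS]
    for _ in range(max_len):
        nxt = step(features, tokens)
        if nxt == EOS:
            break
        tokens.append(nxt)
    return tokens[1:]  # drop BOS


def join_wordpieces(pieces: List[str]) -> str:
    """Merge '##'-prefixed continuation pieces into the preceding word."""
    words: List[str] = []
    for piece in pieces:
        if piece.startswith("##") and words:
            words[-1] += piece[2:]
        else:
            words.append(piece)
    return " ".join(words)


features = encode_image([[0.0, 1.0], [0.5, 0.5]])
text = join_wordpieces(greedy_decode(features, toy_decoder_step))
print(text)
```

The point of the sketch is the control flow: the image is encoded once, while the decoder is called repeatedly with the growing prefix.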
Model checkpoint: trocr-small-stage1
Input resolution: 320x320
Number of parameters (TrOCRDecoder): 38.3M
Model size (TrOCRDecoder) (float): 146 MB
Number of parameters (TrOCREncoder): 23.0M
Model size (TrOCREncoder) (float): 87.8 MB
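The listed sizes can be checked against the parameter counts with simple arithmetic. The sketch below assumes that "float" means 32-bit floats (4 bytes per parameter) and that "MB" is counted as 2^20 bytes; neither is stated in the listing, and the helper name `float32_size_mib` is made up for this example.

```python
BYTES_PER_FLOAT32 = 4   # assumption: "float" means 32-bit weights
MIB = 1024 ** 2         # assumption: sizes are reported in units of 2**20 bytes


def float32_size_mib(num_params: float) -> float:
    """Approximate weight storage for num_params float32 parameters, in MiB."""
    return num_params * BYTES_PER_FLOAT32 / MIB


for name, params in [("TrOCRDecoder", 38.3e6), ("TrOCREncoder", 23.0e6)]:
    print(f"{name}: {float32_size_mib(params):.1f} MiB")
```

Under these assumptions the estimates land within a fraction of a megabyte of the listed 146 MB and 87.8 MB. Some difference is expected because the parameter counts are themselves rounded.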
Publishing
Healthcare
Document Management
Source Model: MIT
Deployable Model: AI-HUB-MODELS-LICENSE