Transformer based model for state‑of‑the‑art optical character recognition (OCR) on both printed and handwritten text.
End‑to‑end text recognition approach with pre‑trained image transformer and text transformer models for both image understanding and wordpiece‑level text generation.
SC8380
Inference Time : 2.21 ms
Memory Usage : 7 MB
Layers : 375 NPU
Model checkpoint:trocr-small-stage1
Input resolution:320x320
Number of parameters (TrOCREncoder):23.0M
Model size (TrOCREncoder):87.8 MB
Number of parameters (TrOCRDecoder):38.3M
Model size (TrOCRDecoder):146 MB
Publishing
Healthcare
Document Management
SC8380
Source Model:MIT
Deployable Model:AI Model Hub License