TrOCR

简介

Transformer based model for state‑of‑the‑art optical character recognition (OCR) on both printed and handwritten text.
End‑to‑end text recognition approach with pre‑trained image transformer and text transformer models for both image understanding and wordpiece‑level text generation.

效果视频

规格与下载

技术细节

Model checkpoint:trocr-small-stage1
Input resolution:320x320
Number of parameters (TrOCRDecoder):38.3M
Model size (TrOCRDecoder) (float):146 MB
Number of parameters (TrOCREncoder):23.0M
Model size (TrOCREncoder) (float):87.8 MB

应用领域

Publishing
Healthcare
Document Management

授权信息

Source Model: MIT
Deployable Model: AI-HUB-MODELS-LICENSE