TrOCR

Introduction

TrOCR is a Transformer-based model for state-of-the-art optical character recognition (OCR) on both printed and handwritten text.
It takes an end-to-end approach to text recognition, combining a pre-trained image Transformer for image understanding with a pre-trained text Transformer for wordpiece-level text generation.
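
The encoder/decoder split described above can be sketched as a greedy decoding loop: the image Transformer runs once per image, and the text Transformer is then called once per generated wordpiece. This is only an illustrative sketch. The function names, the four-entry vocabulary, and the toy stand-ins for the encoder and decoder are all hypothetical and are not part of the deployed model or its runtime API.

```python
from typing import Callable, List, Sequence


def greedy_decode(
    encode: Callable[[object], object],
    decode_step: Callable[[object, Sequence[int]], Sequence[float]],
    image: object,
    bos_id: int,
    eos_id: int,
    max_len: int = 32,
) -> List[int]:
    """Run the encoder once, then call the decoder once per generated token."""
    memory = encode(image)  # image features, computed a single time
    tokens = [bos_id]
    for _ in range(max_len):
        scores = decode_step(memory, tokens)  # one score per vocabulary entry
        next_id = max(range(len(scores)), key=scores.__getitem__)
        tokens.append(next_id)
        if next_id == eos_id:
            break
    return tokens[1:]  # drop the start-of-sequence token


# Hypothetical toy stand-ins so the sketch runs without any model files.
VOCAB = ["<s>", "</s>", "Tr", "OCR"]


def toy_encode(image):
    # A real encoder would return patch embeddings for the image.
    return image


def toy_decode_step(memory, tokens):
    # Emits "Tr", then "OCR", then end-of-sequence, regardless of the image.
    script = [2, 3, 1]
    target = script[min(len(tokens) - 1, len(script) - 1)]
    return [1.0 if i == target else 0.0 for i in range(len(VOCAB))]


ids = greedy_decode(toy_encode, toy_decode_step, image=None, bos_id=0, eos_id=1)
text = "".join(VOCAB[i] for i in ids if i != 1)
print(text)
```

In a real deployment the two callables would wrap the separately packaged encoder and decoder models. Greedy selection is used here only for brevity; beam search is a common alternative.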

Applicable Platforms

SC8380

Performance

Inference Time: 2.21 ms
Memory Usage: 7 MB
Layers: 375 (NPU)

Technical Details

Model checkpoint: trocr-small-stage1
Input resolution: 320x320
Number of parameters (TrOCREncoder): 23.0M
Model size (TrOCREncoder): 87.8 MB
Number of parameters (TrOCRDecoder): 38.3M
Model size (TrOCRDecoder): 146 MB
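
The listed model sizes are roughly what the parameter counts imply if each weight is stored as a 32-bit float and "MB" is read as 2^20 bytes. Both of those are assumptions; the page does not state the weight precision or the unit convention. A quick check under those assumptions:

```python
def fp32_size_mib(num_params: float) -> float:
    """Approximate size of a weight file, assuming 4 bytes per parameter."""
    bytes_per_param = 4  # assumption: float32 weights
    return num_params * bytes_per_param / (1024 ** 2)


encoder_mib = fp32_size_mib(23.0e6)  # TrOCREncoder parameter count from the list above
decoder_mib = fp32_size_mib(38.3e6)  # TrOCRDecoder parameter count from the list above

print(f"encoder: {encoder_mib:.1f} MiB, decoder: {decoder_mib:.1f} MiB")
```

The estimates land within a fraction of a megabyte of the listed 87.8 MB and 146 MB. The small gap is consistent with the parameter counts being rounded to three significant figures.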

Applications

Publishing
Healthcare
Document Management

Supported Platform Types

SC8380

License

Source Model: MIT
Deployable Model: AI Model Hub License

Download Links

Decoder: click here to download

Encoder: click here to download