VIT

Introduction

ImageNet classifier and general-purpose backbone.
VIT (Vision Transformer) is a machine learning model that classifies images into the ImageNet classes. It can also serve as a backbone for more complex, task-specific models.

Specifications and downloads

Technical details

Model checkpoint: ImageNet
Input resolution: 224x224
Number of parameters: 86.6M
Model size (float): 330 MB
Model size (w8a16): 86.2 MB
Model size (w8a8): 83.2 MB
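
As a sanity check on the sizes listed above, the short sketch below estimates raw weight storage from the parameter count alone. It assumes 32-bit floats for the "float" variant, 8-bit weights for the w8a16 and w8a8 variants, and that "MB" means 2**20 bytes; none of these assumptions is stated on this page. The helper name weight_size_mib is hypothetical and used only for illustration.

```python
# Rough weight-storage estimate from the parameter count in the spec list.
# Assumptions (not stated in the source): float = 32 bits per weight,
# w8a16 / w8a8 = 8 bits per weight, and "MB" = 2**20 bytes.

PARAMS = 86.6e6  # "Number of parameters: 86.6M"


def weight_size_mib(num_params: float, bits_per_weight: int) -> float:
    """Return raw weight storage in MiB (2**20 bytes), ignoring file overhead."""
    return num_params * bits_per_weight / 8 / 2**20


float32_mib = weight_size_mib(PARAMS, 32)
int8_mib = weight_size_mib(PARAMS, 8)

print(f"float32 weights: {float32_mib:.1f} MiB")  # about 330 MiB
print(f"8-bit weights:   {int8_mib:.1f} MiB")     # about 83 MiB
```

Under these assumptions the float estimate matches the listed 330 MB. The 8-bit estimate comes out slightly below the listed 83.2 MB and 86.2 MB; the gap presumably reflects quantization parameters, file metadata, or tensors kept at higher precision, though the page does not say.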

Applications

Medical Imaging
Anomaly Detection
Inventory Management

License information

Source Model: BSD-3-CLAUSE
Deployable Model: AI-HUB-MODELS-LICENSE