Sports and human action recognition in videos.
ResNet Mixed Convolutions is a network with a mixture of 2D and 3D convolutions used for video understanding.
Model checkpoint:Kinectics-400
Input resolution:112x112
Number of parameters:11.7M
Model size (float):44.6 MB
Model size (w8a16):11.5 MB
Camera
Action Recognition
Source Model: BSD-3-CLAUSE
Deployable Model: AI-HUB-MODELS-LICENSE