Audio Event classification Model.
An audio event classifier trained on the AudioSet dataset to predict audio events from the AudioSet ontology employing the Mobilenet_v1 depthwise‑separable convolution architecture.
Model checkpoint:yamnet.pth
Input resolution:1x1x96x64
Number of parameters:3.73M
Model size (float):14.2 MB
Audio Recognition
Source Model: MIT
Deployable Model: AI-HUB-MODELS-LICENSE