In our proposed model, SPPF-AMP, besides passing the output of the convolutional layer from a sequence of max pooling layers with kernel sizes of 5, it is also passed through two average pooling layers with kernel sizes of 9 and 13. Then the output of each max pooling layer, each average pooling layer, and the convolutional layer are concatenated and passed to the next convolutional layer.
Source: Inception-YOLO: Computational cost and accuracy improvement of the YOLOv5 model based on employing modified CSP, SPPF, and inception modulesPaper | Code | Results | Date | Stars |
---|
Component | Type |
|
---|---|---|
🤖 No Components Found | You can add them if they exist; e.g. Mask R-CNN uses RoIAlign |