├── LICENSE ├── README.md ├── YoloV5.cbp ├── busstop.jpg ├── parking.jpg ├── yolov5.cpp ├── yolov5s.bin └── yolov5s.param /LICENSE: -------------------------------------------------------------------------------- 1 | BSD 3-Clause License 2 | 3 | Copyright (c) 2021, Q-engineering 4 | All rights reserved. 5 | 6 | Redistribution and use in source and binary forms, with or without 7 | modification, are permitted provided that the following conditions are met: 8 | 9 | 1. Redistributions of source code must retain the above copyright notice, this 10 | list of conditions and the following disclaimer. 11 | 12 | 2. Redistributions in binary form must reproduce the above copyright notice, 13 | this list of conditions and the following disclaimer in the documentation 14 | and/or other materials provided with the distribution. 15 | 16 | 3. Neither the name of the copyright holder nor the names of its 17 | contributors may be used to endorse or promote products derived from 18 | this software without specific prior written permission. 19 | 20 | THIS SOFTWARE IS PROVIDED BY THE COPYRIGHT HOLDERS AND CONTRIBUTORS "AS IS" 21 | AND ANY EXPRESS OR IMPLIED WARRANTIES, INCLUDING, BUT NOT LIMITED TO, THE 22 | IMPLIED WARRANTIES OF MERCHANTABILITY AND FITNESS FOR A PARTICULAR PURPOSE ARE 23 | DISCLAIMED. IN NO EVENT SHALL THE COPYRIGHT HOLDER OR CONTRIBUTORS BE LIABLE 24 | FOR ANY DIRECT, INDIRECT, INCIDENTAL, SPECIAL, EXEMPLARY, OR CONSEQUENTIAL 25 | DAMAGES (INCLUDING, BUT NOT LIMITED TO, PROCUREMENT OF SUBSTITUTE GOODS OR 26 | SERVICES; LOSS OF USE, DATA, OR PROFITS; OR BUSINESS INTERRUPTION) HOWEVER 27 | CAUSED AND ON ANY THEORY OF LIABILITY, WHETHER IN CONTRACT, STRICT LIABILITY, 28 | OR TORT (INCLUDING NEGLIGENCE OR OTHERWISE) ARISING IN ANY WAY OUT OF THE USE 29 | OF THIS SOFTWARE, EVEN IF ADVISED OF THE POSSIBILITY OF SUCH DAMAGE. 30 | -------------------------------------------------------------------------------- /README.md: -------------------------------------------------------------------------------- 1 | # YoloV5 Raspberry Pi 4 2 | ![output image]( https://qengineering.eu/images/test_parkV5.jpg ) 3 | ## YoloV5 with the ncnn framework.
4 | [![License](https://img.shields.io/badge/License-BSD%203--Clause-blue.svg)](https://opensource.org/licenses/BSD-3-Clause)

5 | Paper: https://towardsdatascience.com/yolo-v5-is-here-b668ce2a4908

6 | Specially made for a bare Raspberry Pi 4, see [Q-engineering deep learning examples](https://qengineering.eu/deep-learning-examples-on-raspberry-32-64-os.html)
7 | 
8 | ------------
9 | 
10 | ## Benchmark.
11 | Numbers are in **FPS** and reflect only the inference timing. Grabbing frames, post-processing and drawing are not taken into account. A rough timing sketch is given below the table footnotes.
12 | 
13 | | Model | size | mAP | Jetson Nano | RPi 4 1950 MHz | RPi 5 2900 MHz | Rock 5 | RK3588¹ NPU | RK3566/68² NPU | Nano TensorRT | Orin TensorRT |
14 | | ------------- | :-----: | :-----: | :-------------: | :-------------: | :-----: | :-----: | :-------------: | :-------------: | :-----: | :-----: |
15 | | [NanoDet](https://github.com/Qengineering/NanoDet-ncnn-Raspberry-Pi-4) | 320x320 | 20.6 | 26.2 | 13.0 | 43.2 | 36.0 |||||
16 | | [NanoDet Plus](https://github.com/Qengineering/NanoDetPlus-ncnn-Raspberry-Pi-4) | 416x416 | 30.4 | 18.5 | 5.0 | 30.0 | 24.9 |||||
17 | | [PP-PicoDet](https://github.com/Qengineering/PP-PicoDet-ncnn-Raspberry-Pi-4) | 320x320 | 27.0 | 24.0 | 7.5 | 53.7 | 46.7 |||||
18 | | [YoloFastestV2](https://github.com/Qengineering/YoloFastestV2-ncnn-Raspberry-Pi-4) | 352x352 | 24.1 | 38.4 | 18.8 | 78.5 | 65.4 |||||
19 | | [YoloV2](https://github.com/Qengineering/YoloV2-ncnn-Raspberry-Pi-4) ²⁰ | 416x416 | 19.2 | 10.1 | 3.0 | 24.0 | 20.0 |||||
20 | | [YoloV3](https://github.com/Qengineering/YoloV3-ncnn-Raspberry-Pi-4) ²⁰ | 352x352 tiny | 16.6 | 17.7 | 4.4 | 18.1 | 15.0 |||||
21 | | [YoloV4](https://github.com/Qengineering/YoloV4-ncnn-Raspberry-Pi-4) | 416x416 tiny | 21.7 | 16.1 | 3.4 | 17.5 | 22.4 |||||
22 | | [YoloV4](https://github.com/Qengineering/YoloV4-ncnn-Raspberry-Pi-4) | 608x608 full | 45.3 | 1.3 | 0.2 | 1.82 | 1.5 |||||
23 | | [YoloV5](https://github.com/Qengineering/YoloV5-ncnn-Raspberry-Pi-4) | 640x640 nano | 22.5 | 5.0 | 1.6 | 13.6 | 12.5 | 58.8 | 14.8 | 19.0 | 100 |
24 | | [YoloV5](https://github.com/Qengineering/YoloV5-ncnn-Raspberry-Pi-4) | 640x640 small | 22.5 | 5.0 | 1.6 | 6.3 | 12.5 | 37.7 | 11.7 | 9.25 | 100 |
25 | | [YoloV6](https://github.com/Qengineering/YoloV6-ncnn-Raspberry-Pi-4) | 640x640 nano | 35.0 | 10.5 | 2.7 | 15.8 | 20.8 | 63.0 | 18.0 |||
26 | | [YoloV7](https://github.com/Qengineering/YoloV5-ncnn-Raspberry-Pi-4) | 640x640 tiny | 38.7 | 8.5 | 2.1 | 14.4 | 17.9 | 53.4 | 16.1 | 15.0 ||
27 | | [YoloV8](https://github.com/Qengineering/YoloV8-ncnn-Raspberry-Pi-4) | 640x640 nano | 37.3 | 14.5 | 3.1 | 20.0 | 16.3 | 53.1 | 18.2 |||
28 | | [YoloV8](https://github.com/Qengineering/YoloV8-ncnn-Raspberry-Pi-4) | 640x640 small | 44.9 | 4.5 | 1.47 | 11.0 | 9.2 | 28.5 | 8.9 |||
29 | | [YoloV9](https://github.com/Qengineering/YoloV9-ncnn-Raspberry-Pi-4) | 640x640 comp | 53.0 | 1.2 | 0.28 | 1.5 | 1.2 |||||
30 | | [YoloX](https://github.com/Qengineering/YoloX-ncnn-Raspberry-Pi-4) | 416x416 nano | 25.8 | 22.6 | 7.0 | 38.6 | 28.5 |||||
31 | | [YoloX](https://github.com/Qengineering/YoloX-ncnn-Raspberry-Pi-4) | 416x416 tiny | 32.8 | 11.35 | 2.8 | 17.2 | 18.1 |||||
32 | | [YoloX](https://github.com/Qengineering/YoloX-ncnn-Raspberry-Pi-4) | 640x640 small | 40.5 | 3.65 | 0.9 | 4.5 | 7.5 | 30.0 | 10.0 |||
33 | 
34 | ¹ The Rock 5 and Orange Pi 5 have the RK3588 on board.
35 | ² The Rock 3, Radxa Zero 3 and Orange Pi 3B have the RK3566 on board.
36 | ²⁰ Recognize 20 objects (VOC) instead of 80 (COCO).
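The exact benchmark script is not included in this repo. As a rough sketch only, a helper like the one below, added to yolov5.cpp, times repeated calls to `detect_yolov5()` with `std::chrono`. Note that `detect_yolov5()` also contains the letterbox pre-processing and the NMS, so the result slightly understates the pure inference figures above.

```cpp
// Hypothetical helper, not the script used for the table above.
// Add it to yolov5.cpp (it uses the Object struct and detect_yolov5() defined there)
// and call it from main() once the model has been loaded.
#include <chrono>

static double benchmark_fps(const cv::Mat& image, int runs)
{
    std::vector<Object> objects;
    auto t0 = std::chrono::steady_clock::now();
    for (int i = 0; i < runs; i++) {
        detect_yolov5(image, objects);   // letterbox + inference + NMS
    }
    auto t1 = std::chrono::steady_clock::now();
    double seconds = std::chrono::duration<double>(t1 - t0).count();
    return runs / seconds;               // frames per second
}
```
For example, `printf("%.1f FPS\n", benchmark_fps(m, 50));` placed after the model is loaded in `main()` gives a quick sanity check against the table.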
37 | 
38 | ------------
39 | 
40 | ## Dependencies.
41 | To run the application, you need:
42 | - A Raspberry Pi 4 with a 32 or 64-bit operating system. It can be the Raspberry 64-bit OS, or Ubuntu 18.04 / 20.04. [Install 64-bit OS](https://qengineering.eu/install-raspberry-64-os.html)
43 | - The Tencent ncnn framework installed. [Install ncnn](https://qengineering.eu/install-ncnn-on-raspberry-pi-4.html)
44 | - OpenCV 64-bit installed. [Install OpenCV 4.5](https://qengineering.eu/install-opencv-4.5-on-raspberry-64-os.html)
45 | - Code::Blocks installed. (```$ sudo apt-get install codeblocks```)
46 | 
47 | ------------
48 | 
49 | ## Installing the app.
50 | To extract and run the network in Code::Blocks:
51 | $ mkdir *MyDir*
52 | $ cd *MyDir*
53 | $ wget https://github.com/Qengineering/YoloV5-ncnn-Raspberry-Pi-4/archive/refs/heads/main.zip
54 | $ unzip -j main.zip
55 | Remove main.zip, LICENSE and README.md as they are no longer needed.
56 | $ rm main.zip
57 | $ rm LICENSE
58 | $ rm README.md

59 | Your *MyDir* folder must now look like this:
60 | parking.jpg
61 | busstop.jpg
62 | YoloV5.cbp
63 | yolov5.cpp
64 | yolov5s.bin
65 | yolov5s.param
66 | 
67 | ------------
68 | 
69 | ## Running the app.
70 | To run the application, load the project file YoloV5.cbp in Code::Blocks. For more information, or
71 | if you want to connect a camera to the app, follow the instructions at [Hands-On](https://qengineering.eu/deep-learning-examples-on-raspberry-32-64-os.html#HandsOn).
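As a rough sketch of what such a camera version could look like, assuming a USB or CSI camera that OpenCV can open as device 0 (the Hands-On page is the place for the real instructions), `main()` in yolov5.cpp could be replaced by something like:

```cpp
// Hypothetical camera loop for yolov5.cpp, replacing the single-image main().
// Assumes a camera reachable as cv::VideoCapture device 0.
int main(int argc, char** argv)
{
    yolov5.register_custom_layer("YoloV5Focus", YoloV5Focus_layer_creator);
    yolov5.load_param("yolov5s.param");
    yolov5.load_model("yolov5s.bin");
    yolov5.opt.num_threads = 4;

    cv::VideoCapture cap(0);                      // open the camera
    if (!cap.isOpened()) {
        fprintf(stderr, "Cannot open camera 0\n");
        return -1;
    }

    cv::Mat frame;
    std::vector<Object> objects;
    while (cap.read(frame)) {
        detect_yolov5(frame, objects);            // same detector as the still-image demo
        draw_objects(frame, objects);
        cv::imshow("YoloV5 - camera", frame);
        if (cv::waitKey(1) == 27) break;          // Esc quits
    }
    return 0;
}
```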

72 | Many thanks to [nihui](https://github.com/nihui/) again!

73 | ![output image]( https://qengineering.eu/images/test_busV5.jpg ) 74 | 75 | ------------ 76 | 77 | [![paypal](https://qengineering.eu/images/TipJarSmall4.png)](https://www.paypal.com/cgi-bin/webscr?cmd=_s-xclick&hosted_button_id=CPZTM5BB3FCYL) 78 | 79 | 80 | -------------------------------------------------------------------------------- /YoloV5.cbp: -------------------------------------------------------------------------------- 1 | 2 | 3 | 4 | 5 | 67 | 68 | -------------------------------------------------------------------------------- /busstop.jpg: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/Qengineering/YoloV5-ncnn-Raspberry-Pi-4/134933aa72247a41bf598ef38b1476d11d003f28/busstop.jpg -------------------------------------------------------------------------------- /parking.jpg: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/Qengineering/YoloV5-ncnn-Raspberry-Pi-4/134933aa72247a41bf598ef38b1476d11d003f28/parking.jpg -------------------------------------------------------------------------------- /yolov5.cpp: -------------------------------------------------------------------------------- 1 | // Tencent is pleased to support the open source community by making ncnn available. 2 | // 3 | // Copyright (C) 2020 THL A29 Limited, a Tencent company. All rights reserved. 4 | // 5 | // Licensed under the BSD 3-Clause License (the "License"); you may not use this file except 6 | // in compliance with the License. You may obtain a copy of the License at 7 | // 8 | // https://opensource.org/licenses/BSD-3-Clause 9 | // 10 | // Unless required by applicable law or agreed to in writing, software distributed 11 | // under the License is distributed on an "AS IS" BASIS, WITHOUT WARRANTIES OR 12 | // CONDITIONS OF ANY KIND, either express or implied. See the License for the 13 | // specific language governing permissions and limitations under the License. 
14 | 
15 | // modified 12-31-2021 Q-engineering
16 | 
17 | #include "layer.h"
18 | #include "net.h"
19 | 
20 | #include <float.h>
21 | #include <math.h>
22 | #include <stdio.h>
23 | #include <vector>
24 | #include <opencv2/opencv.hpp>
25 | 
26 | ncnn::Net yolov5;
27 | 
28 | const int target_size = 640;
29 | const float prob_threshold = 0.25f;
30 | const float nms_threshold = 0.45f;
31 | const float norm_vals[3] = {1 / 255.f, 1 / 255.f, 1 / 255.f};
32 | 
33 | const char* class_names[] = {
34 | "person", "bicycle", "car", "motorcycle", "airplane", "bus", "train", "truck", "boat", "traffic light",
35 | "fire hydrant", "stop sign", "parking meter", "bench", "bird", "cat", "dog", "horse", "sheep", "cow",
36 | "elephant", "bear", "zebra", "giraffe", "backpack", "umbrella", "handbag", "tie", "suitcase", "frisbee",
37 | "skis", "snowboard", "sports ball", "kite", "baseball bat", "baseball glove", "skateboard", "surfboard",
38 | "tennis racket", "bottle", "wine glass", "cup", "fork", "knife", "spoon", "bowl", "banana", "apple",
39 | "sandwich", "orange", "broccoli", "carrot", "hot dog", "pizza", "donut", "cake", "chair", "couch",
40 | "potted plant", "bed", "dining table", "toilet", "tv", "laptop", "mouse", "remote", "keyboard", "cell phone",
41 | "microwave", "oven", "toaster", "sink", "refrigerator", "book", "clock", "vase", "scissors", "teddy bear",
42 | "hair drier", "toothbrush"
43 | };
44 | 
45 | // The custom 'Focus' layer: packs each 2x2 spatial block into four channels (space-to-depth)
46 | class YoloV5Focus : public ncnn::Layer
47 | {
48 | public:
49 | YoloV5Focus()
50 | {
51 | one_blob_only = true;
52 | }
53 | 
54 | virtual int forward(const ncnn::Mat& bottom_blob, ncnn::Mat& top_blob, const ncnn::Option& opt) const
55 | {
56 | int w = bottom_blob.w;
57 | int h = bottom_blob.h;
58 | int channels = bottom_blob.c;
59 | 
60 | int outw = w / 2;
61 | int outh = h / 2;
62 | int outc = channels * 4;
63 | 
64 | top_blob.create(outw, outh, outc, 4u, 1, opt.blob_allocator);
65 | if (top_blob.empty())
66 | return -100;
67 | 
68 | #pragma omp parallel for num_threads(opt.num_threads)
69 | for (int p = 0; p < outc; p++)
70 | {
71 | const float* ptr = bottom_blob.channel(p % channels).row((p / channels) % 2) + ((p / channels) / 2);
72 | float* outptr = top_blob.channel(p);
73 | 
74 | for (int i = 0; i < outh; i++)
75 | {
76 | for (int j = 0; j < outw; j++)
77 | {
78 | *outptr = *ptr;
79 | 
80 | outptr += 1;
81 | ptr += 2;
82 | }
83 | 
84 | ptr += w;
85 | }
86 | }
87 | 
88 | return 0;
89 | }
90 | };
91 | 
92 | DEFINE_LAYER_CREATOR(YoloV5Focus)
93 | 
94 | struct Object
95 | {
96 | cv::Rect_<float> rect;
97 | int label;
98 | float prob;
99 | };
100 | 
101 | static inline float intersection_area(const Object& a, const Object& b)
102 | {
103 | cv::Rect_<float> inter = a.rect & b.rect;
104 | return inter.area();
105 | }
106 | 
107 | static void qsort_descent_inplace(std::vector<Object>& faceobjects, int left, int right)
108 | {
109 | int i = left;
110 | int j = right;
111 | float p = faceobjects[(left + right) / 2].prob;
112 | 
113 | while (i <= j)
114 | {
115 | while (faceobjects[i].prob > p)
116 | i++;
117 | 
118 | while (faceobjects[j].prob < p)
119 | j--;
120 | 
121 | if (i <= j)
122 | {
123 | // swap
124 | std::swap(faceobjects[i], faceobjects[j]);
125 | 
126 | i++;
127 | j--;
128 | }
129 | }
130 | 
131 | #pragma omp parallel sections
132 | {
133 | #pragma omp section
134 | {
135 | if (left < j) qsort_descent_inplace(faceobjects, left, j);
136 | }
137 | #pragma omp section
138 | {
139 | if (i < right) qsort_descent_inplace(faceobjects, i, right);
140 | }
141 | }
142 | }
143 | 
144 | static void qsort_descent_inplace(std::vector<Object>& faceobjects)
145 | {
146 | if (faceobjects.empty())
147 | return;
148 | 
149 | qsort_descent_inplace(faceobjects, 0, faceobjects.size() - 1);
150 | }
151 | 
152 | static void nms_sorted_bboxes(const std::vector<Object>& faceobjects, std::vector<int>& picked, float nms_threshold)
153 | {
154 | picked.clear();
155 | 
156 | const int n = faceobjects.size();
157 | 
158 | std::vector<float> areas(n);
159 | for (int i = 0; i < n; i++)
160 | {
161 | areas[i] = faceobjects[i].rect.area();
162 | }
163 | 
164 | for (int i = 0; i < n; i++)
165 | {
166 | const Object& a = faceobjects[i];
167 | 
168 | int keep = 1;
169 | for (int j = 0; j < (int)picked.size(); j++)
170 | {
171 | const Object& b = faceobjects[picked[j]];
172 | 
173 | // intersection over union
174 | float inter_area = intersection_area(a, b);
175 | float union_area = areas[i] + areas[picked[j]] - inter_area;
176 | // float IoU = inter_area / union_area
177 | if (inter_area / union_area > nms_threshold)
178 | keep = 0;
179 | }
180 | 
181 | if (keep)
182 | picked.push_back(i);
183 | }
184 | }
185 | 
186 | static inline float sigmoid(float x)
187 | {
188 | return static_cast<float>(1.f / (1.f + exp(-x)));
189 | }
190 | 
191 | static void generate_proposals(const ncnn::Mat& anchors, int stride, const ncnn::Mat& in_pad, const ncnn::Mat& feat_blob, float prob_threshold, std::vector<Object>& objects)
192 | {
193 | const int num_grid = feat_blob.h;
194 | 
195 | int num_grid_x;
196 | int num_grid_y;
197 | if (in_pad.w > in_pad.h)
198 | {
199 | num_grid_x = in_pad.w / stride;
200 | num_grid_y = num_grid / num_grid_x;
201 | }
202 | else
203 | {
204 | num_grid_y = in_pad.h / stride;
205 | num_grid_x = num_grid / num_grid_y;
206 | }
207 | 
208 | const int num_class = feat_blob.w - 5;
209 | 
210 | const int num_anchors = anchors.w / 2;
211 | 
212 | for (int q = 0; q < num_anchors; q++)
213 | {
214 | const float anchor_w = anchors[q * 2];
215 | const float anchor_h = anchors[q * 2 + 1];
216 | 
217 | const ncnn::Mat feat = feat_blob.channel(q);
218 | 
219 | for (int i = 0; i < num_grid_y; i++)
220 | {
221 | for (int j = 0; j < num_grid_x; j++)
222 | {
223 | const float* featptr = feat.row(i * num_grid_x + j);
224 | 
225 | // find class index with max class score
226 | int class_index = 0;
227 | float class_score = -FLT_MAX;
228 | for (int k = 0; k < num_class; k++)
229 | {
230 | float score = featptr[5 + k];
231 | if (score > class_score)
232 | {
233 | class_index = k;
234 | class_score = score;
235 | }
236 | }
237 | 
238 | float box_score = featptr[4];
239 | 
240 | float confidence = sigmoid(box_score) * sigmoid(class_score);
241 | 
242 | if (confidence >= prob_threshold)
243 | {
244 | // yolov5/models/yolo.py Detect forward
245 | // y = x[i].sigmoid()
246 | // y[..., 0:2] = (y[..., 0:2] * 2. - 0.5 + self.grid[i].to(x[i].device)) * self.stride[i] # xy
247 | // y[..., 2:4] = (y[..., 2:4] * 2) ** 2 * self.anchor_grid[i] # wh
248 | 
249 | float dx = sigmoid(featptr[0]);
250 | float dy = sigmoid(featptr[1]);
251 | float dw = sigmoid(featptr[2]);
252 | float dh = sigmoid(featptr[3]);
253 | 
254 | float pb_cx = (dx * 2.f - 0.5f + j) * stride;
255 | float pb_cy = (dy * 2.f - 0.5f + i) * stride;
256 | 
257 | float pb_w = pow(dw * 2.f, 2) * anchor_w;
258 | float pb_h = pow(dh * 2.f, 2) * anchor_h;
259 | 
260 | float x0 = pb_cx - pb_w * 0.5f;
261 | float y0 = pb_cy - pb_h * 0.5f;
262 | float x1 = pb_cx + pb_w * 0.5f;
263 | float y1 = pb_cy + pb_h * 0.5f;
264 | 
265 | Object obj;
266 | obj.rect.x = x0;
267 | obj.rect.y = y0;
268 | obj.rect.width = x1 - x0;
269 | obj.rect.height = y1 - y0;
270 | obj.label = class_index;
271 | obj.prob = confidence;
272 | 
273 | objects.push_back(obj);
274 | }
275 | }
276 | }
277 | }
278 | }
279 | 
280 | static int detect_yolov5(const cv::Mat& bgr, std::vector<Object>& objects)
281 | {
282 | int img_w = bgr.cols;
283 | int img_h = bgr.rows;
284 | 
285 | // letterbox pad to multiple of 32
286 | int w = img_w;
287 | int h = img_h;
288 | float scale = 1.f;
289 | if (w > h)
290 | {
291 | scale = (float)target_size / w;
292 | w = target_size;
293 | h = h * scale;
294 | }
295 | else
296 | {
297 | scale = (float)target_size / h;
298 | h = target_size;
299 | w = w * scale;
300 | }
301 | 
302 | ncnn::Mat in = ncnn::Mat::from_pixels_resize(bgr.data, ncnn::Mat::PIXEL_BGR2RGB, img_w, img_h, w, h);
303 | 
304 | // pad to target_size rectangle
305 | // yolov5/utils/datasets.py letterbox
306 | int wpad = (w + 31) / 32 * 32 - w;
307 | int hpad = (h + 31) / 32 * 32 - h;
308 | ncnn::Mat in_pad;
309 | ncnn::copy_make_border(in, in_pad, hpad / 2, hpad - hpad / 2, wpad / 2, wpad - wpad / 2, ncnn::BORDER_CONSTANT, 114.f);
310 | 
311 | const float norm_vals[3] = {1 / 255.f, 1 / 255.f, 1 / 255.f};
312 | in_pad.substract_mean_normalize(0, norm_vals);
313 | 
314 | ncnn::Extractor ex = yolov5.create_extractor();
315 | 
316 | ex.input("images", in_pad);
317 | 
318 | std::vector<Object> proposals;
319 | 
320 | // anchor setting from yolov5/models/yolov5s.yaml
321 | 
322 | // stride 8
323 | {
324 | ncnn::Mat out;
325 | ex.extract("output", out);
326 | 
327 | ncnn::Mat anchors(6);
328 | anchors[0] = 10.f;
329 | anchors[1] = 13.f;
330 | anchors[2] = 16.f;
331 | anchors[3] = 30.f;
332 | anchors[4] = 33.f;
333 | anchors[5] = 23.f;
334 | 
335 | std::vector<Object> objects8;
336 | generate_proposals(anchors, 8, in_pad, out, prob_threshold, objects8);
337 | 
338 | proposals.insert(proposals.end(), objects8.begin(), objects8.end());
339 | }
340 | 
341 | // stride 16
342 | {
343 | ncnn::Mat out;
344 | ex.extract("781", out);
345 | 
346 | ncnn::Mat anchors(6);
347 | anchors[0] = 30.f;
348 | anchors[1] = 61.f;
349 | anchors[2] = 62.f;
350 | anchors[3] = 45.f;
351 | anchors[4] = 59.f;
352 | anchors[5] = 119.f;
353 | 
354 | std::vector<Object> objects16;
355 | generate_proposals(anchors, 16, in_pad, out, prob_threshold, objects16);
356 | 
357 | proposals.insert(proposals.end(), objects16.begin(), objects16.end());
358 | }
359 | 
360 | // stride 32
361 | {
362 | ncnn::Mat out;
363 | ex.extract("801", out);
364 | 
365 | ncnn::Mat anchors(6);
366 | anchors[0] = 116.f;
367 | anchors[1] = 90.f;
368 | anchors[2] = 156.f;
369 | anchors[3] = 198.f;
370 | anchors[4] = 373.f;
371 | anchors[5] = 326.f;
372 | 
373 | std::vector<Object> objects32;
374 | generate_proposals(anchors, 32, in_pad, out, prob_threshold, objects32);
375 | 
376 | proposals.insert(proposals.end(), objects32.begin(), objects32.end());
377 | }
378 | 
379 | // sort all proposals by score from highest to lowest
380 | qsort_descent_inplace(proposals);
381 | 
382 | // apply nms with nms_threshold
383 | std::vector<int> picked;
384 | nms_sorted_bboxes(proposals, picked, nms_threshold);
385 | 
386 | int count = picked.size();
387 | 
388 | objects.resize(count);
389 | for (int i = 0; i < count; i++)
390 | {
391 | objects[i] = proposals[picked[i]];
392 | 
393 | // adjust offset to original unpadded
394 | float x0 = (objects[i].rect.x - (wpad / 2)) / scale;
395 | float y0 = (objects[i].rect.y - (hpad / 2)) / scale;
396 | float x1 = (objects[i].rect.x + objects[i].rect.width - (wpad / 2)) / scale;
397 | float y1 = (objects[i].rect.y + objects[i].rect.height - (hpad / 2)) / scale;
398 | 
399 | // clip
400 | x0 = std::max(std::min(x0, (float)(img_w - 1)), 0.f);
401 | y0 = std::max(std::min(y0, (float)(img_h - 1)), 0.f);
402 | x1 = std::max(std::min(x1, (float)(img_w - 1)), 0.f);
403 | y1 = std::max(std::min(y1, (float)(img_h - 1)), 0.f);
404 | 
405 | objects[i].rect.x = x0;
406 | objects[i].rect.y = y0;
407 | objects[i].rect.width = x1 - x0;
408 | objects[i].rect.height = y1 - y0;
409 | }
410 | 
411 | return 0;
412 | }
413 | 
414 | static void draw_objects(cv::Mat& bgr, const std::vector<Object>& objects)
415 | {
416 | for (size_t i = 0; i < objects.size(); i++)
417 | {
418 | const Object& obj = objects[i];
419 | 
420 | // fprintf(stderr, "%d = %.5f at %.2f %.2f %.2f x %.2f\n", obj.label, obj.prob,
421 | // obj.rect.x, obj.rect.y, obj.rect.width, obj.rect.height);
422 | 
423 | cv::rectangle(bgr, obj.rect, cv::Scalar(255, 0, 0));
424 | 
425 | char text[256];
426 | sprintf(text, "%s %.1f%%", class_names[obj.label], obj.prob * 100);
427 | 
428 | int baseLine = 0;
429 | cv::Size label_size = cv::getTextSize(text, cv::FONT_HERSHEY_SIMPLEX, 0.5, 1, &baseLine);
430 | 
431 | int x = obj.rect.x;
432 | int y = obj.rect.y - label_size.height - baseLine;
433 | if (y < 0)
434 | y = 0;
435 | if (x + label_size.width > bgr.cols)
436 | x = bgr.cols - label_size.width;
437 | 
438 | cv::rectangle(bgr, cv::Rect(cv::Point(x, y), cv::Size(label_size.width, label_size.height + baseLine)),
439 | cv::Scalar(255, 255, 255), -1);
440 | 
441 | cv::putText(bgr, text, cv::Point(x, y + label_size.height),
442 | cv::FONT_HERSHEY_SIMPLEX, 0.5, cv::Scalar(0, 0, 0));
443 | }
444 | }
445 | 
446 | int main(int argc, char** argv)
447 | {
448 | if (argc != 2)
449 | {
450 | fprintf(stderr, "Usage: %s [imagepath]\n", argv[0]);
451 | return -1;
452 | }
453 | 
454 | const char* imagepath = argv[1];
455 | 
456 | cv::Mat m = cv::imread(imagepath, 1);
457 | if (m.empty())
458 | {
459 | fprintf(stderr, "cv::imread %s failed\n", imagepath);
460 | return -1;
461 | }
462 | 
463 | yolov5.register_custom_layer("YoloV5Focus", YoloV5Focus_layer_creator);
464 | 
465 | // original pretrained model from https://github.com/ultralytics/yolov5
466 | // the ncnn model https://github.com/nihui/ncnn-assets/tree/master/models
467 | yolov5.load_param("yolov5s.param");
468 | yolov5.load_model("yolov5s.bin");
469 | yolov5.opt.num_threads = 4;
470 | 
471 | std::vector<Object> objects;
472 | detect_yolov5(m, objects);
473 | draw_objects(m, objects);
474 | 
475 | cv::imshow("RPi4 - 1.95 GHz - 2 GB ram", m);
476 | // cv::imwrite("test.jpg",m);
477 | cv::waitKey(0);
478 | 
479 | return 0;
480 | }
481 | -------------------------------------------------------------------------------- /yolov5s.bin: --------------------------------------------------------------------------------
https://raw.githubusercontent.com/Qengineering/YoloV5-ncnn-Raspberry-Pi-4/134933aa72247a41bf598ef38b1476d11d003f28/yolov5s.bin -------------------------------------------------------------------------------- /yolov5s.param: -------------------------------------------------------------------------------- 1 | 7767517 2 | 192 216 3 | Input images 0 1 images 4 | YoloV5Focus focus 1 1 images 207 5 | Convolution Conv_41 1 1 207 208 0=32 1=3 4=1 5=1 6=3456 6 | HardSwish Div_49 1 1 208 216 0=1.666667e-01 7 | Convolution Conv_50 1 1 216 217 0=64 1=3 3=2 4=1 5=1 6=18432 8 | HardSwish Div_58 1 1 217 225 0=1.666667e-01 9 | Split splitncnn_0 1 2 225 225_splitncnn_0 225_splitncnn_1 10 | Convolution Conv_59 1 1 225_splitncnn_1 226 0=32 1=1 5=1 6=2048 11 | HardSwish Div_67 1 1 226 234 0=1.666667e-01 12 | Split splitncnn_1 1 2 234 234_splitncnn_0 234_splitncnn_1 13 | Convolution Conv_68 1 1 234_splitncnn_1 235 0=32 1=1 5=1 6=1024 14 | HardSwish Div_76 1 1 235 243 0=1.666667e-01 15 | Convolution Conv_77 1 1 243 244 0=32 1=3 4=1 5=1 6=9216 16 | HardSwish Div_85 1 1 244 252 0=1.666667e-01 17 | BinaryOp Add_86 2 1 234_splitncnn_0 252 253 18 | Convolution Conv_87 1 1 253 254 0=32 1=1 6=1024 19 | Convolution Conv_88 1 1 225_splitncnn_0 255 0=32 1=1 6=2048 20 | Concat Concat_89 2 1 254 255 256 21 | BatchNorm BatchNormalization_90 1 1 256 257 0=64 22 | ReLU LeakyRelu_91 1 1 257 258 0=1.000000e-01 23 | Convolution Conv_92 1 1 258 259 0=64 1=1 5=1 6=4096 24 | HardSwish Div_100 1 1 259 267 0=1.666667e-01 25 | Convolution Conv_101 1 1 267 268 0=128 1=3 3=2 4=1 5=1 6=73728 26 | HardSwish Div_109 1 1 268 276 0=1.666667e-01 27 | Split splitncnn_2 1 2 276 276_splitncnn_0 276_splitncnn_1 28 | Convolution Conv_110 1 1 276_splitncnn_1 277 0=64 1=1 5=1 6=8192 29 | HardSwish Div_118 1 1 277 285 0=1.666667e-01 30 | Split splitncnn_3 1 2 285 285_splitncnn_0 285_splitncnn_1 31 | Convolution Conv_119 1 1 285_splitncnn_1 286 0=64 1=1 5=1 6=4096 32 | HardSwish Div_127 1 1 286 294 0=1.666667e-01 33 | Convolution Conv_128 1 1 294 295 0=64 1=3 4=1 5=1 6=36864 34 | HardSwish Div_136 1 1 295 303 0=1.666667e-01 35 | BinaryOp Add_137 2 1 285_splitncnn_0 303 304 36 | Split splitncnn_4 1 2 304 304_splitncnn_0 304_splitncnn_1 37 | Convolution Conv_138 1 1 304_splitncnn_1 305 0=64 1=1 5=1 6=4096 38 | HardSwish Div_146 1 1 305 313 0=1.666667e-01 39 | Convolution Conv_147 1 1 313 314 0=64 1=3 4=1 5=1 6=36864 40 | HardSwish Div_155 1 1 314 322 0=1.666667e-01 41 | BinaryOp Add_156 2 1 304_splitncnn_0 322 323 42 | Split splitncnn_5 1 2 323 323_splitncnn_0 323_splitncnn_1 43 | Convolution Conv_157 1 1 323_splitncnn_1 324 0=64 1=1 5=1 6=4096 44 | HardSwish Div_165 1 1 324 332 0=1.666667e-01 45 | Convolution Conv_166 1 1 332 333 0=64 1=3 4=1 5=1 6=36864 46 | HardSwish Div_174 1 1 333 341 0=1.666667e-01 47 | BinaryOp Add_175 2 1 323_splitncnn_0 341 342 48 | Convolution Conv_176 1 1 342 343 0=64 1=1 6=4096 49 | Convolution Conv_177 1 1 276_splitncnn_0 344 0=64 1=1 6=8192 50 | Concat Concat_178 2 1 343 344 345 51 | BatchNorm BatchNormalization_179 1 1 345 346 0=128 52 | ReLU LeakyRelu_180 1 1 346 347 0=1.000000e-01 53 | Convolution Conv_181 1 1 347 348 0=128 1=1 5=1 6=16384 54 | HardSwish Div_189 1 1 348 356 0=1.666667e-01 55 | Split splitncnn_6 1 2 356 356_splitncnn_0 356_splitncnn_1 56 | Convolution Conv_190 1 1 356_splitncnn_1 357 0=256 1=3 3=2 4=1 5=1 6=294912 57 | HardSwish Div_198 1 1 357 365 0=1.666667e-01 58 | Split splitncnn_7 1 2 365 365_splitncnn_0 365_splitncnn_1 59 | Convolution Conv_199 1 1 365_splitncnn_1 366 0=128 1=1 5=1 6=32768 60 | 
HardSwish Div_207 1 1 366 374 0=1.666667e-01 61 | Split splitncnn_8 1 2 374 374_splitncnn_0 374_splitncnn_1 62 | Convolution Conv_208 1 1 374_splitncnn_1 375 0=128 1=1 5=1 6=16384 63 | HardSwish Div_216 1 1 375 383 0=1.666667e-01 64 | Convolution Conv_217 1 1 383 384 0=128 1=3 4=1 5=1 6=147456 65 | HardSwish Div_225 1 1 384 392 0=1.666667e-01 66 | BinaryOp Add_226 2 1 374_splitncnn_0 392 393 67 | Split splitncnn_9 1 2 393 393_splitncnn_0 393_splitncnn_1 68 | Convolution Conv_227 1 1 393_splitncnn_1 394 0=128 1=1 5=1 6=16384 69 | HardSwish Div_235 1 1 394 402 0=1.666667e-01 70 | Convolution Conv_236 1 1 402 403 0=128 1=3 4=1 5=1 6=147456 71 | HardSwish Div_244 1 1 403 411 0=1.666667e-01 72 | BinaryOp Add_245 2 1 393_splitncnn_0 411 412 73 | Split splitncnn_10 1 2 412 412_splitncnn_0 412_splitncnn_1 74 | Convolution Conv_246 1 1 412_splitncnn_1 413 0=128 1=1 5=1 6=16384 75 | HardSwish Div_254 1 1 413 421 0=1.666667e-01 76 | Convolution Conv_255 1 1 421 422 0=128 1=3 4=1 5=1 6=147456 77 | HardSwish Div_263 1 1 422 430 0=1.666667e-01 78 | BinaryOp Add_264 2 1 412_splitncnn_0 430 431 79 | Convolution Conv_265 1 1 431 432 0=128 1=1 6=16384 80 | Convolution Conv_266 1 1 365_splitncnn_0 433 0=128 1=1 6=32768 81 | Concat Concat_267 2 1 432 433 434 82 | BatchNorm BatchNormalization_268 1 1 434 435 0=256 83 | ReLU LeakyRelu_269 1 1 435 436 0=1.000000e-01 84 | Convolution Conv_270 1 1 436 437 0=256 1=1 5=1 6=65536 85 | HardSwish Div_278 1 1 437 445 0=1.666667e-01 86 | Split splitncnn_11 1 2 445 445_splitncnn_0 445_splitncnn_1 87 | Convolution Conv_279 1 1 445_splitncnn_1 446 0=512 1=3 3=2 4=1 5=1 6=1179648 88 | HardSwish Div_287 1 1 446 454 0=1.666667e-01 89 | Convolution Conv_288 1 1 454 455 0=256 1=1 5=1 6=131072 90 | HardSwish Div_296 1 1 455 463 0=1.666667e-01 91 | Split splitncnn_12 1 4 463 463_splitncnn_0 463_splitncnn_1 463_splitncnn_2 463_splitncnn_3 92 | Pooling MaxPool_297 1 1 463_splitncnn_3 464 1=5 3=2 5=1 93 | Pooling MaxPool_298 1 1 463_splitncnn_2 465 1=9 3=4 5=1 94 | Pooling MaxPool_299 1 1 463_splitncnn_1 466 1=13 3=6 5=1 95 | Concat Concat_300 4 1 463_splitncnn_0 464 465 466 467 96 | Convolution Conv_301 1 1 467 468 0=512 1=1 5=1 6=524288 97 | HardSwish Div_309 1 1 468 476 0=1.666667e-01 98 | Split splitncnn_13 1 2 476 476_splitncnn_0 476_splitncnn_1 99 | Convolution Conv_310 1 1 476_splitncnn_1 477 0=256 1=1 5=1 6=131072 100 | HardSwish Div_318 1 1 477 485 0=1.666667e-01 101 | Convolution Conv_319 1 1 485 486 0=256 1=1 5=1 6=65536 102 | HardSwish Div_327 1 1 486 494 0=1.666667e-01 103 | Convolution Conv_328 1 1 494 495 0=256 1=3 4=1 5=1 6=589824 104 | HardSwish Div_336 1 1 495 503 0=1.666667e-01 105 | Convolution Conv_337 1 1 503 504 0=256 1=1 6=65536 106 | Convolution Conv_338 1 1 476_splitncnn_0 505 0=256 1=1 6=131072 107 | Concat Concat_339 2 1 504 505 506 108 | BatchNorm BatchNormalization_340 1 1 506 507 0=512 109 | ReLU LeakyRelu_341 1 1 507 508 0=1.000000e-01 110 | Convolution Conv_342 1 1 508 509 0=512 1=1 5=1 6=262144 111 | HardSwish Div_350 1 1 509 517 0=1.666667e-01 112 | Convolution Conv_351 1 1 517 518 0=256 1=1 5=1 6=131072 113 | HardSwish Div_359 1 1 518 526 0=1.666667e-01 114 | Split splitncnn_14 1 2 526 526_splitncnn_0 526_splitncnn_1 115 | Interp Resize_361 1 1 526_splitncnn_1 536 0=1 1=2.000000e+00 2=2.000000e+00 116 | Concat Concat_362 2 1 536 445_splitncnn_0 537 117 | Split splitncnn_15 1 2 537 537_splitncnn_0 537_splitncnn_1 118 | Convolution Conv_363 1 1 537_splitncnn_1 538 0=128 1=1 5=1 6=65536 119 | HardSwish Div_371 1 1 538 546 0=1.666667e-01 120 | 
Convolution Conv_372 1 1 546 547 0=128 1=1 5=1 6=16384 121 | HardSwish Div_380 1 1 547 555 0=1.666667e-01 122 | Convolution Conv_381 1 1 555 556 0=128 1=3 4=1 5=1 6=147456 123 | HardSwish Div_389 1 1 556 564 0=1.666667e-01 124 | Convolution Conv_390 1 1 564 565 0=128 1=1 6=16384 125 | Convolution Conv_391 1 1 537_splitncnn_0 566 0=128 1=1 6=65536 126 | Concat Concat_392 2 1 565 566 567 127 | BatchNorm BatchNormalization_393 1 1 567 568 0=256 128 | ReLU LeakyRelu_394 1 1 568 569 0=1.000000e-01 129 | Convolution Conv_395 1 1 569 570 0=256 1=1 5=1 6=65536 130 | HardSwish Div_403 1 1 570 578 0=1.666667e-01 131 | Convolution Conv_404 1 1 578 579 0=128 1=1 5=1 6=32768 132 | HardSwish Div_412 1 1 579 587 0=1.666667e-01 133 | Split splitncnn_16 1 2 587 587_splitncnn_0 587_splitncnn_1 134 | Interp Resize_414 1 1 587_splitncnn_1 597 0=1 1=2.000000e+00 2=2.000000e+00 135 | Concat Concat_415 2 1 597 356_splitncnn_0 598 136 | Split splitncnn_17 1 2 598 598_splitncnn_0 598_splitncnn_1 137 | Convolution Conv_416 1 1 598_splitncnn_1 599 0=64 1=1 5=1 6=16384 138 | HardSwish Div_424 1 1 599 607 0=1.666667e-01 139 | Convolution Conv_425 1 1 607 608 0=64 1=1 5=1 6=4096 140 | HardSwish Div_433 1 1 608 616 0=1.666667e-01 141 | Convolution Conv_434 1 1 616 617 0=64 1=3 4=1 5=1 6=36864 142 | HardSwish Div_442 1 1 617 625 0=1.666667e-01 143 | Convolution Conv_443 1 1 625 626 0=64 1=1 6=4096 144 | Convolution Conv_444 1 1 598_splitncnn_0 627 0=64 1=1 6=16384 145 | Concat Concat_445 2 1 626 627 628 146 | BatchNorm BatchNormalization_446 1 1 628 629 0=128 147 | ReLU LeakyRelu_447 1 1 629 630 0=1.000000e-01 148 | Convolution Conv_448 1 1 630 631 0=128 1=1 5=1 6=16384 149 | HardSwish Div_456 1 1 631 639 0=1.666667e-01 150 | Split splitncnn_18 1 2 639 639_splitncnn_0 639_splitncnn_1 151 | Convolution Conv_457 1 1 639_splitncnn_1 640 0=128 1=3 3=2 4=1 5=1 6=147456 152 | HardSwish Div_465 1 1 640 648 0=1.666667e-01 153 | Concat Concat_466 2 1 648 587_splitncnn_0 649 154 | Split splitncnn_19 1 2 649 649_splitncnn_0 649_splitncnn_1 155 | Convolution Conv_467 1 1 649_splitncnn_1 650 0=128 1=1 5=1 6=32768 156 | HardSwish Div_475 1 1 650 658 0=1.666667e-01 157 | Convolution Conv_476 1 1 658 659 0=128 1=1 5=1 6=16384 158 | HardSwish Div_484 1 1 659 667 0=1.666667e-01 159 | Convolution Conv_485 1 1 667 668 0=128 1=3 4=1 5=1 6=147456 160 | HardSwish Div_493 1 1 668 676 0=1.666667e-01 161 | Convolution Conv_494 1 1 676 677 0=128 1=1 6=16384 162 | Convolution Conv_495 1 1 649_splitncnn_0 678 0=128 1=1 6=32768 163 | Concat Concat_496 2 1 677 678 679 164 | BatchNorm BatchNormalization_497 1 1 679 680 0=256 165 | ReLU LeakyRelu_498 1 1 680 681 0=1.000000e-01 166 | Convolution Conv_499 1 1 681 682 0=256 1=1 5=1 6=65536 167 | HardSwish Div_507 1 1 682 690 0=1.666667e-01 168 | Split splitncnn_20 1 2 690 690_splitncnn_0 690_splitncnn_1 169 | Convolution Conv_508 1 1 690_splitncnn_1 691 0=256 1=3 3=2 4=1 5=1 6=589824 170 | HardSwish Div_516 1 1 691 699 0=1.666667e-01 171 | Concat Concat_517 2 1 699 526_splitncnn_0 700 172 | Split splitncnn_21 1 2 700 700_splitncnn_0 700_splitncnn_1 173 | Convolution Conv_518 1 1 700_splitncnn_1 701 0=256 1=1 5=1 6=131072 174 | HardSwish Div_526 1 1 701 709 0=1.666667e-01 175 | Convolution Conv_527 1 1 709 710 0=256 1=1 5=1 6=65536 176 | HardSwish Div_535 1 1 710 718 0=1.666667e-01 177 | Convolution Conv_536 1 1 718 719 0=256 1=3 4=1 5=1 6=589824 178 | HardSwish Div_544 1 1 719 727 0=1.666667e-01 179 | Convolution Conv_545 1 1 727 728 0=256 1=1 6=65536 180 | Convolution Conv_546 1 1 700_splitncnn_0 729 
0=256 1=1 6=131072 181 | Concat Concat_547 2 1 728 729 730 182 | BatchNorm BatchNormalization_548 1 1 730 731 0=512 183 | ReLU LeakyRelu_549 1 1 731 732 0=1.000000e-01 184 | Convolution Conv_550 1 1 732 733 0=512 1=1 5=1 6=262144 185 | HardSwish Div_558 1 1 733 741 0=1.666667e-01 186 | Convolution Conv_559 1 1 639_splitncnn_0 742 0=255 1=1 5=1 6=32640 187 | Reshape Reshape_573 1 1 742 760 0=-1 1=85 2=3 188 | Permute Transpose_574 1 1 760 output 0=1 189 | Convolution Conv_575 1 1 690_splitncnn_0 762 0=255 1=1 5=1 6=65280 190 | Reshape Reshape_589 1 1 762 780 0=-1 1=85 2=3 191 | Permute Transpose_590 1 1 780 781 0=1 192 | Convolution Conv_591 1 1 741 782 0=255 1=1 5=1 6=130560 193 | Reshape Reshape_605 1 1 782 800 0=-1 1=85 2=3 194 | Permute Transpose_606 1 1 800 801 0=1 195 | --------------------------------------------------------------------------------
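For orientation, the three Permute blobs declared at the very end of this graph ("output", "781" and "801") are the detection heads that yolov5.cpp reads back, one per stride. The fragment below simply mirrors the extractor calls already present in detect_yolov5():

```cpp
// Mirrors detect_yolov5() in yolov5.cpp; the blob names come from yolov5s.param.
ncnn::Extractor ex = yolov5.create_extractor();
ex.input("images", in_pad);        // "Input images" in the param file

ncnn::Mat out8, out16, out32;
ex.extract("output", out8);        // Transpose_574 -> stride 8 head
ex.extract("781", out16);          // Transpose_590 -> stride 16 head
ex.extract("801", out32);          // Transpose_606 -> stride 32 head
```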