├── Gradient Response Maps for Real-TimeDetection of Textureless Objects.pdf ├── README.md ├── Transforms in shape-based matching.pdf ├── match.png └── shape_based_matching-subpixel ├── shape_based_matching-subpixel.sln ├── shape_based_matching-subpixel ├── CMakeLists.txt ├── Gradient Response Maps for Real-TimeDetection of Textureless Objects.pdf ├── LICENSE ├── MIPP │ ├── math │ │ ├── avx512_mathfun.h │ │ ├── avx512_mathfun.hxx │ │ ├── avx_mathfun.h │ │ ├── avx_mathfun.hxx │ │ ├── neon_mathfun.h │ │ ├── neon_mathfun.hxx │ │ ├── sse_mathfun.h │ │ └── sse_mathfun.hxx │ ├── mipp.h │ ├── mipp_impl_AVX.hxx │ ├── mipp_impl_AVX512.hxx │ ├── mipp_impl_NEON.hxx │ ├── mipp_impl_SSE.hxx │ ├── mipp_object.hxx │ ├── mipp_scalar_op.h │ └── mipp_scalar_op.hxx ├── README.md ├── cuda_icp │ ├── CMakeLists.txt │ ├── geometry.h │ ├── icp.cpp │ ├── icp.cu │ ├── icp.h │ └── scene │ │ ├── common.cpp │ │ ├── common.cu │ │ ├── common.h │ │ └── edge_scene │ │ ├── edge_scene.cpp │ │ ├── edge_scene.cu │ │ └── edge_scene.h ├── demo.ini ├── detector.cpp ├── detector.h ├── line2Dup.cpp ├── line2Dup.h ├── line2Dup.hpp ├── linemod.cpp ├── linemod.hpp ├── match.cpp ├── match.h ├── match.png ├── openCV410.props ├── openCV410d.props ├── pch.cpp ├── pch.h ├── shape_based_matching-subpixel.cpp ├── shape_based_matching-subpixel.vcxproj ├── shape_based_matching-subpixel.vcxproj.filters ├── shape_based_matching-subpixel.vcxproj.user ├── test.cpp ├── test │ ├── case0 │ │ ├── 1.jpg │ │ ├── 2.jpg │ │ ├── 3.png │ │ ├── 4.png │ │ ├── circle_info.yaml │ │ ├── circle_templ.yaml │ │ ├── features │ │ │ ├── nms_templ.png │ │ │ └── no_nms_templ.png │ │ ├── result │ │ │ ├── 1.png │ │ │ ├── 2.png │ │ │ └── 3.png │ │ └── templ │ │ │ └── circle.png │ ├── case1 │ │ ├── test.tif │ │ ├── test_info.yaml │ │ ├── test_templ.yaml │ │ └── train.tif │ ├── case2 │ │ ├── result │ │ │ ├── result.png │ │ │ ├── templ.png │ │ │ └── together.png │ │ ├── test.png │ │ ├── test_info.yaml │ │ ├── test_templ.yaml │ │ └── train.png │ └── 
ori_16bit_experiment │ │ ├── LUT16.txt │ │ ├── LUT_gen.cpp │ │ └── line2Dup_16bit_ori.cpp └── x64 │ └── Debug │ └── vc141.idb └── x64 └── Debug ├── shape_based_matching-subpixel.exe ├── shape_based_matching-subpixel.ilk └── shape_based_matching-subpixel.pdb /Gradient Response Maps for Real-TimeDetection of Textureless Objects.pdf: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/daxiaHuang/shape_based_matching_subpixel/6955d0b3e785716297ede4fbd6102283a0ab4043/Gradient Response Maps for Real-TimeDetection of Textureless Objects.pdf -------------------------------------------------------------------------------- /README.md: -------------------------------------------------------------------------------- 1 | # shape_based_matching 2 | 3 | update: 4 | [Transforms in shape-based matching](./Transforms%20in%20shape-based%20matching.pdf) 5 | [pose refine with icp branch](https://github.com/meiqua/shape_based_matching/tree/icp2D), 0.1-0.5 degree accuracy 6 | [icp + subpixel branch](https://github.com/meiqua/shape_based_matching/tree/subpixel), < 0.1 degree accuracy 7 | [icp + subpixel + sim3(previous is so3) branch](https://github.com/meiqua/shape_based_matching/tree/sim3), deals with scale error 8 | 9 | This project tries to implement HALCON's shape-based matching; refer to Machine Vision Algorithms and Applications, page 317, section 3.11.5, written by HALCON engineers. 10 | We find that shape-based matching is essentially the same as linemod. [linemod pdf](Gradient%20Response%20Maps%20for%20Real-TimeDetection%20of%20Textureless%20Objects.pdf) 11 | 12 | HALCON's match solution guide on how to select matching methods ([halcon documentation](https://www.mvtec.com/products/halcon/documentation/#reference_manual)): 13 | ![match](./match.png) 14 | 15 | ## steps 16 | 17 | 1. change the prefix in test.cpp line 9 to the top-level folder 18 | 19 | 2. 
in CMakeLists.txt, change /opt/ros/kinetic on the `set(CMAKE_PREFIX_PATH ...)` line to somewhere OpenCV 3 can be found (if OpenCV 3 is installed in the default environment, this isn't needed) 20 | 21 | 3. cmake, make & run. To learn the usage, see the different tests in test.cpp. In particular, scale_test is fully commented. 22 | 23 | NOTE: On Windows, it's confirmed that Visual Studio 2017 works fine, but there are some problems with MIPP in VS2013. You may want the old code without [MIPP](https://github.com/aff3ct/MIPP): [old commit](https://github.com/meiqua/shape_based_matching/tree/fc3560a1a3bc7c6371eacecdb6822244baac17ba) 24 | 25 | ## thoughts about the method 26 | 27 | The key of shape-based matching, or linemod, is using gradient orientation only. Though both edges and orientations are resistant to disturbance, 28 | an edge carries only 1 bit of information (there is an edge or not), so it's hard to pick the desired shapes out when there are too many edges; yet we need as many edges as possible if we want to find all the target shapes. It's quite a dilemma. 29 | 30 | However, gradient orientation carries much more information than an edge, so we can easily match the shape's orientations against the image's dense orientations by template matching across the image. 31 | 32 | Speed is also important. Thanks to linemod's speed-up tricks, we can handle 1000 templates in 20 ms or so. 33 | 34 | [Chinese blog about the thoughts](https://www.zhihu.com/question/39513724/answer/441677905) 35 | 36 | ## improvement 37 | 38 | Compared to the OpenCV linemod source, we improve on 6 aspects: 39 | 40 | 1. remove the depth modality so we don't need virtual functions; this may speed things up 41 | 42 | 2. OpenCV linemod can't use more than 63 features; now we can have up to 8191 43 | 44 | 3. simple code for rotating and scaling images for training; see test.cpp for examples 45 | 46 | 4. NMS for accurate edge selection 47 | 48 | 5. single-channel orientation extraction to save time; slightly faster for gray images 49 | 50 | 6. 
use [MIPP](https://github.com/aff3ct/MIPP) for cross-platform SIMD, e.g. x86 SSE/AVX and ARM NEON. 51 | For better performance, we have extended MIPP with uint8_t support for some instructions (otherwise we could only use 52 | half the feature points to avoid int8_t overflow). 53 | 54 | ## some tests 55 | 56 | ### Example for circle shape 57 | 58 | #### You can imagine how many circles we would find if we used edges 59 | ![circle1](test/case0/1.jpg) 60 | ![circle1](test/case0/result/1.png) 61 | 62 | #### Not that circular 63 | ![circle2](test/case0/2.jpg) 64 | ![circle2](test/case0/result/2.png) 65 | 66 | #### Blur 67 | ![circle3](test/case0/3.png) 68 | ![circle3](test/case0/result/3.png) 69 | 70 | ### circle template before and after NMS 71 | 72 | #### before NMS 73 | 74 | ![before](test/case0/features/no_nms_templ.png) 75 | 76 | #### after NMS 77 | 78 | ![after](test/case0/features/nms_templ.png) 79 | 80 | ### Simple example for an arbitrary shape 81 | 82 | Well, the example is too simple to show the robustness. 83 | Running time: 1024x1024, 60 ms to construct the response map, 7 ms for 360 templates 84 | 85 | test image & template features 86 | ![test](./test/case1/result.png) 87 | ![templ](test/case1/templ.png) 88 | 89 | 90 | ### noise test 91 | 92 | ![test2](test/case2/result/together.png) 93 | -------------------------------------------------------------------------------- /Transforms in shape-based matching.pdf: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/daxiaHuang/shape_based_matching_subpixel/6955d0b3e785716297ede4fbd6102283a0ab4043/Transforms in shape-based matching.pdf -------------------------------------------------------------------------------- /match.png: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/daxiaHuang/shape_based_matching_subpixel/6955d0b3e785716297ede4fbd6102283a0ab4043/match.png 
-------------------------------------------------------------------------------- /shape_based_matching-subpixel/shape_based_matching-subpixel.sln: -------------------------------------------------------------------------------- 1 |  2 | Microsoft Visual Studio Solution File, Format Version 12.00 3 | # Visual Studio 15 4 | VisualStudioVersion = 15.0.28307.572 5 | MinimumVisualStudioVersion = 10.0.40219.1 6 | Project("{8BC9CEB8-8B4A-11D0-8D11-00A0C91BC942}") = "shape_based_matching-subpixel", "shape_based_matching-subpixel\shape_based_matching-subpixel.vcxproj", "{43E49A32-B329-4402-ACA1-C14FCD8C5DDE}" 7 | EndProject 8 | Global 9 | GlobalSection(SolutionConfigurationPlatforms) = preSolution 10 | Debug|x64 = Debug|x64 11 | Debug|x86 = Debug|x86 12 | Release|x64 = Release|x64 13 | Release|x86 = Release|x86 14 | EndGlobalSection 15 | GlobalSection(ProjectConfigurationPlatforms) = postSolution 16 | {43E49A32-B329-4402-ACA1-C14FCD8C5DDE}.Debug|x64.ActiveCfg = Debug|x64 17 | {43E49A32-B329-4402-ACA1-C14FCD8C5DDE}.Debug|x64.Build.0 = Debug|x64 18 | {43E49A32-B329-4402-ACA1-C14FCD8C5DDE}.Debug|x86.ActiveCfg = Debug|Win32 19 | {43E49A32-B329-4402-ACA1-C14FCD8C5DDE}.Debug|x86.Build.0 = Debug|Win32 20 | {43E49A32-B329-4402-ACA1-C14FCD8C5DDE}.Release|x64.ActiveCfg = Release|x64 21 | {43E49A32-B329-4402-ACA1-C14FCD8C5DDE}.Release|x64.Build.0 = Release|x64 22 | {43E49A32-B329-4402-ACA1-C14FCD8C5DDE}.Release|x86.ActiveCfg = Release|Win32 23 | {43E49A32-B329-4402-ACA1-C14FCD8C5DDE}.Release|x86.Build.0 = Release|Win32 24 | EndGlobalSection 25 | GlobalSection(SolutionProperties) = preSolution 26 | HideSolutionNode = FALSE 27 | EndGlobalSection 28 | GlobalSection(ExtensibilityGlobals) = postSolution 29 | SolutionGuid = {F5FAC5D4-D792-4D63-910A-EE443F6F623E} 30 | EndGlobalSection 31 | EndGlobal 32 | -------------------------------------------------------------------------------- /shape_based_matching-subpixel/shape_based_matching-subpixel/CMakeLists.txt: 
-------------------------------------------------------------------------------- 1 | cmake_minimum_required(VERSION 2.8) 2 | set (CMAKE_CXX_STANDARD 14) 3 | project(shape_based_matching) 4 | 5 | 6 | # debug or release 7 | SET(CMAKE_BUILD_TYPE "Release") 8 | #SET(CMAKE_BUILD_TYPE "Debug") 9 | 10 | 11 | # arm or x86 12 | IF(${CMAKE_SYSTEM_PROCESSOR} MATCHES "arm") 13 | SET(PLATFORM_COMPILE_FLAGS "-mfpu=neon") 14 | ELSE() 15 | SET(PLATFORM_COMPILE_FLAGS "-march=native") 16 | 17 | # some places of the algorithm are designed for 128 SIMD 18 | # so 128 SSE may slightly faster than 256 AVX, you may want this 19 | # SET(PLATFORM_COMPILE_FLAGS "-msse -msse2 -msse3 -msse4 -mssse3") # SSE only 20 | ENDIF() 21 | 22 | # SET(PLATFORM_COMPILE_FLAGS "-DMIPP_NO_INTRINSICS") # close SIMD 23 | SET(COMMON_COMPILE_FLAGS "-fopenmp -Wall -Wno-sign-compare") 24 | SET(CMAKE_CXX_FLAGS "${PLATFORM_COMPILE_FLAGS} ${COMMON_COMPILE_FLAGS} $ENV{CXXFLAGS}") 25 | SET(CMAKE_CXX_FLAGS_DEBUG "-O0 -g2 -ggdb") 26 | SET(CMAKE_CXX_FLAGS_RELEASE "-O3") 27 | 28 | 29 | # opencv 30 | set(CMAKE_PREFIX_PATH ${CMAKE_PREFIX_PATH} /opt/ros/kinetic) 31 | find_package(OpenCV 3 REQUIRED) 32 | include_directories(${OpenCV_INCLUDE_DIRS}) 33 | 34 | 35 | # include MIPP headers 36 | include_directories (${INCLUDE_DIRECTORIES} "${CMAKE_CURRENT_SOURCE_DIR}/MIPP/") 37 | 38 | 39 | # icp for refine 40 | option(USE_CUDA "use cuda or not" OFF) 41 | 42 | if(USE_CUDA) 43 | set(CUDA_TOOLKIT_ROOT_DIR /usr/local/cuda-10.0) 44 | add_definitions(-DCUDA_ON) 45 | endif() 46 | 47 | add_subdirectory(cuda_icp) 48 | 49 | 50 | # test exe 51 | add_executable(${PROJECT_NAME}_test line2Dup.cpp test.cpp) 52 | target_link_libraries(${PROJECT_NAME}_test ${OpenCV_LIBS} cuda_icp) 53 | 54 | -------------------------------------------------------------------------------- /shape_based_matching-subpixel/shape_based_matching-subpixel/Gradient Response Maps for Real-TimeDetection of Textureless Objects.pdf: 
-------------------------------------------------------------------------------- https://raw.githubusercontent.com/daxiaHuang/shape_based_matching_subpixel/6955d0b3e785716297ede4fbd6102283a0ab4043/shape_based_matching-subpixel/shape_based_matching-subpixel/Gradient Response Maps for Real-TimeDetection of Textureless Objects.pdf -------------------------------------------------------------------------------- /shape_based_matching-subpixel/shape_based_matching-subpixel/LICENSE: -------------------------------------------------------------------------------- 1 | BSD 2-Clause License 2 | 3 | Copyright (c) 2018, 4 | All rights reserved. 5 | 6 | Redistribution and use in source and binary forms, with or without 7 | modification, are permitted provided that the following conditions are met: 8 | 9 | * Redistributions of source code must retain the above copyright notice, this 10 | list of conditions and the following disclaimer. 11 | 12 | * Redistributions in binary form must reproduce the above copyright notice, 13 | this list of conditions and the following disclaimer in the documentation 14 | and/or other materials provided with the distribution. 15 | 16 | THIS SOFTWARE IS PROVIDED BY THE COPYRIGHT HOLDERS AND CONTRIBUTORS "AS IS" 17 | AND ANY EXPRESS OR IMPLIED WARRANTIES, INCLUDING, BUT NOT LIMITED TO, THE 18 | IMPLIED WARRANTIES OF MERCHANTABILITY AND FITNESS FOR A PARTICULAR PURPOSE ARE 19 | DISCLAIMED. IN NO EVENT SHALL THE COPYRIGHT HOLDER OR CONTRIBUTORS BE LIABLE 20 | FOR ANY DIRECT, INDIRECT, INCIDENTAL, SPECIAL, EXEMPLARY, OR CONSEQUENTIAL 21 | DAMAGES (INCLUDING, BUT NOT LIMITED TO, PROCUREMENT OF SUBSTITUTE GOODS OR 22 | SERVICES; LOSS OF USE, DATA, OR PROFITS; OR BUSINESS INTERRUPTION) HOWEVER 23 | CAUSED AND ON ANY THEORY OF LIABILITY, WHETHER IN CONTRACT, STRICT LIABILITY, 24 | OR TORT (INCLUDING NEGLIGENCE OR OTHERWISE) ARISING IN ANY WAY OUT OF THE USE 25 | OF THIS SOFTWARE, EVEN IF ADVISED OF THE POSSIBILITY OF SUCH DAMAGE. 
26 | -------------------------------------------------------------------------------- /shape_based_matching-subpixel/shape_based_matching-subpixel/MIPP/math/avx512_mathfun.h: -------------------------------------------------------------------------------- 1 | /* 2 | AVX512 implementation of sin, cos, sincos, exp and log 3 | 4 | Based on "sse_mathfun.h", by Julien Pommier 5 | http://gruntthepeon.free.fr/ssemath/ 6 | 7 | Copyright (C) 2017 Adrien Cassagne 8 | MIT license 9 | */ 10 | #ifdef __AVX512F__ 11 | #ifndef AVX512_MATHFUN_H_ 12 | #define AVX512_MATHFUN_H_ 13 | 14 | #include <immintrin.h> 15 | 16 | typedef __m512 v16sf; // vector of 16 float (avx512) 17 | 18 | // prototypes 19 | inline v16sf log512_ps(v16sf x); 20 | inline v16sf exp512_ps(v16sf x); 21 | inline v16sf sin512_ps(v16sf x); 22 | inline v16sf cos512_ps(v16sf x); 23 | inline void sincos512_ps(v16sf x, v16sf *s, v16sf *c); 24 | 25 | #include "avx512_mathfun.hxx" 26 | 27 | #endif 28 | #endif 29 | -------------------------------------------------------------------------------- /shape_based_matching-subpixel/shape_based_matching-subpixel/MIPP/math/avx512_mathfun.hxx: -------------------------------------------------------------------------------- 1 | /* 2 | AVX512 implementation of sin, cos, sincos, exp and log 3 | 4 | Based on "sse_mathfun.h", by Julien Pommier 5 | http://gruntthepeon.free.fr/ssemath/ 6 | 7 | Copyright (C) 2017 Adrien Cassagne 8 | MIT license 9 | */ 10 | #ifdef __AVX512F__ 11 | 12 | #include "avx512_mathfun.h" 13 | 14 | typedef __m512i v16si; // vector of 16 int (avx) 15 | 16 | /* yes I know, the top of this file is quite ugly */ 17 | #ifdef _MSC_VER /* visual c++ */ 18 | # define ALIGN32_BEG __declspec(align(32)) 19 | # define ALIGN32_END 20 | #else /* gcc or icc */ 21 | # define ALIGN32_BEG 22 | # define ALIGN32_END __attribute__((aligned(32))) 23 | #endif 24 | 25 | /* declare some AVX512 constants -- why can't I figure a better way to do that?
*/ 26 | #define _PS512_CONST(Name, Val) \ 27 | static const ALIGN32_BEG float _ps512_##Name[16] ALIGN32_END = { Val, Val, Val, Val, Val, Val, Val, Val, Val, Val, Val, Val, Val, Val, Val, Val } 28 | #define _PI32_CONST512(Name, Val) \ 29 | static const ALIGN32_BEG int _pi32_512_##Name[16] ALIGN32_END = { Val, Val, Val, Val, Val, Val, Val, Val, Val, Val, Val, Val, Val, Val, Val, Val } 30 | #define _PS512_CONST_TYPE(Name, Type, Val) \ 31 | static const ALIGN32_BEG Type _ps512_##Name[16] ALIGN32_END = { Val, Val, Val, Val, Val, Val, Val, Val, Val, Val, Val, Val, Val, Val, Val, Val } 32 | 33 | _PS512_CONST(1 , 1.0f); 34 | _PS512_CONST(0p5, 0.5f); 35 | /* the smallest non denormalized float number */ 36 | _PS512_CONST_TYPE(min_norm_pos, int, 0x00800000); 37 | //_PS512_CONST_TYPE(mant_mask, int, 0x7f800000); 38 | _PS512_CONST_TYPE(inv_mant_mask, int, ~0x7f800000); 39 | 40 | _PS512_CONST_TYPE(sign_mask, int, (int)0x80000000); 41 | _PS512_CONST_TYPE(inv_sign_mask, int, ~0x80000000); 42 | 43 | _PI32_CONST512(0, 0); 44 | _PI32_CONST512(1, 1); 45 | _PI32_CONST512(0xffffffff, (int)0xFFFFFFFF); 46 | _PI32_CONST512(inv1, ~1); 47 | _PI32_CONST512(2, 2); 48 | _PI32_CONST512(4, 4); 49 | _PI32_CONST512(0x7f, 0x7f); 50 | 51 | _PS512_CONST(cephes_SQRTHF, 0.707106781186547524f); 52 | _PS512_CONST(cephes_log_p0, 7.0376836292E-2f); 53 | _PS512_CONST(cephes_log_p1, - 1.1514610310E-1f); 54 | _PS512_CONST(cephes_log_p2, 1.1676998740E-1f); 55 | _PS512_CONST(cephes_log_p3, - 1.2420140846E-1f); 56 | _PS512_CONST(cephes_log_p4, + 1.4249322787E-1f); 57 | _PS512_CONST(cephes_log_p5, - 1.6668057665E-1f); 58 | _PS512_CONST(cephes_log_p6, + 2.0000714765E-1f); 59 | _PS512_CONST(cephes_log_p7, - 2.4999993993E-1f); 60 | _PS512_CONST(cephes_log_p8, + 3.3333331174E-1f); 61 | _PS512_CONST(cephes_log_q1, -2.12194440e-4f); 62 | _PS512_CONST(cephes_log_q2, 0.693359375f); 63 | 64 | static inline v16si _wrap_mm512_slli_epi32(v16si x, int y) { return _mm512_slli_epi32(x,y); } 65 | static inline v16si 
_wrap_mm512_srli_epi32(v16si x, int y) { return _mm512_srli_epi32(x,y); } 66 | static inline v16si _wrap_mm512_sub_epi32 (v16si x, v16si y) { return _mm512_sub_epi32 (x,y); } 67 | static inline v16si _wrap_mm512_add_epi32 (v16si x, v16si y) { return _mm512_add_epi32 (x,y); } 68 | 69 | 70 | /* natural logarithm computed for 16 simultaneous float 71 | return NaN for x <= 0 72 | */ 73 | v16sf log512_ps(v16sf x) { 74 | v16si imm0; 75 | v16sf one = *(v16sf*)_ps512_1; 76 | 77 | //v16sf invalid_mask = _mm512_cmple_ps(x, _mm512_setzero_ps()); 78 | __mmask16 invalid_mask2 = _mm512_cmp_ps_mask(x, _mm512_setzero_ps(), _CMP_LE_OS); 79 | v16sf invalid_mask = _mm512_mask_blend_ps(invalid_mask2, *(v16sf*)_pi32_512_0, *(v16sf*)_pi32_512_0xffffffff); 80 | 81 | x = _mm512_max_ps(x, *(v16sf*)_ps512_min_norm_pos); /* cut off denormalized stuff */ 82 | 83 | // can be done with AVX2 84 | imm0 = _wrap_mm512_srli_epi32(_mm512_castps_si512(x), 23); 85 | 86 | /* keep only the fractional part */ 87 | // x = _mm512_and_ps(x, *(v16sf*)_ps512_inv_mant_mask); 88 | x = _mm512_castsi512_ps(_mm512_and_si512(_mm512_castps_si512(x), _mm512_castps_si512(*(v16sf*)_ps512_inv_mant_mask))); 89 | // x = _mm512_or_ps(x, *(v16sf*)_ps512_0p5); 90 | x = _mm512_castsi512_ps(_mm512_or_si512(_mm512_castps_si512(x), _mm512_castps_si512(*(v16sf*)_ps512_0p5))); 91 | 92 | // this is again another AVX2 instruction 93 | imm0 = _wrap_mm512_sub_epi32(imm0, *(v16si*)_pi32_512_0x7f); 94 | v16sf e = _mm512_cvtepi32_ps(imm0); 95 | 96 | e = _mm512_add_ps(e, one); 97 | 98 | /* part2: 99 | if( x < SQRTHF ) { 100 | e -= 1; 101 | x = x + x - 1.0; 102 | } else { x = x - 1.0; } 103 | */ 104 | //v16sf mask = _mm512_cmplt_ps(x, *(v16sf*)_ps512_cephes_SQRTHF); 105 | __mmask16 mask2 = _mm512_cmp_ps_mask(x, *(v16sf*)_ps512_cephes_SQRTHF, _CMP_LT_OS); 106 | v16sf mask = _mm512_mask_blend_ps(mask2, *(v16sf*)_pi32_512_0, *(v16sf*)_pi32_512_0xffffffff); 107 | 108 | // v16sf tmp = _mm512_and_ps(x, mask); 109 | v16sf tmp = 
_mm512_castsi512_ps(_mm512_and_si512(_mm512_castps_si512(x), _mm512_castps_si512(mask))); 110 | x = _mm512_sub_ps(x, one); 111 | // e = _mm512_sub_ps(e, _mm512_and_ps(one, mask)); 112 | e = _mm512_sub_ps(e, _mm512_castsi512_ps(_mm512_and_si512(_mm512_castps_si512(one), _mm512_castps_si512(mask)))); 113 | x = _mm512_add_ps(x, tmp); 114 | 115 | v16sf z = _mm512_mul_ps(x,x); 116 | 117 | v16sf y = *(v16sf*)_ps512_cephes_log_p0; 118 | y = _mm512_mul_ps(y, x); 119 | y = _mm512_add_ps(y, *(v16sf*)_ps512_cephes_log_p1); 120 | y = _mm512_mul_ps(y, x); 121 | y = _mm512_add_ps(y, *(v16sf*)_ps512_cephes_log_p2); 122 | y = _mm512_mul_ps(y, x); 123 | y = _mm512_add_ps(y, *(v16sf*)_ps512_cephes_log_p3); 124 | y = _mm512_mul_ps(y, x); 125 | y = _mm512_add_ps(y, *(v16sf*)_ps512_cephes_log_p4); 126 | y = _mm512_mul_ps(y, x); 127 | y = _mm512_add_ps(y, *(v16sf*)_ps512_cephes_log_p5); 128 | y = _mm512_mul_ps(y, x); 129 | y = _mm512_add_ps(y, *(v16sf*)_ps512_cephes_log_p6); 130 | y = _mm512_mul_ps(y, x); 131 | y = _mm512_add_ps(y, *(v16sf*)_ps512_cephes_log_p7); 132 | y = _mm512_mul_ps(y, x); 133 | y = _mm512_add_ps(y, *(v16sf*)_ps512_cephes_log_p8); 134 | y = _mm512_mul_ps(y, x); 135 | 136 | y = _mm512_mul_ps(y, z); 137 | 138 | tmp = _mm512_mul_ps(e, *(v16sf*)_ps512_cephes_log_q1); 139 | y = _mm512_add_ps(y, tmp); 140 | 141 | 142 | tmp = _mm512_mul_ps(z, *(v16sf*)_ps512_0p5); 143 | y = _mm512_sub_ps(y, tmp); 144 | 145 | tmp = _mm512_mul_ps(e, *(v16sf*)_ps512_cephes_log_q2); 146 | x = _mm512_add_ps(x, y); 147 | x = _mm512_add_ps(x, tmp); 148 | // x = _mm512_or_ps(x, invalid_mask); // negative arg will be NAN 149 | x = _mm512_castsi512_ps(_mm512_or_si512(_mm512_castps_si512(x), _mm512_castps_si512(invalid_mask))); 150 | return x; 151 | } 152 | 153 | _PS512_CONST(exp_hi, 88.3762626647949f); 154 | _PS512_CONST(exp_lo, -88.3762626647949f); 155 | 156 | _PS512_CONST(cephes_LOG2EF, 1.44269504088896341f); 157 | _PS512_CONST(cephes_exp_C1, 0.693359375f); 158 | _PS512_CONST(cephes_exp_C2, 
-2.12194440e-4f); 159 | 160 | _PS512_CONST(cephes_exp_p0, 1.9875691500E-4f); 161 | _PS512_CONST(cephes_exp_p1, 1.3981999507E-3f); 162 | _PS512_CONST(cephes_exp_p2, 8.3334519073E-3f); 163 | _PS512_CONST(cephes_exp_p3, 4.1665795894E-2f); 164 | _PS512_CONST(cephes_exp_p4, 1.6666665459E-1f); 165 | _PS512_CONST(cephes_exp_p5, 5.0000001201E-1f); 166 | 167 | v16sf exp512_ps(v16sf x) { 168 | v16sf tmp = _mm512_setzero_ps(), fx; 169 | v16si imm0; 170 | v16sf one = *(v16sf*)_ps512_1; 171 | 172 | x = _mm512_min_ps(x, *(v16sf*)_ps512_exp_hi); 173 | x = _mm512_max_ps(x, *(v16sf*)_ps512_exp_lo); 174 | 175 | /* express exp(x) as exp(g + n*log(2)) */ 176 | fx = _mm512_mul_ps(x, *(v16sf*)_ps512_cephes_LOG2EF); 177 | fx = _mm512_add_ps(fx, *(v16sf*)_ps512_0p5); 178 | 179 | /* how to perform a floorf with SSE: just below */ 180 | //imm0 = _mm512_cvttps_epi32(fx); 181 | //tmp = _mm512_cvtepi32_ps(imm0); 182 | 183 | tmp = _mm512_floor_ps(fx); 184 | 185 | /* if greater, substract 1 */ 186 | //v16sf mask = _mm512_cmpgt_ps(tmp, fx); 187 | // v16sf mask = _mm512_cmp_ps(tmp, fx, _CMP_GT_OS); 188 | __mmask16 mask2 = _mm512_cmp_ps_mask(tmp, fx, _CMP_GT_OS); 189 | v16sf mask = _mm512_mask_blend_ps(mask2, *(v16sf*)_pi32_512_0, *(v16sf*)_pi32_512_0xffffffff); 190 | // mask = _mm512_and_ps(mask, one); 191 | mask = _mm512_castsi512_ps(_mm512_and_si512(_mm512_castps_si512(mask), _mm512_castps_si512(one))); 192 | fx = _mm512_sub_ps(tmp, mask); 193 | 194 | tmp = _mm512_mul_ps(fx, *(v16sf*)_ps512_cephes_exp_C1); 195 | v16sf z = _mm512_mul_ps(fx, *(v16sf*)_ps512_cephes_exp_C2); 196 | x = _mm512_sub_ps(x, tmp); 197 | x = _mm512_sub_ps(x, z); 198 | 199 | z = _mm512_mul_ps(x,x); 200 | 201 | v16sf y = *(v16sf*)_ps512_cephes_exp_p0; 202 | y = _mm512_mul_ps(y, x); 203 | y = _mm512_add_ps(y, *(v16sf*)_ps512_cephes_exp_p1); 204 | y = _mm512_mul_ps(y, x); 205 | y = _mm512_add_ps(y, *(v16sf*)_ps512_cephes_exp_p2); 206 | y = _mm512_mul_ps(y, x); 207 | y = _mm512_add_ps(y, *(v16sf*)_ps512_cephes_exp_p3); 208 | y = 
_mm512_mul_ps(y, x); 209 | y = _mm512_add_ps(y, *(v16sf*)_ps512_cephes_exp_p4); 210 | y = _mm512_mul_ps(y, x); 211 | y = _mm512_add_ps(y, *(v16sf*)_ps512_cephes_exp_p5); 212 | y = _mm512_mul_ps(y, z); 213 | y = _mm512_add_ps(y, x); 214 | y = _mm512_add_ps(y, one); 215 | 216 | /* build 2^n */ 217 | imm0 = _mm512_cvttps_epi32(fx); 218 | // another two AVX2 instructions 219 | imm0 = _wrap_mm512_add_epi32(imm0, *(v16si*)_pi32_512_0x7f); 220 | imm0 = _wrap_mm512_slli_epi32(imm0, 23); 221 | v16sf pow2n = _mm512_castsi512_ps(imm0); 222 | y = _mm512_mul_ps(y, pow2n); 223 | return y; 224 | } 225 | 226 | _PS512_CONST(minus_cephes_DP1, -0.78515625f); 227 | _PS512_CONST(minus_cephes_DP2, -2.4187564849853515625e-4f); 228 | _PS512_CONST(minus_cephes_DP3, -3.77489497744594108e-8f); 229 | _PS512_CONST(sincof_p0, -1.9515295891E-4f); 230 | _PS512_CONST(sincof_p1, 8.3321608736E-3f); 231 | _PS512_CONST(sincof_p2, -1.6666654611E-1f); 232 | _PS512_CONST(coscof_p0, 2.443315711809948E-005f); 233 | _PS512_CONST(coscof_p1, -1.388731625493765E-003f); 234 | _PS512_CONST(coscof_p2, 4.166664568298827E-002f); 235 | _PS512_CONST(cephes_FOPI, 1.27323954473516f); // 4 / M_PI 236 | 237 | 238 | /* evaluation of 16 sines at onces using AVX intrisics 239 | 240 | The code is the exact rewriting of the cephes sinf function. 241 | Precision is excellent as long as x < 8192 (I did not bother to 242 | take into account the special handling they have for greater values 243 | -- it does not return garbage for arguments over 8192, though, but 244 | the extra precision is missing). 245 | 246 | Note that it is such that sinf((float)M_PI) = 8.74e-8, which is the 247 | surprising but correct result. 
248 | 249 | */ 250 | v16sf sin512_ps(v16sf x) { // any x 251 | v16sf xmm1, xmm2 = _mm512_setzero_ps(), xmm3, sign_bit, y; 252 | v16si imm0, imm2; 253 | 254 | sign_bit = x; 255 | /* take the absolute value */ 256 | // x = _mm512_and_ps(x, *(v16sf*)_ps512_inv_sign_mask); 257 | x = _mm512_castsi512_ps(_mm512_and_si512(_mm512_castps_si512(x), _mm512_castps_si512(*(v16sf*)_ps512_inv_sign_mask))); 258 | /* extract the sign bit (upper one) */ 259 | // sign_bit = _mm512_and_ps(sign_bit, *(v16sf*)_ps512_sign_mask); 260 | sign_bit = _mm512_castsi512_ps(_mm512_and_si512(_mm512_castps_si512(sign_bit), _mm512_castps_si512(*(v16sf*)_ps512_sign_mask))); 261 | 262 | /* scale by 4/Pi */ 263 | y = _mm512_mul_ps(x, *(v16sf*)_ps512_cephes_FOPI); 264 | 265 | /* 266 | Here we start a series of integer operations, which are in the 267 | realm of AVX2. 268 | If we don't have AVX, let's perform them using SSE2 directives 269 | */ 270 | 271 | /* store the integer part of y in mm0 */ 272 | imm2 = _mm512_cvttps_epi32(y); 273 | /* j=(j+1) & (~1) (see the cephes sources) */ 274 | // another two AVX2 instruction 275 | imm2 = _wrap_mm512_add_epi32(imm2, *(v16si*)_pi32_512_1); 276 | imm2 = _mm512_and_si512(imm2, *(v16si*)_pi32_512_inv1); 277 | y = _mm512_cvtepi32_ps(imm2); 278 | 279 | /* get the swap sign flag */ 280 | imm0 = _mm512_and_si512(imm2, *(v16si*)_pi32_512_4); 281 | imm0 = _wrap_mm512_slli_epi32(imm0, 29); 282 | /* get the polynom selection mask 283 | there is one polynom for 0 <= x <= Pi/4 284 | and another one for Pi/4 36 | 37 | typedef __m256 v8sf; // vector of 8 float (avx) 38 | 39 | // prototypes 40 | inline v8sf log256_ps(v8sf x); 41 | inline v8sf exp256_ps(v8sf x); 42 | inline v8sf sin256_ps(v8sf x); 43 | inline v8sf cos256_ps(v8sf x); 44 | inline void sincos256_ps(v8sf x, v8sf *s, v8sf *c); 45 | 46 | #include "avx_mathfun.hxx" 47 | 48 | #endif 49 | #endif -------------------------------------------------------------------------------- 
/shape_based_matching-subpixel/shape_based_matching-subpixel/MIPP/math/neon_mathfun.h: -------------------------------------------------------------------------------- 1 | /* NEON implementation of sin, cos, exp and log 2 | 3 | Inspired by Intel Approximate Math library, and based on the 4 | corresponding algorithms of the cephes math library 5 | */ 6 | 7 | /* Copyright (C) 2011 Julien Pommier 8 | 9 | This software is provided 'as-is', without any express or implied 10 | warranty. In no event will the authors be held liable for any damages 11 | arising from the use of this software. 12 | 13 | Permission is granted to anyone to use this software for any purpose, 14 | including commercial applications, and to alter it and redistribute it 15 | freely, subject to the following restrictions: 16 | 17 | 1. The origin of this software must not be misrepresented; you must not 18 | claim that you wrote the original software. If you use this software 19 | in a product, an acknowledgment in the product documentation would be 20 | appreciated but is not required. 21 | 2. Altered source versions must be plainly marked as such, and must not be 22 | misrepresented as being the original software. 23 | 3. This notice may not be removed or altered from any source distribution. 
24 | 25 | (this is the zlib license) 26 | */ 27 | 28 | #if defined(__ARM_NEON__) || defined(__ARM_NEON) 29 | #ifndef NEON_MATHFUN_H_ 30 | #define NEON_MATHFUN_H_ 31 | 32 | #include <arm_neon.h> 33 | 34 | typedef float32x4_t v4sf; // vector of 4 float 35 | 36 | // prototypes 37 | inline v4sf log_ps(v4sf x); 38 | inline v4sf exp_ps(v4sf x); 39 | inline v4sf sin_ps(v4sf x); 40 | inline v4sf cos_ps(v4sf x); 41 | inline void sincos_ps(v4sf x, v4sf *s, v4sf *c); 42 | 43 | #include "neon_mathfun.hxx" 44 | 45 | #endif 46 | #endif -------------------------------------------------------------------------------- /shape_based_matching-subpixel/shape_based_matching-subpixel/MIPP/math/neon_mathfun.hxx: -------------------------------------------------------------------------------- 1 | /* NEON implementation of sin, cos, exp and log 2 | 3 | Inspired by Intel Approximate Math library, and based on the 4 | corresponding algorithms of the cephes math library 5 | */ 6 | 7 | /* Copyright (C) 2011 Julien Pommier 8 | 9 | This software is provided 'as-is', without any express or implied 10 | warranty. In no event will the authors be held liable for any damages 11 | arising from the use of this software. 12 | 13 | Permission is granted to anyone to use this software for any purpose, 14 | including commercial applications, and to alter it and redistribute it 15 | freely, subject to the following restrictions: 16 | 17 | 1. The origin of this software must not be misrepresented; you must not 18 | claim that you wrote the original software. If you use this software 19 | in a product, an acknowledgment in the product documentation would be 20 | appreciated but is not required. 21 | 2. Altered source versions must be plainly marked as such, and must not be 22 | misrepresented as being the original software. 23 | 3. This notice may not be removed or altered from any source distribution. 
24 | 25 | (this is the zlib license) 26 | */ 27 | #if defined(__ARM_NEON__) || defined(__ARM_NEON) 28 | 29 | typedef uint32x4_t v4su; // vector of 4 uint32 30 | typedef int32x4_t v4si; // vector of 4 uint32 31 | 32 | #define c_inv_mant_mask ~0x7f800000u 33 | #define c_cephes_SQRTHF 0.707106781186547524 34 | #define c_cephes_log_p0 7.0376836292E-2 35 | #define c_cephes_log_p1 - 1.1514610310E-1 36 | #define c_cephes_log_p2 1.1676998740E-1 37 | #define c_cephes_log_p3 - 1.2420140846E-1 38 | #define c_cephes_log_p4 + 1.4249322787E-1 39 | #define c_cephes_log_p5 - 1.6668057665E-1 40 | #define c_cephes_log_p6 + 2.0000714765E-1 41 | #define c_cephes_log_p7 - 2.4999993993E-1 42 | #define c_cephes_log_p8 + 3.3333331174E-1 43 | #define c_cephes_log_q1 -2.12194440e-4 44 | #define c_cephes_log_q2 0.693359375 45 | 46 | /* natural logarithm computed for 4 simultaneous float 47 | return NaN for x <= 0 48 | */ 49 | v4sf log_ps(v4sf x) { 50 | v4sf one = vdupq_n_f32(1); 51 | 52 | x = vmaxq_f32(x, vdupq_n_f32(0)); /* force flush to zero on denormal values */ 53 | v4su invalid_mask = vcleq_f32(x, vdupq_n_f32(0)); 54 | 55 | v4si ux = vreinterpretq_s32_f32(x); 56 | 57 | v4si emm0 = vshrq_n_s32(ux, 23); 58 | 59 | /* keep only the fractional part */ 60 | ux = vandq_s32(ux, vdupq_n_s32(c_inv_mant_mask)); 61 | ux = vorrq_s32(ux, vreinterpretq_s32_f32(vdupq_n_f32(0.5f))); 62 | x = vreinterpretq_f32_s32(ux); 63 | 64 | emm0 = vsubq_s32(emm0, vdupq_n_s32(0x7f)); 65 | v4sf e = vcvtq_f32_s32(emm0); 66 | 67 | e = vaddq_f32(e, one); 68 | 69 | /* part2: 70 | if( x < SQRTHF ) { 71 | e -= 1; 72 | x = x + x - 1.0; 73 | } else { x = x - 1.0; } 74 | */ 75 | v4su mask = vcltq_f32(x, vdupq_n_f32(c_cephes_SQRTHF)); 76 | v4sf tmp = vreinterpretq_f32_u32(vandq_u32(vreinterpretq_u32_f32(x), mask)); 77 | x = vsubq_f32(x, one); 78 | e = vsubq_f32(e, vreinterpretq_f32_u32(vandq_u32(vreinterpretq_u32_f32(one), mask))); 79 | x = vaddq_f32(x, tmp); 80 | 81 | v4sf z = vmulq_f32(x,x); 82 | 83 | v4sf y = 
vdupq_n_f32(c_cephes_log_p0); 84 | y = vmulq_f32(y, x); 85 | y = vaddq_f32(y, vdupq_n_f32(c_cephes_log_p1)); 86 | y = vmulq_f32(y, x); 87 | y = vaddq_f32(y, vdupq_n_f32(c_cephes_log_p2)); 88 | y = vmulq_f32(y, x); 89 | y = vaddq_f32(y, vdupq_n_f32(c_cephes_log_p3)); 90 | y = vmulq_f32(y, x); 91 | y = vaddq_f32(y, vdupq_n_f32(c_cephes_log_p4)); 92 | y = vmulq_f32(y, x); 93 | y = vaddq_f32(y, vdupq_n_f32(c_cephes_log_p5)); 94 | y = vmulq_f32(y, x); 95 | y = vaddq_f32(y, vdupq_n_f32(c_cephes_log_p6)); 96 | y = vmulq_f32(y, x); 97 | y = vaddq_f32(y, vdupq_n_f32(c_cephes_log_p7)); 98 | y = vmulq_f32(y, x); 99 | y = vaddq_f32(y, vdupq_n_f32(c_cephes_log_p8)); 100 | y = vmulq_f32(y, x); 101 | 102 | y = vmulq_f32(y, z); 103 | 104 | 105 | tmp = vmulq_f32(e, vdupq_n_f32(c_cephes_log_q1)); 106 | y = vaddq_f32(y, tmp); 107 | 108 | 109 | tmp = vmulq_f32(z, vdupq_n_f32(0.5f)); 110 | y = vsubq_f32(y, tmp); 111 | 112 | tmp = vmulq_f32(e, vdupq_n_f32(c_cephes_log_q2)); 113 | x = vaddq_f32(x, y); 114 | x = vaddq_f32(x, tmp); 115 | x = vreinterpretq_f32_u32(vorrq_u32(vreinterpretq_u32_f32(x), invalid_mask)); // negative arg will be NAN 116 | return x; 117 | } 118 | 119 | #define c_exp_hi 88.3762626647949f 120 | #define c_exp_lo -88.3762626647949f 121 | 122 | #define c_cephes_LOG2EF 1.44269504088896341 123 | #define c_cephes_exp_C1 0.693359375 124 | #define c_cephes_exp_C2 -2.12194440e-4 125 | 126 | #define c_cephes_exp_p0 1.9875691500E-4 127 | #define c_cephes_exp_p1 1.3981999507E-3 128 | #define c_cephes_exp_p2 8.3334519073E-3 129 | #define c_cephes_exp_p3 4.1665795894E-2 130 | #define c_cephes_exp_p4 1.6666665459E-1 131 | #define c_cephes_exp_p5 5.0000001201E-1 132 | 133 | /* exp() computed for 4 float at once */ 134 | v4sf exp_ps(v4sf x) { 135 | v4sf tmp, fx; 136 | 137 | v4sf one = vdupq_n_f32(1); 138 | x = vminq_f32(x, vdupq_n_f32(c_exp_hi)); 139 | x = vmaxq_f32(x, vdupq_n_f32(c_exp_lo)); 140 | 141 | /* express exp(x) as exp(g + n*log(2)) */ 142 | fx = 
vmlaq_f32(vdupq_n_f32(0.5f), x, vdupq_n_f32(c_cephes_LOG2EF)); 143 | 144 | /* perform a floorf */ 145 | tmp = vcvtq_f32_s32(vcvtq_s32_f32(fx)); 146 | 147 | /* if greater, substract 1 */ 148 | v4su mask = vcgtq_f32(tmp, fx); 149 | mask = vandq_u32(mask, vreinterpretq_u32_f32(one)); 150 | 151 | 152 | fx = vsubq_f32(tmp, vreinterpretq_f32_u32(mask)); 153 | 154 | tmp = vmulq_f32(fx, vdupq_n_f32(c_cephes_exp_C1)); 155 | v4sf z = vmulq_f32(fx, vdupq_n_f32(c_cephes_exp_C2)); 156 | x = vsubq_f32(x, tmp); 157 | x = vsubq_f32(x, z); 158 | 159 | static const float cephes_exp_p[6] = { c_cephes_exp_p0, c_cephes_exp_p1, c_cephes_exp_p2, c_cephes_exp_p3, c_cephes_exp_p4, c_cephes_exp_p5 }; 160 | v4sf y = vld1q_dup_f32(cephes_exp_p+0); 161 | v4sf c1 = vld1q_dup_f32(cephes_exp_p+1); 162 | v4sf c2 = vld1q_dup_f32(cephes_exp_p+2); 163 | v4sf c3 = vld1q_dup_f32(cephes_exp_p+3); 164 | v4sf c4 = vld1q_dup_f32(cephes_exp_p+4); 165 | v4sf c5 = vld1q_dup_f32(cephes_exp_p+5); 166 | 167 | y = vmulq_f32(y, x); 168 | z = vmulq_f32(x,x); 169 | y = vaddq_f32(y, c1); 170 | y = vmulq_f32(y, x); 171 | y = vaddq_f32(y, c2); 172 | y = vmulq_f32(y, x); 173 | y = vaddq_f32(y, c3); 174 | y = vmulq_f32(y, x); 175 | y = vaddq_f32(y, c4); 176 | y = vmulq_f32(y, x); 177 | y = vaddq_f32(y, c5); 178 | 179 | y = vmulq_f32(y, z); 180 | y = vaddq_f32(y, x); 181 | y = vaddq_f32(y, one); 182 | 183 | /* build 2^n */ 184 | int32x4_t mm; 185 | mm = vcvtq_s32_f32(fx); 186 | mm = vaddq_s32(mm, vdupq_n_s32(0x7f)); 187 | mm = vshlq_n_s32(mm, 23); 188 | v4sf pow2n = vreinterpretq_f32_s32(mm); 189 | 190 | y = vmulq_f32(y, pow2n); 191 | return y; 192 | } 193 | 194 | #define c_minus_cephes_DP1 -0.78515625 195 | #define c_minus_cephes_DP2 -2.4187564849853515625e-4 196 | #define c_minus_cephes_DP3 -3.77489497744594108e-8 197 | #define c_sincof_p0 -1.9515295891E-4 198 | #define c_sincof_p1 8.3321608736E-3 199 | #define c_sincof_p2 -1.6666654611E-1 200 | #define c_coscof_p0 2.443315711809948E-005 201 | #define c_coscof_p1 
-1.388731625493765E-003 202 | #define c_coscof_p2 4.166664568298827E-002 203 | #define c_cephes_FOPI 1.27323954473516 // 4 / M_PI 204 | 205 | /* evaluation of 4 sines & cosines at once. 206 | 207 | The code is the exact rewriting of the cephes sinf function. 208 | Precision is excellent as long as x < 8192 (I did not bother to 209 | take into account the special handling they have for greater values 210 | -- it does not return garbage for arguments over 8192, though, but 211 | the extra precision is missing). 212 | 213 | Note that it is such that sinf((float)M_PI) = 8.74e-8, which is the 214 | surprising but correct result. 215 | 216 | Note also that when you compute sin(x), cos(x) is available at 217 | almost no extra price so both sin_ps and cos_ps make use of 218 | sincos_ps.. 219 | */ 220 | void sincos_ps(v4sf x, v4sf *ysin, v4sf *ycos) { // any x 221 | v4sf xmm1, xmm2, xmm3, y; 222 | 223 | v4su emm2; 224 | 225 | v4su sign_mask_sin, sign_mask_cos; 226 | sign_mask_sin = vcltq_f32(x, vdupq_n_f32(0)); 227 | x = vabsq_f32(x); 228 | 229 | /* scale by 4/Pi */ 230 | y = vmulq_f32(x, vdupq_n_f32(c_cephes_FOPI)); 231 | 232 | /* store the integer part of y in mm0 */ 233 | emm2 = vcvtq_u32_f32(y); 234 | /* j=(j+1) & (~1) (see the cephes sources) */ 235 | emm2 = vaddq_u32(emm2, vdupq_n_u32(1)); 236 | emm2 = vandq_u32(emm2, vdupq_n_u32(~1)); 237 | y = vcvtq_f32_u32(emm2); 238 | 239 | /* get the polynom selection mask 240 | there is one polynom for 0 <= x <= Pi/4 241 | and another one for Pi/4 37 | 38 | typedef __m128 v4sf; // vector of 4 float (sse1) 39 | 40 | // prototypes 41 | inline v4sf log_ps(v4sf x); 42 | inline v4sf exp_ps(v4sf x); 43 | inline v4sf sin_ps(v4sf x); 44 | inline v4sf cos_ps(v4sf x); 45 | inline void sincos_ps(v4sf x, v4sf *s, v4sf *c); 46 | 47 | #include "sse_mathfun.hxx" 48 | 49 | #endif 50 | #endif -------------------------------------------------------------------------------- 
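The NEON and SSE mathfun kernels above all follow the same cephes-style recipe: clamp the argument, split it as x = n·ln(2) + g, evaluate a short polynomial on g, and rebuild the result as 2^n · exp(g). As a reading aid, here is a scalar C++ translation of exp_ps using the same c_cephes_exp_* constants; note that exp_scalar is our own illustrative name (not a function from MIPP or this repo), and std::ldexp stands in for the exponent-bit trick (vaddq_s32 with 0x7f, then vshlq_n_s32 by 23) used in the vector code.

```cpp
#include <cassert>
#include <cmath>

// Scalar sketch of the cephes-style exp_ps range reduction:
// exp(x) = 2^n * exp(g), with n = round(x * log2(e)) and g = x - n*ln(2),
// then exp(g) is approximated by a degree-5 polynomial.
// Constants mirror the c_cephes_exp_* defines in the SIMD code above.
float exp_scalar(float x) {
    const float exp_hi = 88.3762626647949f, exp_lo = -88.3762626647949f;
    const float LOG2EF = 1.44269504088896341f;
    const float C1 = 0.693359375f, C2 = -2.12194440e-4f; // ln(2) split in two parts
    const float p[6] = {1.9875691500E-4f, 1.3981999507E-3f, 8.3334519073E-3f,
                        4.1665795894E-2f, 1.6666665459E-1f, 5.0000001201E-1f};
    if (x > exp_hi) x = exp_hi;                 // same clamping as vminq/vmaxq
    if (x < exp_lo) x = exp_lo;
    float fx = std::floor(x * LOG2EF + 0.5f);   // n = round(x / ln 2)
    x -= fx * C1;                               // g = x - n*ln(2), subtracted in
    x -= fx * C2;                               // two steps for extra precision
    float z = x * x;
    float y = p[0];
    for (int i = 1; i < 6; ++i) y = y * x + p[i]; // Horner evaluation, degree 5
    y = y * z + x + 1.0f;
    return std::ldexp(y, (int)fx);              // multiply by 2^n
}
```

Comparing exp_scalar against std::exp is an easy way to sanity-check the constants before debugging the SIMD versions.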
/shape_based_matching-subpixel/shape_based_matching-subpixel/MIPP/mipp_scalar_op.h: -------------------------------------------------------------------------------- 1 | #ifndef MIPP_SCALAR_OP_H_ 2 | #define MIPP_SCALAR_OP_H_ 3 | 4 | namespace mipp_scop // My Intrinsics Plus Plus SCalar OPerations 5 | { 6 | template <typename T> 7 | inline T add(const T val1, const T val2); 8 | 9 | template <typename T> 10 | inline T sub(const T val1, const T val2); 11 | 12 | template <typename T> 13 | inline T andb(const T val1, const T val2); 14 | 15 | template <typename T> 16 | inline T xorb(const T val1, const T val2); 17 | 18 | template <typename T> 19 | inline T msb(const T val); 20 | 21 | template <typename T> 22 | inline T div2(const T val); 23 | 24 | template <typename T> 25 | inline T div4(const T val); 26 | 27 | template <typename T> 28 | inline T rshift(const T val, const int n); 29 | 30 | template <typename T> 31 | inline T lshift(const T val, const int n); 32 | } 33 | 34 | #include "mipp_scalar_op.hxx" 35 | 36 | #endif /* MIPP_SCALAR_OP_H_ */ 37 | -------------------------------------------------------------------------------- /shape_based_matching-subpixel/shape_based_matching-subpixel/MIPP/mipp_scalar_op.hxx: -------------------------------------------------------------------------------- 1 | #include <algorithm> 2 | #include <cmath> 3 | #include <limits> 4 | #include <cstdint> 5 | #include <type_traits> 6 | 7 | #include "mipp_scalar_op.h" 8 | 9 | namespace mipp_scop 10 | { 11 | template <typename T> inline T add(const T val1, const T val2) { return val1 + val2; } 12 | template < > inline int16_t add(const int16_t val1, const int16_t val2) { return (int16_t)std::min(std::max((int32_t)((int32_t)val1 + (int32_t)val2),(int32_t)std::numeric_limits<int16_t>::min()),(int32_t)std::numeric_limits<int16_t>::max()); } 13 | template < > inline int8_t add(const int8_t val1, const int8_t val2) { return (int8_t)std::min(std::max((int16_t)((int16_t)val1 + (int16_t)val2),(int16_t)std::numeric_limits<int8_t>::min()),(int16_t)std::numeric_limits<int8_t>::max()); } 14 | 15 | template <typename T> inline T sub(const T val1, const T val2) { return val1 - val2; } 16 | template < > inline int16_t sub(const int16_t val1, const int16_t val2) { return (int16_t)std::min(std::max((int32_t)((int32_t)val1 - (int32_t)val2),(int32_t)std::numeric_limits<int16_t>::min()),(int32_t)std::numeric_limits<int16_t>::max()); } 17 | template < > inline int8_t sub(const int8_t val1, const int8_t val2) { return (int8_t)std::min(std::max((int16_t)((int16_t)val1 - (int16_t)val2),(int16_t)std::numeric_limits<int8_t>::min()),(int16_t)std::numeric_limits<int8_t>::max()); } 18 | 19 | template <typename T> inline T andb(const T val1, const T val2) { return val1 & val2; } 20 | template < > inline double andb(const double val1, const double val2) { return static_cast<double>(static_cast<int64_t>(val1) & static_cast<int64_t>(val2)); } 21 | template < > inline float andb(const float val1, const float val2) { return static_cast<float>(static_cast<int32_t>(val1) & static_cast<int32_t>(val2)); } 22 | 23 | template <typename T> inline T xorb(const T val1, const T val2) { return val1 ^ val2; } 24 | template < > inline double xorb(const double val1, const double val2) { return static_cast<double>(static_cast<int64_t>(val1) ^ static_cast<int64_t>(val2)); } 25 | template < > inline float xorb(const float val1, const float val2) { return static_cast<float>(static_cast<int32_t>(val1) ^ static_cast<int32_t>(val2)); } 26 | 27 | template <typename T> inline T msb(const T val) { return (val >> (sizeof(T) * 8 -1)) << (sizeof(T) * 8 -1); } 28 | template < > inline double msb(const double val) { return static_cast<double>((static_cast<uint64_t>(val) >> 63) << 63); } 29 | template < > inline float msb(const float val) { return static_cast<float>((static_cast<uint32_t>(val) >> 31) << 31); } 30 | template < > inline int64_t msb(const int64_t val) { return static_cast<int64_t>((static_cast<uint64_t>(val) >> 63) << 63); } 31 | template < > inline int32_t msb(const int32_t val) { return static_cast<int32_t>((static_cast<uint32_t>(val) >> 31) << 31); } 32 | template < > inline int16_t msb(const int16_t val) { return static_cast<int16_t>((static_cast<uint16_t>(val) >> 15) << 15); } 33 | template < > inline int8_t msb(const int8_t val) { return static_cast<int8_t>((static_cast<uint8_t>(val) >> 7) << 7); } 34 | 35 | template <typename T> inline T div2(const T val) { return val * (T)0.5; } 36 | template < > inline int64_t div2(const int64_t val) { return val >> 1; } 37 | template < > inline int32_t div2(const int32_t val) { return val >> 1; } 38 | template < > inline int16_t div2(const int16_t val) { return val >> 1; } 39 | template < > inline int8_t div2(const int8_t val) { return val >> 1; } 40 | 41 | template <typename T> inline T div4(const T val) { return val * (T)0.25; } 42 | template < > inline int64_t div4(const int64_t val) { return val >> 2; } 43 | template < > inline int32_t div4(const int32_t val) { return val >> 2; } 44 | template < > inline int16_t div4(const int16_t val) { return val >> 2; } 45 | template < > inline int8_t div4(const int8_t val) { return val >> 2; } 46 | 47 | template <typename T> inline T lshift(const T val, const int n) { return val << n; } 48 | template < > inline double lshift(const double val, const int n) { return static_cast<double>(static_cast<uint64_t>(val) << n); } 49 | template < > inline float lshift(const float val, const int n) { return static_cast<float>(static_cast<uint32_t>(val) << n); } 50 | template < > inline int64_t lshift(const int64_t val, const int n) { return static_cast<int64_t>(static_cast<uint64_t>(val) << n); } 51 | template < > inline int32_t lshift(const int32_t val, const int n) { return static_cast<int32_t>(static_cast<uint32_t>(val) << n); } 52 | template < > inline int16_t lshift(const int16_t val, const int n) { return static_cast<int16_t>(static_cast<uint16_t>(val) << n); } 53 | template < > inline int8_t lshift(const int8_t val, const int n) { return static_cast<int8_t>(static_cast<uint8_t>(val) << n); } 54 | 55 | template <typename T> inline T rshift(const T val, const int n) { return val >> n; } 56 | template < > inline double rshift(const double val, const int n) { return static_cast<double>(static_cast<uint64_t>(val) >> n); } 57 | template < > inline float rshift(const float val, const int n) { return static_cast<float>(static_cast<uint32_t>(val) >> n); } 58 | template < > inline int64_t rshift(const int64_t val, const int n) { return static_cast<int64_t>(static_cast<uint64_t>(val) >> n); } 59 | template < > inline int32_t rshift(const int32_t val, const int n) { return static_cast<int32_t>(static_cast<uint32_t>(val) >> n); }
60 | template < > inline int16_t rshift(const int16_t val, const int n) { return static_cast<int16_t>(static_cast<uint16_t>(val) >> n); } 61 | template < > inline int8_t rshift(const int8_t val, const int n) { return static_cast<int8_t>(static_cast<uint8_t>(val) >> n); } 62 | } 63 | -------------------------------------------------------------------------------- /shape_based_matching-subpixel/shape_based_matching-subpixel/README.md: -------------------------------------------------------------------------------- 1 | # shape_based_matching 2 | 3 | We try to implement Halcon's shape-based matching, referring to Machine Vision Algorithms and Applications (written by Halcon engineers), page 317, section 3.11.5. 4 | We find that shape-based matching is the same as linemod. [linemod pdf](Gradient%20Response%20Maps%20for%20Real-TimeDetection%20of%20Textureless%20Objects.pdf) 5 | 6 | Halcon's match solution guide on how to select a matching method ([halcon documentation](https://www.mvtec.com/products/halcon/documentation/#reference_manual)): 7 | ![match](./match.png) 8 | 9 | ## steps 10 | 11 | 1. change the test.cpp line 9 prefix to the top-level folder 12 | 13 | 2. in CMakeLists.txt line 23, change /opt/ros/kinetic to somewhere OpenCV 3 can be found (not needed if OpenCV 3 is installed in the default env) 14 | 15 | 3. cmake, make & run. To learn the usage, see the different tests in test.cpp. In particular, scale_test is fully commented. 16 | 17 | NOTE: On Windows, Visual Studio 2017 is confirmed to work fine, but there are some problems with MIPP in VS 2013. You may want the old code without [MIPP](https://github.com/aff3ct/MIPP): [old commit](https://github.com/meiqua/shape_based_matching/tree/fc3560a1a3bc7c6371eacecdb6822244baac17ba) 18 | 19 | ## thoughts about the method 20 | 21 | The key of shape-based matching, or linemod, is using gradient orientation only.
Though both edges and orientations are resistant to disturbance, 22 | an edge carries only 1 bit of info (there is an edge or not), so it's hard to dig the wanted shapes out if there are too many edges, yet we need as many edges as possible if we want to find all the target shapes. It's quite a dilemma. 23 | 24 | However, gradient orientation carries much more info than an edge, so we can easily match the shape's orientations against the overwhelming image orientations by template matching across the image. 25 | 26 | Speed is also important. Thanks to the speed-up magic in linemod, we can handle 1000 templates in 20ms or so. 27 | 28 | [Chinese blog about the thoughts](https://www.zhihu.com/question/39513724/answer/441677905) 29 | 30 | ## improvement 31 | 32 | Compared to the OpenCV linemod source, we improve in 6 aspects: 33 | 34 | 1. delete the depth modality so we don't need virtual functions, which may speed things up 35 | 36 | 2. opencv linemod can't use more than 63 features; now we can have up to 8191 37 | 38 | 3. simple code for rotating and scaling images for training; see test.cpp for examples 39 | 40 | 4. nms for accurate edge selection 41 | 42 | 5. one-channel orientation extraction to save time, slightly faster for gray images 43 | 44 | 6. use [MIPP](https://github.com/aff3ct/MIPP) for multi-platform SIMD, for example x86 SSE/AVX and arm neon.
45 | To get better performance, we have extended MIPP to uint8_t for some instructions. (Otherwise we could only use 46 | half of the feature points to avoid int8_t overflow.) 47 | 48 | ## some tests 49 | 50 | ### Example for circle shape 51 | 52 | #### You can imagine how many circles we would find if we used edges 53 | ![circle1](test/case0/1.jpg) 54 | ![circle1](test/case0/result/1.png) 55 | 56 | #### Not that circular 57 | ![circle2](test/case0/2.jpg) 58 | ![circle2](test/case0/result/2.png) 59 | 60 | #### Blur 61 | ![circle3](test/case0/3.png) 62 | ![circle3](test/case0/result/3.png) 63 | 64 | ### circle template before and after nms 65 | 66 | #### before nms 67 | 68 | ![before](test/case0/features/no_nms_templ.png) 69 | 70 | #### after nms 71 | 72 | ![after](test/case0/features/nms_templ.png) 73 | 74 | ### Simple example for an arbitrary shape 75 | 76 | Well, the example is too simple to show the robustness. 77 | Running time: 1024x1024, 60ms to construct the response map, 7ms for 360 templates. 78 | 79 | test img & templ features 80 | ![test](./test/case1/result.png) 81 | ![templ](test/case1/templ.png) 82 | 83 | 84 | ### noise test 85 | 86 | ![test2](test/case2/result/together.png) 87 | -------------------------------------------------------------------------------- /shape_based_matching-subpixel/shape_based_matching-subpixel/cuda_icp/CMakeLists.txt: -------------------------------------------------------------------------------- 1 | 2 | # opencv 3 | find_package(OpenCV 3 REQUIRED) 4 | list(APPEND icp_inc ${OpenCV_INCLUDE_DIRS}) 5 | list(APPEND icp_lib ${OpenCV_LIBS}) 6 | 7 | 8 | if(USE_CUDA) 9 | # cuda 10 | find_package(CUDA REQUIRED) 11 | set(CUDA_NVCC_FLAGS "${CUDA_NVCC_FLAGS} -std=c++11 -O3 --default-stream per-thread -Xcompiler -fopenmp") 12 | list(APPEND icp_inc ${CUDA_INCLUDE_DIRS}) 13 | list(APPEND icp_lib ${CUDA_LIBRARIES}) 14 | endif() 15 | 16 | 17 | # eigen 18 | find_package(Eigen3 REQUIRED) 19 | include_directories(${EIGEN3_INCLUDE_DIR}) 20 | 21 | 22 | # src 23 |
SET(icp_cuda_srcs icp.cu scene/common.cu scene/edge_scene/edge_scene.cu) 24 | SET(icp_srcs icp.cpp scene/common.cpp scene/edge_scene/edge_scene.cpp) 25 | 26 | 27 | if(USE_CUDA) 28 | CUDA_COMPILE(icp_cuda_objs ${icp_cuda_srcs}) 29 | endif() 30 | 31 | # lib & test exe 32 | add_library(cuda_icp 33 | ${icp_srcs} 34 | ${icp_cuda_srcs} 35 | ${icp_cuda_objs} 36 | ) 37 | target_include_directories(cuda_icp PUBLIC ${icp_inc}) 38 | target_link_libraries(cuda_icp PUBLIC ${icp_lib}) 39 | -------------------------------------------------------------------------------- /shape_based_matching-subpixel/shape_based_matching-subpixel/cuda_icp/geometry.h: -------------------------------------------------------------------------------- 1 | // refer to tinyrenderer, make it usable in cuda 2 | 3 | #pragma once 4 | 5 | 6 | #ifdef CUDA_ON 7 | // cuda 8 | #include 9 | #include 10 | #include 11 | 12 | #else 13 | // invalidate cuda macro 14 | #define __device__ 15 | #define __host__ 16 | 17 | #endif 18 | 19 | #include 20 | #include 21 | #include 22 | #include 23 | 24 | template class mat; 25 | 26 | template struct vec { 27 | __device__ __host__ 28 | vec() { for (size_t i=DIM; i--; data_[i] = T()); } 29 | __device__ __host__ 30 | T& operator[](const size_t i) { assert(i operator+ (const vec& other){ 36 | vec res; 37 | for(int i=0; i& operator+= (const vec& other){ 45 | for(int i=0; idata_[i] += other.data_[i]; 47 | } 48 | return *this; 49 | } 50 | 51 | __device__ __host__ 52 | static vec Zero(){ 53 | vec res; 54 | for(int i=0; i struct vec<2,T> { 67 | __device__ __host__ 68 | vec() : x(T()), y(T()) {} 69 | __device__ __host__ 70 | vec(T X, T Y) : x(X), y(Y) {} 71 | 72 | template 73 | __device__ __host__ vec<2,T>(const vec<2,U> &v); 74 | __device__ __host__ 75 | T& operator[](const size_t i) { assert(i<2); return i<=0 ? x : y; } 76 | __device__ __host__ 77 | const T& operator[](const size_t i) const { assert(i<2); return i<=0 ? 
x : y; } 78 | 79 | T x,y; 80 | }; 81 | 82 | ///////////////////////////////////////////////////////////////////////////////// 83 | 84 | template struct vec<3,T> { 85 | __device__ __host__ 86 | vec() : x(T()), y(T()), z(T()) {} 87 | __device__ __host__ 88 | vec(T X, T Y, T Z) : x(X), y(Y), z(Z) {} 89 | 90 | template 91 | __device__ __host__ vec<3,T>(const vec<3,U> &v); 92 | __device__ __host__ 93 | T& operator[](const size_t i) { assert(i<3); return i<=0 ? x : (1==i ? y : z); } 94 | __device__ __host__ 95 | const T& operator[](const size_t i) const { assert(i<3); return i<=0 ? x : (1==i ? y : z); } 96 | __device__ __host__ 97 | float norm() { return std::sqrt(x*x+y*y+z*z); } 98 | __device__ __host__ 99 | vec<3,T> & normalize(T l=1) { *this = (*this)*(l/norm()); return *this; } 100 | 101 | T x,y,z; 102 | }; 103 | 104 | ///////////////////////////////////////////////////////////////////////////////// 105 | 106 | template __device__ __host__ 107 | T operator*(const vec& lhs, const vec& rhs) { 108 | T ret = T(); 109 | for (size_t i=DIM; i--; ret+=lhs[i]*rhs[i]); 110 | return ret; 111 | } 112 | 113 | 114 | template __device__ __host__ 115 | vec operator+(vec lhs, const vec& rhs) { 116 | for (size_t i=DIM; i--; lhs[i]+=rhs[i]); 117 | return lhs; 118 | } 119 | 120 | 121 | template __device__ __host__ 122 | vec operator-(vec lhs, const vec& rhs) { 123 | for (size_t i=DIM; i--; lhs[i]-=rhs[i]); 124 | return lhs; 125 | } 126 | 127 | 128 | template __device__ __host__ 129 | vec operator*(vec lhs, const U& rhs) { 130 | for (size_t i=DIM; i--; lhs[i]*=rhs); 131 | return lhs; 132 | } 133 | 134 | template __device__ __host__ 135 | vec operator/(vec lhs, const U& rhs) { 136 | for (size_t i=DIM; i--; lhs[i]/=rhs); 137 | return lhs; 138 | } 139 | 140 | template __device__ __host__ 141 | vec embed(const vec &v, T fill=1) { 142 | vec ret; 143 | for (size_t i=LEN; i--; ret[i]=(i __device__ __host__ 148 | vec proj(const vec &v) { 149 | vec ret; 150 | for (size_t i=LEN; i--; ret[i]=v[i]); 
151 | return ret; 152 | } 153 | 154 | template vec<3,T> __device__ __host__ 155 | cross(vec<3,T> v1, vec<3,T> v2) { 156 | return vec<3,T>(v1.y*v2.z - v1.z*v2.y, v1.z*v2.x - v1.x*v2.z, v1.x*v2.y - v1.y*v2.x); 157 | } 158 | 159 | template 160 | std::ostream& operator<<(std::ostream& out, vec& v) { 161 | for(unsigned int i=0; i struct dt { 170 | __device__ __host__ 171 | static T det(const mat& src) { 172 | T ret=0; 173 | for (size_t i=DIM; i--; ret += src[0][i]*src.cofactor(0,i)); 174 | return ret; 175 | } 176 | }; 177 | 178 | template struct dt<1,T> { 179 | __device__ __host__ 180 | static T det(const mat<1,1,T>& src) { 181 | return src[0][0]; 182 | } 183 | }; 184 | 185 | ///////////////////////////////////////////////////////////////////////////////// 186 | 187 | template class mat { 188 | vec rows[DimRows]; 189 | public: 190 | __device__ __host__ 191 | mat() {} 192 | 193 | __device__ __host__ 194 | mat(const T* data) { 195 | for(int i=0; i& operator[] (const size_t idx) { 205 | assert(idx& operator[] (const size_t idx) const { 211 | assert(idx col(const size_t idx) const { 217 | assert(idx ret; 219 | for (size_t i=DimRows; i--; ret[i]=rows[i][idx]); 220 | return ret; 221 | } 222 | 223 | __device__ __host__ 224 | void set_col(size_t idx, vec v) { 225 | assert(idx identity() { 231 | mat ret; 232 | for (size_t i=DimRows; i--; ) 233 | for (size_t j=DimCols;j--; ret[i][j]=(i==j)); 234 | return ret; 235 | } 236 | 237 | __device__ __host__ 238 | T det() const { 239 | return dt::det(*this); 240 | } 241 | 242 | __device__ __host__ 243 | mat get_minor(size_t row, size_t col) const { 244 | mat ret; 245 | for (size_t i=DimRows-1; i--; ) 246 | for (size_t j=DimCols-1;j--; ret[i][j]=rows[i adjugate() const { 257 | mat ret; 258 | for (size_t i=DimRows; i--; ) 259 | for (size_t j=DimCols; j--; ret[i][j]=cofactor(i,j)); 260 | return ret; 261 | } 262 | 263 | __device__ __host__ 264 | mat invert_transpose() { 265 | mat ret = adjugate(); 266 | T tmp = ret[0]*rows[0]; 267 | return 
ret/tmp; 268 | } 269 | 270 | __device__ __host__ 271 | mat invert() { 272 | return invert_transpose().transpose(); 273 | } 274 | 275 | __device__ __host__ 276 | mat transpose() { 277 | mat ret; 278 | for (size_t i=DimCols; i--; ret[i]=this->col(i)); 279 | return ret; 280 | } 281 | }; 282 | 283 | ///////////////////////////////////////////////////////////////////////////////// 284 | 285 | template __device__ __host__ 286 | vec operator*(const mat& lhs, const vec& rhs) { 287 | vec ret; 288 | for (size_t i=DimRows; i--; ret[i]=lhs[i]*rhs); 289 | return ret; 290 | } 291 | 292 | template __device__ __host__ 293 | mat operator*(const mat& lhs, const mat& rhs) { 294 | mat result; 295 | for (size_t i=R1; i--; ) 296 | for (size_t j=C2; j--; result[i][j]=lhs[i]*rhs.col(j)); 297 | return result; 298 | } 299 | 300 | template __device__ __host__ 301 | mat operator/(mat lhs, const T& rhs) { 302 | for (size_t i=DimRows; i--; lhs[i]=lhs[i]/rhs); 303 | return lhs; 304 | } 305 | 306 | template 307 | std::ostream& operator<<(std::ostream& out, mat& m) { 308 | for (size_t i=0; i Vec2f; 315 | typedef vec<2, int> Vec2i; 316 | typedef vec<3, float> Vec3f; 317 | typedef vec<3, int> Vec3i; 318 | typedef vec<4, float> Vec4f; 319 | typedef vec<4, float> Vec4i; 320 | typedef mat<4,4,float> Mat4x4f; 321 | typedef mat<3,3,float> Mat3x3f; 322 | 323 | typedef vec<3, float> Vec6f; 324 | typedef mat<6,6,float> Mat6x6f; 325 | 326 | template <> template <> __device__ __host__ 327 | inline vec<3,int> ::vec(const vec<3,float> &v) : x(int(v.x+.5f)),y(int(v.y+.5f)),z(int(v.z+.5f)) {} 328 | template <> template <> __device__ __host__ 329 | inline vec<3,float>::vec(const vec<3,int> &v) : x(v.x),y(v.y),z(v.z) {} 330 | template <> template <> __device__ __host__ 331 | inline vec<2,int> ::vec(const vec<2,float> &v) : x(int(v.x+.5f)),y(int(v.y+.5f)) {} 332 | template <> template <> __device__ __host__ 333 | inline vec<2,float>::vec(const vec<2,int> &v) : x(v.x),y(v.y) {} 334 | 
-------------------------------------------------------------------------------- /shape_based_matching-subpixel/shape_based_matching-subpixel/cuda_icp/icp.cpp: -------------------------------------------------------------------------------- 1 | #include "icp.h" 2 | #include 3 | #include 4 | #include 5 | 6 | namespace cuda_icp{ 7 | 8 | Eigen::Matrix3d TransformVector3dToMatrix3d(const Eigen::Matrix &input) { 9 | Eigen::Matrix3d output = 10 | (Eigen::AngleAxisd(input(0), Eigen::Vector3d::UnitZ())) 11 | .matrix(); 12 | output.block<2, 1>(0, 2) = input.block<2, 1>(1, 0); 13 | return output; 14 | } 15 | 16 | Mat3x3f eigen_to_custom(const Eigen::Matrix3f& extrinsic){ 17 | Mat3x3f result; 18 | for(uint32_t i=0; i<3; i++){ 19 | for(uint32_t j=0; j<3; j++){ 20 | result[i][j] = extrinsic(i, j); 21 | } 22 | } 23 | return result; 24 | } 25 | 26 | Mat3x3f eigen_slover_333(float *A, float *b) 27 | { 28 | Eigen::Matrix A_eigen(A); 29 | Eigen::Matrix b_eigen(b); 30 | const Eigen::Matrix update = A_eigen.cast().ldlt().solve(b_eigen.cast()); 31 | Eigen::Matrix3d extrinsic = TransformVector3dToMatrix3d(update); 32 | return eigen_to_custom(extrinsic.cast()); 33 | } 34 | 35 | void transform_pcd(std::vector& model_pcd, Mat3x3f& trans){ 36 | 37 | #pragma omp parallel for 38 | for(uint32_t i=0; i < model_pcd.size(); i++){ 39 | Vec2f& pcd = model_pcd[i]; 40 | float new_x = trans[0][0]*pcd.x + trans[0][1]*pcd.y + trans[0][2]; 41 | float new_y = trans[1][0]*pcd.x + trans[1][1]*pcd.y + trans[1][2]; 42 | pcd.x = new_x; 43 | pcd.y = new_y; 44 | } 45 | } 46 | 47 | template 48 | RegistrationResult ICP2D_Point2Plane_cpu(std::vector &model_pcd, const Scene scene, 49 | const ICPConvergenceCriteria criteria) 50 | { 51 | RegistrationResult result; 52 | RegistrationResult backup; 53 | 54 | std::vector A_host(9, 0); 55 | std::vector b_host(3, 0); 56 | thrust__pcd2Ab trasnformer(scene); 57 | 58 | // use one extra turn 59 | for(uint32_t iter=0; iter<=criteria.max_iteration_; iter++){ 60 | 61 | Vec11f 
reducer; 62 | 63 | #pragma omp declare reduction( + : Vec11f : omp_out += omp_in) \ 64 | initializer (omp_priv = Vec11f::Zero()) 65 | 66 | #pragma omp parallel for reduction(+: reducer) 67 | for(size_t pcd_iter=0; pcd_iter &model_pcd, const Scene_edge scene, 113 | const ICPConvergenceCriteria criteria); 114 | } 115 | 116 | 117 | 118 | 119 | 120 | -------------------------------------------------------------------------------- /shape_based_matching-subpixel/shape_based_matching-subpixel/cuda_icp/icp.cu: -------------------------------------------------------------------------------- 1 | #include "icp.h" 2 | #include 3 | #include 4 | 5 | namespace cuda_icp{ 6 | #define gpuErrchk(ans) { gpuAssert((ans), __FILE__, __LINE__); } 7 | inline void gpuAssert(cudaError_t code, const char *file, int line, bool abort=true) 8 | { 9 | if (code != cudaSuccess) 10 | { 11 | fprintf(stderr,"GPUassert: %s %s %d\n", cudaGetErrorString(code), file, line); 12 | if (abort) exit(code); 13 | } 14 | } 15 | 16 | 17 | __global__ void transform_pcd_cuda(Vec2f* model_pcd_ptr, uint32_t model_pcd_size, Mat3x3f trans){ 18 | uint32_t i = blockIdx.x*blockDim.x + threadIdx.x; 19 | if(i >= model_pcd_size) return; 20 | 21 | Vec2f& pcd = model_pcd_ptr[i]; 22 | float new_x = trans[0][0]*pcd.x + trans[0][1]*pcd.y + trans[0][2]; 23 | float new_y = trans[1][0]*pcd.x + trans[1][1]*pcd.y + trans[1][2]; 24 | pcd.x = new_x; 25 | pcd.y = new_y; 26 | } 27 | 28 | 29 | template 30 | RegistrationResult ICP2D_Point2Plane_cuda(device_vector_holder &model_pcd, const Scene scene, 31 | const ICPConvergenceCriteria criteria){ 32 | RegistrationResult result; 33 | RegistrationResult backup; 34 | 35 | thrust::host_vector A_host(9, 0); 36 | thrust::host_vector b_host(3, 0); 37 | 38 | const uint32_t threadsPerBlock = 256; 39 | const uint32_t numBlocks = (model_pcd.size() + threadsPerBlock - 1)/threadsPerBlock; 40 | 41 | for(uint32_t iter=0; iter<= criteria.max_iteration_; iter++){ 42 | 43 | Vec11f Ab_tight = 
thrust::transform_reduce(thrust::cuda::par.on(cudaStreamPerThread), 44 | model_pcd.begin_thr(), model_pcd.end_thr(), thrust__pcd2Ab(scene), 45 | Vec11f::Zero(), thrust__plus()); 46 | 47 | cudaStreamSynchronize(cudaStreamPerThread); 48 | backup = result; 49 | 50 | float& count = Ab_tight[10]; 51 | float& total_error = Ab_tight[9]; 52 | if(count == 0) return result; // avoid divid 0 53 | 54 | result.fitness_ = float(count) / model_pcd.size(); 55 | result.inlier_rmse_ = std::sqrt(total_error / count); 56 | 57 | // last extra iter, just compute fitness & mse 58 | if(iter == criteria.max_iteration_) return result; 59 | 60 | if(std::abs(result.fitness_ - backup.fitness_) < criteria.relative_fitness_ && 61 | std::abs(result.inlier_rmse_ - backup.inlier_rmse_) < criteria.relative_rmse_){ 62 | return result; 63 | } 64 | 65 | for(int i=0; i<3; i++) b_host[i] = Ab_tight[6 + i]; 66 | 67 | int shift = 0; 68 | for(int y=0; y<3; y++){ 69 | for(int x=y; x<3; x++){ 70 | A_host[x + y*3] = Ab_tight[shift]; 71 | A_host[y + x*3] = Ab_tight[shift]; 72 | shift++; 73 | } 74 | } 75 | 76 | Mat3x3f extrinsic = eigen_slover_333(A_host.data(), b_host.data()); 77 | 78 | transform_pcd_cuda<<>>(model_pcd.data(), model_pcd.size(), extrinsic); 79 | cudaStreamSynchronize(cudaStreamPerThread); 80 | 81 | result.transformation_ = extrinsic * result.transformation_; 82 | } 83 | 84 | // never arrive here 85 | return result; 86 | } 87 | 88 | template RegistrationResult ICP2D_Point2Plane_cuda(device_vector_holder &model_pcd, const Scene_edge scene, 89 | const ICPConvergenceCriteria criteria); 90 | } 91 | 92 | 93 | 94 | -------------------------------------------------------------------------------- /shape_based_matching-subpixel/shape_based_matching-subpixel/cuda_icp/icp.h: -------------------------------------------------------------------------------- 1 | #pragma once 2 | 3 | #include "geometry.h" 4 | 5 | #ifdef CUDA_ON 6 | #include 7 | #include 8 | #endif 9 | 10 | #include 
"scene/edge_scene/edge_scene.h" 11 | 12 | namespace cuda_icp { 13 | 14 | // use custom mat/vec here, otherwise we have to mix eigen with cuda 15 | // then we may face some error due to eigen vesrion 16 | //class defination refer to open3d 17 | struct RegistrationResult 18 | { 19 | __device__ __host__ 20 | RegistrationResult(const Mat3x3f &transformation = 21 | Mat3x3f::identity()) : transformation_(transformation), 22 | inlier_rmse_(0.0), fitness_(0.0) {} 23 | 24 | Mat3x3f transformation_; 25 | float inlier_rmse_; 26 | float fitness_; 27 | }; 28 | 29 | struct ICPConvergenceCriteria 30 | { 31 | public: 32 | __device__ __host__ 33 | ICPConvergenceCriteria(float relative_fitness = 1e-3f, 34 | float relative_rmse = 1e-3f, int max_iteration = 30) : 35 | relative_fitness_(relative_fitness), relative_rmse_(relative_rmse), 36 | max_iteration_(max_iteration) {} 37 | 38 | float relative_fitness_; 39 | float relative_rmse_; 40 | int max_iteration_; 41 | }; 42 | 43 | // to be used by icp cuda & cpu 44 | // in this way we can avoid eigen mixed with cuda 45 | Mat3x3f eigen_slover_333(float* A, float* b); 46 | 47 | 48 | template 49 | RegistrationResult ICP2D_Point2Plane_cpu(std::vector& model_pcd, 50 | const Scene scene, 51 | const ICPConvergenceCriteria criteria = ICPConvergenceCriteria()); 52 | 53 | extern template RegistrationResult ICP2D_Point2Plane_cpu(std::vector &model_pcd, const Scene_edge scene, 54 | const ICPConvergenceCriteria criteria); 55 | 56 | #ifdef CUDA_ON 57 | template 58 | RegistrationResult ICP2D_Point2Plane_cuda(device_vector_holder &model_pcd, const Scene scene, 59 | const ICPConvergenceCriteria criteria = ICPConvergenceCriteria()); 60 | 61 | extern template RegistrationResult ICP2D_Point2Plane_cuda(device_vector_holder &model_pcd, const Scene_edge scene, 62 | const ICPConvergenceCriteria criteria); 63 | 64 | #endif 65 | 66 | 67 | /// !!!!!!!!!!!!!!!!!! 
low level 68 | 69 | typedef vec<11, float> Vec11f; 70 | // tight: A(symmetric 3x3 --> (9-3)/2+3) + ATb 3 + mse(b*b 1) + count 1 = 11 71 | 72 | template <class Scene> 73 | struct thrust__pcd2Ab 74 | { 75 | Scene __scene; 76 | 77 | __host__ __device__ 78 | thrust__pcd2Ab(Scene scene): __scene(scene){ 79 | 80 | } 81 | 82 | __host__ __device__ Vec11f operator()(const Vec2f &src_pcd) const { 83 | Vec11f result; 84 | Vec2f dst_pcd, dst_normal; bool valid; 85 | __scene.query(src_pcd, dst_pcd, dst_normal, valid); 86 | if(!valid) return result; 87 | else{ 88 | result[10] = 1; // valid count 89 | // dot 90 | float b_temp = (dst_pcd - src_pcd).x * dst_normal.x + 91 | (dst_pcd - src_pcd).y * dst_normal.y; 92 | result[9] = b_temp*b_temp; // mse 93 | 94 | // cross 95 | float A_temp[3]; 96 | A_temp[0] = dst_normal.y*src_pcd.x - dst_normal.x*src_pcd.y; 97 | 98 | A_temp[1] = dst_normal.x; 99 | A_temp[2] = dst_normal.y; 100 | 101 | // ATA lower 102 | // 0 x x 103 | // 1 3 x 104 | // 2 4 5 105 | result[ 0] = A_temp[0] * A_temp[0]; 106 | result[ 1] = A_temp[0] * A_temp[1]; 107 | result[ 2] = A_temp[0] * A_temp[2]; 108 | result[ 3] = A_temp[1] * A_temp[1]; 109 | result[ 4] = A_temp[1] * A_temp[2]; 110 | result[ 5] = A_temp[2] * A_temp[2]; 111 | 112 | // ATb 113 | result[6] = A_temp[0] * b_temp; 114 | result[7] = A_temp[1] * b_temp; 115 | result[8] = A_temp[2] * b_temp; 116 | return result; 117 | } 118 | } 119 | }; 120 | 121 | struct thrust__plus{ 122 | __host__ __device__ Vec11f operator()(const Vec11f &in1, const Vec11f &in2) const{ 123 | return in1 + in2; 124 | } 125 | }; 126 | 127 | } 128 | 129 | 130 | -------------------------------------------------------------------------------- /shape_based_matching-subpixel/shape_based_matching-subpixel/cuda_icp/scene/common.cpp: -------------------------------------------------------------------------------- 1 | #include "common.h" 2 | --------------------------------------------------------------------------------
/shape_based_matching-subpixel/shape_based_matching-subpixel/cuda_icp/scene/common.cu: -------------------------------------------------------------------------------- 1 | #include "common.h" 2 | 3 | template <typename T> 4 | device_vector_holder<T>::~device_vector_holder(){ 5 | __free(); 6 | } 7 | 8 | template <typename T> 9 | void device_vector_holder<T>::__free(){ 10 | if(valid){ 11 | cudaFree(__gpu_memory); 12 | valid = false; 13 | __size = 0; 14 | } 15 | } 16 | 17 | template <typename T> 18 | device_vector_holder<T>::device_vector_holder(size_t size_, T init) 19 | { 20 | __malloc(size_); 21 | thrust::fill(begin_thr(), end_thr(), init); 22 | } 23 | 24 | template <typename T> 25 | void device_vector_holder<T>::__malloc(size_t size_){ 26 | if(valid) __free(); 27 | cudaMalloc((void**)&__gpu_memory, size_ * sizeof(T)); 28 | __size = size_; 29 | valid = true; 30 | } 31 | 32 | template <typename T> 33 | device_vector_holder<T>::device_vector_holder(size_t size_){ 34 | __malloc(size_); 35 | } 36 | 37 | template class device_vector_holder<Vec2f>; 38 | -------------------------------------------------------------------------------- /shape_based_matching-subpixel/shape_based_matching-subpixel/cuda_icp/scene/common.h: -------------------------------------------------------------------------------- 1 | #pragma once 2 | 3 | // common functions frequently used by others 4 | 5 | #include "../geometry.h" 6 | 7 | #include 8 | #include 9 | #include 10 | 11 | #ifdef CUDA_ON 12 | // thrust device vector can't be used in cpp by design 13 | // same code as in the cuda renderer; 14 | // duplicated because we don't want the two to depend on each other 15 | template <typename T> 16 | class device_vector_holder{ 17 | public: 18 | T* __gpu_memory; 19 | size_t __size; 20 | bool valid = false; 21 | device_vector_holder(){} 22 | device_vector_holder(size_t size); 23 | device_vector_holder(size_t size, T init); 24 | ~device_vector_holder(); 25 | 26 | T* data(){return __gpu_memory;} 27 | thrust::device_ptr<T> data_thr(){return thrust::device_ptr<T>(__gpu_memory);} 28 | T* begin(){return __gpu_memory;} 29 |
thrust::device_ptr<T> begin_thr(){return thrust::device_ptr<T>(__gpu_memory);} 30 | T* end(){return __gpu_memory + __size;} 31 | thrust::device_ptr<T> end_thr(){return thrust::device_ptr<T>(__gpu_memory + __size);} 32 | 33 | size_t size(){return __size;} 34 | 35 | void __malloc(size_t size); 36 | void __free(); 37 | }; 38 | 39 | extern template class device_vector_holder<Vec2f>; 40 | #endif 41 | -------------------------------------------------------------------------------- /shape_based_matching-subpixel/shape_based_matching-subpixel/cuda_icp/scene/edge_scene/edge_scene.cpp: -------------------------------------------------------------------------------- 1 | #include "edge_scene.h" 2 | 3 | using namespace cv; 4 | using namespace std; 5 | 6 | // https://github.com/songyuncen/EdgesSubPix/blob/master/EdgesSubPix.cpp 7 | const double scale = 128.0; // sum of half Canny filter is 128 8 | 9 | static void getCannyKernel(OutputArray _d, double alpha) 10 | { 11 | int r = cvRound(alpha * 3); 12 | int ksize = 2 * r + 1; 13 | 14 | _d.create(ksize, 1, CV_16S, -1, true); 15 | 16 | Mat k = _d.getMat(); 17 | 18 | vector<float> kerF(ksize, 0.0f); 19 | kerF[r] = 0.0f; 20 | double a2 = alpha * alpha; 21 | float sum = 0.0f; 22 | for (int x = 1; x <= r; ++x) 23 | { 24 | float v = (float)(-x * std::exp(-x * x / (2 * a2))); 25 | sum += v; 26 | kerF[r + x] = v; 27 | kerF[r - x] = -v; 28 | } 29 | float scale = 128 / sum; 30 | for (int i = 0; i < ksize; ++i) 31 | { 32 | kerF[i] *= scale; 33 | } 34 | Mat temp(ksize, 1, CV_32F, &kerF[0]); 35 | temp.convertTo(k, CV_16S); 36 | } 37 | 38 | // non-maximum suppression and hysteresis 39 | static void postCannyFilter(const Mat &src, Mat &dx, Mat &dy, int low, int high, Mat &dst) 40 | { 41 | ptrdiff_t mapstep = src.cols + 2; 42 | AutoBuffer<uchar> buffer((src.cols + 2)*(src.rows + 2) + mapstep * 3 * sizeof(int)); 43 | 44 | // L2Gradient comparison with square 45 | high = high * high; 46 | low = low * low; 47 | 48 | int* mag_buf[3]; 49 | mag_buf[0] = (int*)(uchar*)buffer; 50 |
mag_buf[1] = mag_buf[0] + mapstep; 51 | mag_buf[2] = mag_buf[1] + mapstep; 52 | memset(mag_buf[0], 0, mapstep*sizeof(int)); 53 | 54 | uchar* map = (uchar*)(mag_buf[2] + mapstep); 55 | memset(map, 1, mapstep); 56 | memset(map + mapstep*(src.rows + 1), 1, mapstep); 57 | 58 | int maxsize = std::max(1 << 10, src.cols * src.rows / 10); 59 | std::vector<uchar*> stack(maxsize); 60 | uchar **stack_top = &stack[0]; 61 | uchar **stack_bottom = &stack[0]; 62 | 63 | /* sector numbers 64 | (Top-Left Origin) 65 | 1 2 3 66 | * * * 67 | * * * 68 | 0*******0 69 | * * * 70 | * * * 71 | 3 2 1 72 | */ 73 | 74 | #define CANNY_PUSH(d) *(d) = uchar(2), *stack_top++ = (d) 75 | #define CANNY_POP(d) (d) = *--stack_top 76 | 77 | #if CV_SSE2 78 | bool haveSSE2 = checkHardwareSupport(CV_CPU_SSE2); 79 | #endif 80 | 81 | // calculate magnitude and angle of gradient, perform non-maxima suppression. 82 | // fill the map with one of the following values: 83 | // 0 - the pixel might belong to an edge 84 | // 1 - the pixel can not belong to an edge 85 | // 2 - the pixel does belong to an edge 86 | for (int i = 0; i <= src.rows; i++) 87 | { 88 | int* _norm = mag_buf[(i > 0) + 1] + 1; 89 | if (i < src.rows) 90 | { 91 | short* _dx = dx.ptr<short>(i); 92 | short* _dy = dy.ptr<short>(i); 93 | 94 | int j = 0, width = src.cols; 95 | #if CV_SSE2 96 | if (haveSSE2) 97 | { 98 | for (; j <= width - 8; j += 8) 99 | { 100 | __m128i v_dx = _mm_loadu_si128((const __m128i *)(_dx + j)); 101 | __m128i v_dy = _mm_loadu_si128((const __m128i *)(_dy + j)); 102 | 103 | __m128i v_dx_ml = _mm_mullo_epi16(v_dx, v_dx), v_dx_mh = _mm_mulhi_epi16(v_dx, v_dx); 104 | __m128i v_dy_ml = _mm_mullo_epi16(v_dy, v_dy), v_dy_mh = _mm_mulhi_epi16(v_dy, v_dy); 105 | 106 | __m128i v_norm = _mm_add_epi32(_mm_unpacklo_epi16(v_dx_ml, v_dx_mh), _mm_unpacklo_epi16(v_dy_ml, v_dy_mh)); 107 | _mm_storeu_si128((__m128i *)(_norm + j), v_norm); 108 | 109 | v_norm = _mm_add_epi32(_mm_unpackhi_epi16(v_dx_ml, v_dx_mh), _mm_unpackhi_epi16(v_dy_ml, v_dy_mh)); 110 |
_mm_storeu_si128((__m128i *)(_norm + j + 4), v_norm); 111 | } 112 | } 113 | #elif CV_NEON 114 | for (; j <= width - 8; j += 8) 115 | { 116 | int16x8_t v_dx = vld1q_s16(_dx + j), v_dy = vld1q_s16(_dy + j); 117 | int16x4_t v_dxp = vget_low_s16(v_dx), v_dyp = vget_low_s16(v_dy); 118 | int32x4_t v_dst = vmlal_s16(vmull_s16(v_dxp, v_dxp), v_dyp, v_dyp); 119 | vst1q_s32(_norm + j, v_dst); 120 | 121 | v_dxp = vget_high_s16(v_dx), v_dyp = vget_high_s16(v_dy); 122 | v_dst = vmlal_s16(vmull_s16(v_dxp, v_dxp), v_dyp, v_dyp); 123 | vst1q_s32(_norm + j + 4, v_dst); 124 | } 125 | #endif 126 | for (; j < width; ++j) 127 | _norm[j] = int(_dx[j])*_dx[j] + int(_dy[j])*_dy[j]; 128 | 129 | _norm[-1] = _norm[src.cols] = 0; 130 | } 131 | else 132 | memset(_norm - 1, 0, /* cn* */mapstep*sizeof(int)); 133 | 134 | // at the very beginning we do not have a complete ring 135 | // buffer of 3 magnitude rows for non-maxima suppression 136 | if (i == 0) 137 | continue; 138 | 139 | uchar* _map = map + mapstep*i + 1; 140 | _map[-1] = _map[src.cols] = 1; 141 | 142 | int* _mag = mag_buf[1] + 1; // take the central row 143 | ptrdiff_t magstep1 = mag_buf[2] - mag_buf[1]; 144 | ptrdiff_t magstep2 = mag_buf[0] - mag_buf[1]; 145 | 146 | const short* _x = dx.ptr<short>(i - 1); 147 | const short* _y = dy.ptr<short>(i - 1); 148 | 149 | if ((stack_top - stack_bottom) + src.cols > maxsize) 150 | { 151 | int sz = (int)(stack_top - stack_bottom); 152 | maxsize = std::max(maxsize * 3 / 2, sz + src.cols); 153 | stack.resize(maxsize); 154 | stack_bottom = &stack[0]; 155 | stack_top = stack_bottom + sz; 156 | } 157 | 158 | int prev_flag = 0; 159 | for (int j = 0; j < src.cols; j++) 160 | { 161 | #define CANNY_SHIFT 15 162 | const int TG22 = (int)(0.4142135623730950488016887242097*(1 << CANNY_SHIFT) + 0.5); 163 | 164 | int m = _mag[j]; 165 | 166 | if (m > low) 167 | { 168 | int xs = _x[j]; 169 | int ys = _y[j]; 170 | int x = std::abs(xs); 171 | int y = std::abs(ys) << CANNY_SHIFT; 172 | 173 | int tg22x = x * TG22; 174 | 175 | if
(y < tg22x) 176 | { 177 | if (m > _mag[j - 1] && m >= _mag[j + 1]) goto __ocv_canny_push; 178 | } 179 | else 180 | { 181 | int tg67x = tg22x + (x << (CANNY_SHIFT + 1)); 182 | if (y > tg67x) 183 | { 184 | if (m > _mag[j + magstep2] && m >= _mag[j + magstep1]) goto __ocv_canny_push; 185 | } 186 | else 187 | { 188 | int s = (xs ^ ys) < 0 ? -1 : 1; 189 | if (m > _mag[j + magstep2 - s] && m > _mag[j + magstep1 + s]) goto __ocv_canny_push; 190 | } 191 | } 192 | } 193 | prev_flag = 0; 194 | _map[j] = uchar(1); 195 | continue; 196 | __ocv_canny_push: 197 | if (!prev_flag && m > high && _map[j - mapstep] != 2) 198 | { 199 | CANNY_PUSH(_map + j); 200 | prev_flag = 1; 201 | } 202 | else 203 | _map[j] = 0; 204 | } 205 | 206 | // scroll the ring buffer 207 | _mag = mag_buf[0]; 208 | mag_buf[0] = mag_buf[1]; 209 | mag_buf[1] = mag_buf[2]; 210 | mag_buf[2] = _mag; 211 | } 212 | 213 | // now track the edges (hysteresis thresholding) 214 | while (stack_top > stack_bottom) 215 | { 216 | uchar* m; 217 | if ((stack_top - stack_bottom) + 8 > maxsize) 218 | { 219 | int sz = (int)(stack_top - stack_bottom); 220 | maxsize = maxsize * 3 / 2; 221 | stack.resize(maxsize); 222 | stack_bottom = &stack[0]; 223 | stack_top = stack_bottom + sz; 224 | } 225 | 226 | CANNY_POP(m); 227 | 228 | if (!m[-1]) CANNY_PUSH(m - 1); 229 | if (!m[1]) CANNY_PUSH(m + 1); 230 | if (!m[-mapstep - 1]) CANNY_PUSH(m - mapstep - 1); 231 | if (!m[-mapstep]) CANNY_PUSH(m - mapstep); 232 | if (!m[-mapstep + 1]) CANNY_PUSH(m - mapstep + 1); 233 | if (!m[mapstep - 1]) CANNY_PUSH(m + mapstep - 1); 234 | if (!m[mapstep]) CANNY_PUSH(m + mapstep); 235 | if (!m[mapstep + 1]) CANNY_PUSH(m + mapstep + 1); 236 | } 237 | 238 | // the final pass, form the final image 239 | const uchar* pmap = map + mapstep + 1; 240 | uchar* pdst = dst.ptr(); 241 | for (int i = 0; i < src.rows; i++, pmap += mapstep, pdst += dst.step) 242 | { 243 | for (int j = 0; j < src.cols; j++) 244 | pdst[j] = (uchar)-(pmap[j] >> 1); 245 | } 246 | } 247 | 248 | 
static inline double getAmplitude(Mat &dx, Mat &dy, int i, int j) 249 | { 250 | Point2d mag(dx.at<short>(i, j), dy.at<short>(i, j)); 251 | return norm(mag); 252 | } 253 | 254 | static inline void getMagNeighbourhood(Mat &dx, Mat &dy, Point &p, int w, int h, vector<double> &mag) 255 | { 256 | int top = p.y - 1 >= 0 ? p.y - 1 : p.y; 257 | int down = p.y + 1 < h ? p.y + 1 : p.y; 258 | int left = p.x - 1 >= 0 ? p.x - 1 : p.x; 259 | int right = p.x + 1 < w ? p.x + 1 : p.x; 260 | 261 | mag[0] = getAmplitude(dx, dy, top, left); 262 | mag[1] = getAmplitude(dx, dy, top, p.x); 263 | mag[2] = getAmplitude(dx, dy, top, right); 264 | mag[3] = getAmplitude(dx, dy, p.y, left); 265 | mag[4] = getAmplitude(dx, dy, p.y, p.x); 266 | mag[5] = getAmplitude(dx, dy, p.y, right); 267 | mag[6] = getAmplitude(dx, dy, down, left); 268 | mag[7] = getAmplitude(dx, dy, down, p.x); 269 | mag[8] = getAmplitude(dx, dy, down, right); 270 | } 271 | 272 | static inline void get2ndFacetModelIn3x3(vector<double> &mag, vector<double> &a) 273 | { 274 | a[0] = (-mag[0] + 2.0 * mag[1] - mag[2] + 2.0 * mag[3] + 5.0 * mag[4] + 2.0 * mag[5] - mag[6] + 2.0 * mag[7] - mag[8]) / 9.0; 275 | a[1] = (-mag[0] + mag[2] - mag[3] + mag[5] - mag[6] + mag[8]) / 6.0; 276 | a[2] = (mag[6] + mag[7] + mag[8] - mag[0] - mag[1] - mag[2]) / 6.0; 277 | a[3] = (mag[0] - 2.0 * mag[1] + mag[2] + mag[3] - 2.0 * mag[4] + mag[5] + mag[6] - 2.0 * mag[7] + mag[8]) / 6.0; 278 | a[4] = (-mag[0] + mag[2] + mag[6] - mag[8]) / 4.0; 279 | a[5] = (mag[0] + mag[1] + mag[2] - 2.0 * (mag[3] + mag[4] + mag[5]) + mag[6] + mag[7] + mag[8]) / 6.0; 280 | } 281 | /* 282 | Compute the eigenvalues and eigenvectors of the Hessian matrix given by 283 | dfdrr, dfdrc, and dfdcc, and sort them in descending order according to 284 | their absolute values.
285 | */ 286 | static inline void eigenvals(vector<double> &a, double eigval[2], double eigvec[2][2]) 287 | { 288 | // derivatives 289 | // fx = a[1], fy = a[2] 290 | // fxy = a[4] 291 | // fxx = 2 * a[3] 292 | // fyy = 2 * a[5] 293 | double dfdrc = a[4]; 294 | double dfdcc = a[3] * 2.0; 295 | double dfdrr = a[5] * 2.0; 296 | double theta, t, c, s, e1, e2, n1, n2; /* , phi; */ 297 | 298 | /* Compute the eigenvalues and eigenvectors of the Hessian matrix. */ 299 | if (dfdrc != 0.0) { 300 | theta = 0.5*(dfdcc - dfdrr) / dfdrc; 301 | t = 1.0 / (fabs(theta) + sqrt(theta*theta + 1.0)); 302 | if (theta < 0.0) t = -t; 303 | c = 1.0 / sqrt(t*t + 1.0); 304 | s = t*c; 305 | e1 = dfdrr - t*dfdrc; 306 | e2 = dfdcc + t*dfdrc; 307 | } 308 | else { 309 | c = 1.0; 310 | s = 0.0; 311 | e1 = dfdrr; 312 | e2 = dfdcc; 313 | } 314 | n1 = c; 315 | n2 = -s; 316 | 317 | /* If the absolute value of an eigenvalue is larger than the other, put that 318 | eigenvalue into first position. If both are of equal absolute value, put 319 | the negative one first.
*/ 320 | if (fabs(e1) > fabs(e2)) { 321 | eigval[0] = e1; 322 | eigval[1] = e2; 323 | eigvec[0][0] = n1; 324 | eigvec[0][1] = n2; 325 | eigvec[1][0] = -n2; 326 | eigvec[1][1] = n1; 327 | } 328 | else if (fabs(e1) < fabs(e2)) { 329 | eigval[0] = e2; 330 | eigval[1] = e1; 331 | eigvec[0][0] = -n2; 332 | eigvec[0][1] = n1; 333 | eigvec[1][0] = n1; 334 | eigvec[1][1] = n2; 335 | } 336 | else { 337 | if (e1 < e2) { 338 | eigval[0] = e1; 339 | eigval[1] = e2; 340 | eigvec[0][0] = n1; 341 | eigvec[0][1] = n2; 342 | eigvec[1][0] = -n2; 343 | eigvec[1][1] = n1; 344 | } 345 | else { 346 | eigval[0] = e2; 347 | eigval[1] = e1; 348 | eigvec[0][0] = -n2; 349 | eigvec[0][1] = n1; 350 | eigvec[1][0] = n1; 351 | eigvec[1][1] = n2; 352 | } 353 | } 354 | } 355 | 356 | // end https://github.com/songyuncen/EdgesSubPix/blob/master/EdgesSubPix.cpp 357 | 358 | template <typename T> 359 | T pow2(const T& in){return in*in;} 360 | 361 | void Scene_edge::init_Scene_edge_cpu(cv::Mat img, std::vector<::Vec2f> &pcd_buffer, 362 | std::vector<::Vec2f>& normal_buffer, float max_dist_diff) 363 | { 364 | width = img.cols; 365 | height = img.rows; 366 | this->max_dist_diff = max_dist_diff; 367 | 368 | cv::Mat gray; 369 | if(img.channels() > 1){ 370 | cv::cvtColor(img, gray, cv::COLOR_BGR2GRAY); 371 | }else{ 372 | gray = img; 373 | } 374 | 375 | double alpha = 1; 376 | int low = 30; 377 | int high = 60; 378 | 379 | Mat blur; 380 | GaussianBlur(gray, blur, Size(5, 5), alpha, alpha); 381 | 382 | Mat d; 383 | getCannyKernel(d, alpha); 384 | Mat one = Mat::ones(Size(1, 1), CV_16S); 385 | Mat dx, dy; 386 | sepFilter2D(blur, dx, CV_16S, d, one); 387 | sepFilter2D(blur, dy, CV_16S, one, d); 388 | 389 | // non-maximum suppression & hysteresis threshold 390 | Mat edge = Mat::zeros(gray.size(), CV_8UC1); 391 | int lowThresh = cvRound(scale * low); 392 | int highThresh = cvRound(scale * high); 393 | postCannyFilter(gray, dx, dy, lowThresh, highThresh, edge); 394 | 395 | // cv::imshow("edge", edge); 396 | // cv::waitKey(0);
397 | 398 | normal_buffer.clear(); 399 | normal_buffer.resize(img.rows * img.cols); 400 | 401 | pcd_buffer.clear(); 402 | pcd_buffer.resize(img.rows * img.cols, ::Vec2f(-1, -1)); // -1 indicates no edge around 403 | 404 | std::vector<::Vec2f> pcd_buffer_sub = pcd_buffer; 405 | 406 | for(int r=0; r<img.rows; r++){ 407 | for(int c=0; c<img.cols; c++){ 408 | if(edge.at<uchar>(r, c) > 0){ // get normals & pcds at edge only 409 | 410 | int w = dx.cols; 411 | int h = dx.rows; 412 | Point icontour = {c, r}; 413 | 414 | vector<double> magNeighbour(9); 415 | getMagNeighbourhood(dx, dy, icontour, w, h, magNeighbour); 416 | vector<double> a(9); 417 | get2ndFacetModelIn3x3(magNeighbour, a); 418 | 419 | // Hessian eigen vector 420 | double eigvec[2][2], eigval[2]; 421 | eigenvals(a, eigval, eigvec); 422 | double t = 0.0; 423 | double ny = eigvec[0][0]; 424 | double nx = eigvec[0][1]; 425 | if (eigval[0] < 0.0) 426 | { 427 | double rx = a[1], ry = a[2], rxy = a[4], rxx = a[3] * 2.0, ryy = a[5] * 2.0; 428 | t = -(rx * nx + ry * ny) / (rxx * nx * nx + 2.0 * rxy * nx * ny + ryy * ny * ny); 429 | } 430 | double px = nx * t; 431 | double py = ny * t; 432 | float x = (float)icontour.x; 433 | float y = (float)icontour.y; 434 | if (fabs(px) <= 0.5 && fabs(py) <= 0.5) 435 | { 436 | x += (float)px; 437 | y += (float)py; 438 | } 439 | 440 | normal_buffer[c + r*img.cols] = {float(nx), float(-ny)}; 441 | pcd_buffer_sub[c + r*img.cols] = {x, y}; 442 | } 443 | } 444 | } 445 | // get pcd, dilate to neighbors 446 | { 447 | // may pad to divide and parallelize 448 | cv::Mat dist_buffer(img.size(), CV_32FC1, FLT_MAX); 449 | int kernel_size = int(max_dist_diff+0.5f); 450 | for(int r=0+kernel_size; r<img.rows-kernel_size; r++){ 451 | for(int c=0+kernel_size; c<img.cols-kernel_size; c++){ 452 | 453 | if(edge.at<uchar>(r, c) > 0){ 454 | auto pcd = pcd_buffer_sub[c + r*img.cols]; 455 | for(int i=-kernel_size; i<=kernel_size; i++){ 456 | for(int j=-kernel_size; j<=kernel_size; j++){ 457 | 458 | float dist_sq = pow2(i) + pow2(j); 459 | // float dist_sq = pow2(j-(pcd.x-c)) + pow2(i-(pcd.y-r)); // this is better?
460 | // don't go too far 461 | if(dist_sq > pow2(max_dist_diff)) continue; 462 | 463 | int new_r = r + i; 464 | int new_c = c + j; 465 | 466 | // if closer 467 | if(dist_sq < dist_buffer.at<float>(new_r, new_c)){ 468 | pcd_buffer[new_c + new_r*img.cols] = pcd; 469 | dist_buffer.at<float>(new_r, new_c) = dist_sq; 470 | } 471 | } 472 | } 473 | } 474 | } 475 | } 476 | } 477 | 478 | pcd_ptr = pcd_buffer.data(); 479 | normal_ptr = normal_buffer.data(); 480 | } 481 | 482 | 483 | -------------------------------------------------------------------------------- /shape_based_matching-subpixel/shape_based_matching-subpixel/cuda_icp/scene/edge_scene/edge_scene.cu: -------------------------------------------------------------------------------- 1 | #include "edge_scene.h" 2 | 3 | void Scene_edge::init_Scene_edge_cuda(cv::Mat img, device_vector_holder<::Vec2f> &pcd_buffer, 4 | device_vector_holder<::Vec2f>& normal_buffer, float max_dist_diff) 5 | { 6 | std::vector<::Vec2f> pcd_buffer_host, normal_buffer_host; 7 | 8 | init_Scene_edge_cpu(img, pcd_buffer_host, normal_buffer_host, max_dist_diff); 9 | 10 | pcd_buffer.__malloc(pcd_buffer_host.size()); 11 | thrust::copy(pcd_buffer_host.begin(), pcd_buffer_host.end(), pcd_buffer.begin_thr()); 12 | 13 | normal_buffer.__malloc(normal_buffer_host.size()); 14 | thrust::copy(normal_buffer_host.begin(), normal_buffer_host.end(), normal_buffer.begin_thr()); 15 | 16 | pcd_ptr = pcd_buffer.data(); 17 | normal_ptr = normal_buffer.data(); 18 | } 19 | -------------------------------------------------------------------------------- /shape_based_matching-subpixel/shape_based_matching-subpixel/cuda_icp/scene/edge_scene/edge_scene.h: -------------------------------------------------------------------------------- 1 | #pragma once 2 | 3 | #include "../common.h" 4 | 5 | // frame of scene edge 6 | // o -------> x 7 | // | 8 | // | 9 | // | 10 | // V 11 | // y 12 | 13 | 14 | // just implement query func 15 | struct Scene_edge 16 | { 17 | size_t width = 640, height = 480; 18 | float
max_dist_diff = 4.0f; // pixels 19 | ::Vec2f* pcd_ptr; // pointer can unify cpu & cuda version 20 | ::Vec2f* normal_ptr; // layout: 1d, width*height length, array of Vec2f 21 | 22 | // buffers are provided by the user; this class only holds pointers, 23 | // because we will pass them to device. 24 | void init_Scene_edge_cpu(cv::Mat img, std::vector<::Vec2f>& pcd_buffer, 25 | std::vector<::Vec2f>& normal_buffer, float max_dist_diff = 4.0f); 26 | 27 | #ifdef CUDA_ON 28 | void init_Scene_edge_cuda(cv::Mat img, device_vector_holder<::Vec2f>& pcd_buffer, 29 | device_vector_holder<::Vec2f>& normal_buffer, float max_dist_diff = 4.0f); 30 | #endif 31 | 32 | __device__ __host__ 33 | void query(const ::Vec2f& src_pcd, ::Vec2f& dst_pcd, ::Vec2f& dst_normal, bool& valid) const { 34 | 35 | size_t x,y; 36 | x = size_t(src_pcd.x + 0.5f); 37 | y = size_t(src_pcd.y + 0.5f); 38 | 39 | if(x >= width || y >= height){ 40 | valid = false; 41 | return; 42 | } 43 | 44 | size_t idx = x + y * width; 45 | if(pcd_ptr[idx].x >= 0){ 46 | 47 | dst_pcd = pcd_ptr[idx]; 48 | 49 | idx = size_t(dst_pcd.x) + size_t(dst_pcd.y) * width; 50 | dst_normal = normal_ptr[idx]; 51 | 52 | valid = true; 53 | 54 | }else valid = false; 55 | 56 | return; 57 | } 58 | }; 59 | -------------------------------------------------------------------------------- /shape_based_matching-subpixel/shape_based_matching-subpixel/demo.ini: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/daxiaHuang/shape_based_matching_subpixel/6955d0b3e785716297ede4fbd6102283a0ab4043/shape_based_matching-subpixel/shape_based_matching-subpixel/demo.ini -------------------------------------------------------------------------------- /shape_based_matching-subpixel/shape_based_matching-subpixel/detector.cpp: --------------------------------------------------------------------------------
https://raw.githubusercontent.com/daxiaHuang/shape_based_matching_subpixel/6955d0b3e785716297ede4fbd6102283a0ab4043/shape_based_matching-subpixel/shape_based_matching-subpixel/detector.cpp -------------------------------------------------------------------------------- /shape_based_matching-subpixel/shape_based_matching-subpixel/detector.h: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/daxiaHuang/shape_based_matching_subpixel/6955d0b3e785716297ede4fbd6102283a0ab4043/shape_based_matching-subpixel/shape_based_matching-subpixel/detector.h -------------------------------------------------------------------------------- /shape_based_matching-subpixel/shape_based_matching-subpixel/line2Dup.cpp: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/daxiaHuang/shape_based_matching_subpixel/6955d0b3e785716297ede4fbd6102283a0ab4043/shape_based_matching-subpixel/shape_based_matching-subpixel/line2Dup.cpp -------------------------------------------------------------------------------- /shape_based_matching-subpixel/shape_based_matching-subpixel/line2Dup.h: -------------------------------------------------------------------------------- 1 | #ifndef CXXLINEMOD_H 2 | #define CXXLINEMOD_H 3 | #include 4 | #include 5 | #include 6 | #include 7 | 8 | #include ".\\MIPP\mipp.h" // for SIMD in different platforms 9 | 10 | namespace line2Dup 11 | { 12 | 13 | struct Feature 14 | { 15 | int x; 16 | int y; 17 | int label; 18 | 19 | void read(const cv::FileNode &fn); 20 | void write(cv::FileStorage &fs) const; 21 | 22 | Feature() : x(0), y(0), label(0) {} 23 | Feature(int x, int y, int label); 24 | }; 25 | inline Feature::Feature(int _x, int _y, int _label) : x(_x), y(_y), label(_label) {} 26 | 27 | struct Template 28 | { 29 | int width; 30 | int height; 31 | int tl_x; 32 | int tl_y; 33 | int pyramid_level; 34 | std::vector<Feature> features; 35 | 36 | void read(const
cv::FileNode &fn); 37 | void write(cv::FileStorage &fs) const; 38 | }; 39 | 40 | class ColorGradientPyramid 41 | { 42 | public: 43 | ColorGradientPyramid(const cv::Mat &src, const cv::Mat &mask, 44 | float weak_threshold, size_t num_features, 45 | float strong_threshold); 46 | 47 | void quantize(cv::Mat &dst) const; 48 | 49 | bool extractTemplate(Template &templ) const; 50 | 51 | void pyrDown(); 52 | 53 | public: 54 | void update(); 55 | /// Candidate feature with a score 56 | struct Candidate 57 | { 58 | Candidate(int x, int y, int label, float score); 59 | 60 | /// Sort candidates with high score to the front 61 | bool operator<(const Candidate &rhs) const 62 | { 63 | return score > rhs.score; 64 | } 65 | 66 | Feature f; 67 | float score; 68 | }; 69 | 70 | cv::Mat src; 71 | cv::Mat mask; 72 | 73 | int pyramid_level; 74 | cv::Mat angle; 75 | cv::Mat magnitude; 76 | 77 | float weak_threshold; 78 | size_t num_features; 79 | float strong_threshold; 80 | static bool selectScatteredFeatures(const std::vector<Candidate> &candidates, 81 | std::vector<Feature> &features, 82 | size_t num_features, float distance); 83 | }; 84 | inline ColorGradientPyramid::Candidate::Candidate(int x, int y, int label, float _score) : f(x, y, label), score(_score) {} 85 | 86 | class ColorGradient 87 | { 88 | public: 89 | ColorGradient(); 90 | ColorGradient(float weak_threshold, size_t num_features, float strong_threshold); 91 | 92 | std::string name() const; 93 | 94 | float weak_threshold; 95 | size_t num_features; 96 | float strong_threshold; 97 | void read(const cv::FileNode &fn); 98 | void write(cv::FileStorage &fs) const; 99 | 100 | cv::Ptr<ColorGradientPyramid> process(const cv::Mat src, const cv::Mat &mask = cv::Mat()) const 101 | { 102 | return cv::makePtr<ColorGradientPyramid>(src, mask, weak_threshold, num_features, strong_threshold); 103 | } 104 | }; 105 | 106 | struct Match 107 | { 108 | Match() 109 | { 110 | } 111 | 112 | Match(int x, int y, float similarity, const std::string &class_id, int template_id); 113 | 114 | /// Sort matches with
high similarity to the front 115 | bool operator<(const Match &rhs) const 116 | { 117 | // Secondarily sort on template_id for the sake of duplicate removal 118 | if (similarity != rhs.similarity) 119 | return similarity > rhs.similarity; 120 | else 121 | return template_id < rhs.template_id; 122 | } 123 | 124 | bool operator==(const Match &rhs) const 125 | { 126 | return x == rhs.x && y == rhs.y && similarity == rhs.similarity && class_id == rhs.class_id; 127 | } 128 | 129 | int x; 130 | int y; 131 | float similarity; 132 | std::string class_id; 133 | int template_id; 134 | }; 135 | 136 | inline Match::Match(int _x, int _y, float _similarity, const std::string &_class_id, int _template_id) 137 | : x(_x), y(_y), similarity(_similarity), class_id(_class_id), template_id(_template_id) 138 | { 139 | } 140 | 141 | class Detector 142 | { 143 | public: 144 | /** 145 | * \brief Empty constructor, initialize with read(). 146 | */ 147 | Detector(); 148 | 149 | Detector(std::vector<int> T); 150 | Detector(int num_features, std::vector<int> T, float weak_thresh = 30.0f, float strong_thresh = 60.0f); 151 | 152 | std::vector<Match> match(cv::Mat sources, float threshold, 153 | const std::vector<std::string> &class_ids = std::vector<std::string>(), 154 | const cv::Mat masks = cv::Mat()) const; 155 | 156 | int addTemplate(const cv::Mat sources, const std::string &class_id, 157 | const cv::Mat &object_mask, int num_features = 0); 158 | 159 | const cv::Ptr<ColorGradient> &getModalities() const { return modality; } 160 | 161 | int getT(int pyramid_level) const { return T_at_level[pyramid_level]; } 162 | 163 | int pyramidLevels() const { return pyramid_levels; } 164 | 165 | const std::vector