-
Notifications
You must be signed in to change notification settings - Fork 85
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
TVM RPC Error "PLS isn't existed" on Khadas VIM3 Pro (Amlogic A311D) #189
Comments
It seems that the system is not able to locate runtime compiler header file cl_viv_vx_ext.h. You can either copy this file over to current run directory, or set VIVANTE_SDK_DIR to point to the location which contains this header file (include/CL/cl_viv_vx_ext.h).
|
@thezha Thanks for your reply. export VIVANTE_SDK_DIR=/usr And ran again TIM_VX unit_test, get some errors without "Cannot find the header file cl_viv_vx_ext.h." error. The following is full output: Read More
Running main() from /home/khadas/TIM-VX-1.1.32/_deps/googletest-src/googletest/src/gtest_main.cc
[==========] Running 104 tests from 33 test suites.
[----------] Global test environment set-up.
[----------] 1 test from Context
[ RUN ] Context.create
[ OK ] Context.create (43 ms)
[----------] 1 test from Context (43 ms total)
[----------] 2 tests from graph
[ RUN ] graph.gen_binary_graph_with_empty_graph
E [_graph_optimization_convert_int8_to_uint8:792]CHECK STATUS(-1:A generic error code, used when no other describes the error.)
E [vsi_nn_OptimizeGraph:827]CHECK STATUS(-1:A generic error code, used when no other describes the error.)
[ OK ] graph.gen_binary_graph_with_empty_graph (6 ms)
[ RUN ] graph.gen_binary_graph_with_simple_add
[ OK ] graph.gen_binary_graph_with_simple_add (20 ms)
[----------] 2 tests from graph (26 ms total)
[----------] 2 tests from Linear
[ RUN ] Linear.shape_5_1_fp32
[ OK ] Linear.shape_5_1_fp32 (7 ms)
[ RUN ] Linear.shape_5_1_fp32_omit_b
[ OK ] Linear.shape_5_1_fp32_omit_b (5 ms)
[----------] 2 tests from Linear (13 ms total)
[----------] 3 tests from Conv1d
[ RUN ] Conv1d.shape_3_6_1_float_ksize_1_stride_1_weights_3_no_bias_whcn
[ OK ] Conv1d.shape_3_6_1_float_ksize_1_stride_1_weights_3_no_bias_whcn (14 ms)
[ RUN ] Conv1d.shape_6_2_1_uint8_ksize_6_stride_1_weights_2_whcn
[ OK ] Conv1d.shape_6_2_1_uint8_ksize_6_stride_1_weights_2_whcn (7 ms)
[ RUN ] Conv1d.shape_6_2_1_uint8_ksize_3_stride_1_pad_1_weights_2_no_bias_whcn
[ OK ] Conv1d.shape_6_2_1_uint8_ksize_3_stride_1_pad_1_weights_2_no_bias_whcn (6 ms)
[----------] 3 tests from Conv1d (27 ms total)
[----------] 19 tests from Conv2d
[ RUN ] Conv2d.shape_4_2_1_1_float32_PaddingTest
[ OK ] Conv2d.shape_4_2_1_1_float32_PaddingTest (17 ms)
[ RUN ] Conv2d.shape_4_2_2_2_float32_PointwiseTest
[ OK ] Conv2d.shape_4_2_2_2_float32_PointwiseTest (16 ms)
[ RUN ] Conv2d.shape_4_2_1_2_float32_SimpleTest
[ OK ] Conv2d.shape_4_2_1_2_float32_SimpleTest (12 ms)
[ RUN ] Conv2d.shape_4_2_2_2_float32_SimpleChannelsTest
[ OK ] Conv2d.shape_4_2_2_2_float32_SimpleChannelsTest (11 ms)
[ RUN ] Conv2d.shape_6_3_1_1_float32_SimpleAnisotropicStridesTest
[ OK ] Conv2d.shape_6_3_1_1_float32_SimpleAnisotropicStridesTest (11 ms)
[ RUN ] Conv2d.shape_4_3_1_1_float32_HandCalculatedTest
[ OK ] Conv2d.shape_4_3_1_1_float32_HandCalculatedTest (12 ms)
[ RUN ] Conv2d.shape_4_3_1_1_float32_HandCalculatedConstFilterTest
[ OK ] Conv2d.shape_4_3_1_1_float32_HandCalculatedConstFilterTest (12 ms)
[ RUN ] Conv2d.shape_4_3_1_1_float32_HandCalculatedBiasTest
[ OK ] Conv2d.shape_4_3_1_1_float32_HandCalculatedBiasTest (12 ms)
[ RUN ] Conv2d.shape_4_3_1_1_float32_HandCalculatedValidTest
[ OK ] Conv2d.shape_4_3_1_1_float32_HandCalculatedValidTest (12 ms)
[ RUN ] Conv2d.shape_4_2_2_2_float32_DisabledPointwiseMultifilterTest
[ OK ] Conv2d.shape_4_2_2_2_float32_DisabledPointwiseMultifilterTest (9 ms)
[ RUN ] Conv2d.shape_9_9_1_1_float32_SimpleDilationTest
[ OK ] Conv2d.shape_9_9_1_1_float32_SimpleDilationTest (12 ms)
[ RUN ] Conv2d.shape_4_2_1_2_float32_StrideTest
[ OK ] Conv2d.shape_4_2_1_2_float32_StrideTest (12 ms)
[ RUN ] Conv2d.shape_4_2_1_2_float32_InputAndFilterSameWidthHeightTest
[ OK ] Conv2d.shape_4_2_1_2_float32_InputAndFilterSameWidthHeightTest (8 ms)
[ RUN ] Conv2d.shape_4_2_1_2_uint8_QuantizedTest1
[ OK ] Conv2d.shape_4_2_1_2_uint8_QuantizedTest1 (6 ms)
[ RUN ] Conv2d.shape_4_2_1_2_uint8_QuantizedTest2
[ OK ] Conv2d.shape_4_2_1_2_uint8_QuantizedTest2 (6 ms)
[ RUN ] Conv2d.shape_6_3_1_1_uint8_AnisotropicStridesQuantizedTest
[ OK ] Conv2d.shape_6_3_1_1_uint8_AnisotropicStridesQuantizedTest (6 ms)
[ RUN ] Conv2d.shape_9_9_1_1_uint8_DilationQuantizedTest
[ OK ] Conv2d.shape_9_9_1_1_uint8_DilationQuantizedTest (6 ms)
[ RUN ] Conv2d.shape_3_2_2_1_int8_QuantizedPerTensorTest
[ OK ] Conv2d.shape_3_2_2_1_int8_QuantizedPerTensorTest (19 ms)
[ RUN ] Conv2d.shape_3_2_2_1_int8_QuantizedPerChannelTest
[ OK ] Conv2d.shape_3_2_2_1_int8_QuantizedPerChannelTest (12 ms)
[----------] 19 tests from Conv2d (213 ms total)
[----------] 2 tests from DeConv1d
[ RUN ] DeConv1d.no_bias_layout_whcn_depthwise_shape_3_2_1
/home/khadas/TIM-VX-1.1.32/src/tim/vx/ops/deconv1d_test.cc:69: Failure
Expected equality of these values:
golden
Which is: { 27, 81, 30, 9, 3, 21, 15, 27, 0, 0 }
output_data
Which is: { 48, 96, 57, 9, 3, 0, 0, 0, 0, 0 }
Result mismatch
[ FAILED ] DeConv1d.no_bias_layout_whcn_depthwise_shape_3_2_1 (9 ms)
[ RUN ] DeConv1d.layout_whcn_shape_3_1_1
[ OK ] DeConv1d.layout_whcn_shape_3_1_1 (92 ms)
[----------] 2 tests from DeConv1d (101 ms total)
[----------] 2 tests from DeConv2d
[ RUN ] DeConv2d.shape_3_3_2_1_float_depthwise
/home/khadas/TIM-VX-1.1.32/src/tim/vx/ops/deconv2d_test.cc:85: Failure
Expected equality of these values:
golden
Which is: { 27, 72, 18, 24, 3, 81, 45, 90, 15, 21, 30, 26, 43, 22, 11, 9, 5, 25, 10, 14, 3, 2, 9, 4, 6, 21, 27, 52, 63, 7, 15, 6, ... }
output_data
Which is: { 48, 99, 70, 87, 10, 96, 51, 134, 29, 42, 57, 26, 168, 94, 33, 9, 5, 65, 26, 38, 3, 2, 81, 4, 22, 0, 0, 0, 0, 0, 0, 0, ... }
Result mismatch
[ FAILED ] DeConv2d.shape_3_3_2_1_float_depthwise (9 ms)
[ RUN ] DeConv2d.shape_3_3_1_1_float
[ OK ] DeConv2d.shape_3_3_1_1_float (9 ms)
[----------] 2 tests from DeConv2d (18 ms total)
[----------] 16 tests from DepthwiseConv
[ RUN ] DepthwiseConv.shape_2_3_2_1_float32_SimpleTest
[ OK ] DepthwiseConv.shape_2_3_2_1_float32_SimpleTest (19 ms)
[ RUN ] DepthwiseConv.shape_2_3_2_1_float32_StrideValidTest
[ OK ] DepthwiseConv.shape_2_3_2_1_float32_StrideValidTest (12 ms)
[ RUN ] DepthwiseConv.shape_2_3_2_1_float32_StrideSameTest
[ OK ] DepthwiseConv.shape_2_3_2_1_float32_StrideSameTest (11 ms)
[ RUN ] DepthwiseConv.shape_2_3_2_1_float32_StrideSameDilationTest
[ OK ] DepthwiseConv.shape_2_3_2_1_float32_StrideSameDilationTest (11 ms)
[ RUN ] DepthwiseConv.shape_2_3_2_1_float32_PaddingTest
[ OK ] DepthwiseConv.shape_2_3_2_1_float32_PaddingTest (12 ms)
[ RUN ] DepthwiseConv.shape_9_9_1_1_float32_DilationValidTest
[ OK ] DepthwiseConv.shape_9_9_1_1_float32_DilationValidTest (11 ms)
[ RUN ] DepthwiseConv.shape_3_3_1_1_float32_DilationSameTest
[ OK ] DepthwiseConv.shape_3_3_1_1_float32_DilationSameTest (12 ms)
[ RUN ] DepthwiseConv.shape_3_3_4_2_float32_BatchValidTest
[ OK ] DepthwiseConv.shape_3_3_4_2_float32_BatchValidTest (11 ms)
[ RUN ] DepthwiseConv.shape_2_2_1_4_float32_BatchSameTest
[ OK ] DepthwiseConv.shape_2_2_1_4_float32_BatchSameTest (12 ms)
[ RUN ] DepthwiseConv.shape_2_3_2_1_uint8_QuantizedTest
[ OK ] DepthwiseConv.shape_2_3_2_1_uint8_QuantizedTest (6 ms)
[ RUN ] DepthwiseConv.shape_9_9_1_1_uint8_QuantizedDilationdValidTest
[ OK ] DepthwiseConv.shape_9_9_1_1_uint8_QuantizedDilationdValidTest (6 ms)
[ RUN ] DepthwiseConv.shape_3_3_1_1_uint8_QuantizedDilationdSameTest
[ OK ] DepthwiseConv.shape_3_3_1_1_uint8_QuantizedDilationdSameTest (6 ms)
[ RUN ] DepthwiseConv.shape_3_2_2_1_int8_PerTensorTest
[ OK ] DepthwiseConv.shape_3_2_2_1_int8_PerTensorTest (13 ms)
[ RUN ] DepthwiseConv.shape_3_2_2_1_int8_PerAxisTest
[ OK ] DepthwiseConv.shape_3_2_2_1_int8_PerAxisTest (12 ms)
[ RUN ] DepthwiseConv.shape_3_3_8_1_int8_PerChannelValidTest
[ OK ] DepthwiseConv.shape_3_3_8_1_int8_PerChannelValidTest (12 ms)
[ RUN ] DepthwiseConv.shape_3_3_8_1_int8_PerChannelSameTest
[ OK ] DepthwiseConv.shape_3_3_8_1_int8_PerChannelSameTest (13 ms)
[----------] 16 tests from DepthwiseConv (181 ms total)
[----------] 3 tests from FloorDiv
[ RUN ] FloorDiv.shape_1_fp32
[ OK ] FloorDiv.shape_1_fp32 (69 ms)
[ RUN ] FloorDiv.shape_5_1_broadcast_float32
[ OK ] FloorDiv.shape_5_1_broadcast_float32 (38 ms)
[ RUN ] FloorDiv.shape_5_1_broadcast_uint8
[ OK ] FloorDiv.shape_5_1_broadcast_uint8 (256 ms)
[----------] 3 tests from FloorDiv (364 ms total)
[----------] 3 tests from GroupedConv2d
[ RUN ] GroupedConv2d.shape_3_3_6_1_float_group_1_no_bias_whcn
[ OK ] GroupedConv2d.shape_3_3_6_1_float_group_1_no_bias_whcn (7 ms)
[ RUN ] GroupedConv2d.shape_3_3_6_1_float_group_2_whcn
[ OK ] GroupedConv2d.shape_3_3_6_1_float_group_2_whcn (7 ms)
[ RUN ] GroupedConv2d.shape_3_3_6_1_uint8_group_6_whcn
[ OK ] GroupedConv2d.shape_3_3_6_1_uint8_group_6_whcn (15 ms)
[----------] 3 tests from GroupedConv2d (29 ms total)
[----------] 2 tests from InstanceNorm
[ RUN ] InstanceNorm.shape_3_6_1_float
[ OK ] InstanceNorm.shape_3_6_1_float (125 ms)
[ RUN ] InstanceNorm.shape_3_3_6_1_float
[ OK ] InstanceNorm.shape_3_3_6_1_float (80 ms)
[----------] 2 tests from InstanceNorm (205 ms total)
[----------] 2 tests from LayerNorm
[ RUN ] LayerNorm.axis_0_shape_3_6_1_float
[ OK ] LayerNorm.axis_0_shape_3_6_1_float (60 ms)
[ RUN ] LayerNorm.axis_0_shape_2_3_6_1_float
[ OK ] LayerNorm.axis_0_shape_2_3_6_1_float (58 ms)
[----------] 2 tests from LayerNorm (118 ms total)
[----------] 3 tests from LogSoftmax
[ RUN ] LogSoftmax.shape_6_1_float_axis_0
[ OK ] LogSoftmax.shape_6_1_float_axis_0 (123 ms)
[ RUN ] LogSoftmax.shape_3_6_1_float_axis_1
[ OK ] LogSoftmax.shape_3_6_1_float_axis_1 (48 ms)
[ RUN ] LogSoftmax.shape_3_6_1_uint8_axis_1
[ OK ] LogSoftmax.shape_3_6_1_uint8_axis_1 (958 ms)
[----------] 3 tests from LogSoftmax (1129 ms total)
[----------] 3 tests from Matmul
[ RUN ] Matmul.shape_2_6_shape_6_2_float
[ OK ] Matmul.shape_2_6_shape_6_2_float (38 ms)
[ RUN ] Matmul.shape_2_3_2_shape_2_3_2_float_transpose_b
[ OK ] Matmul.shape_2_3_2_shape_2_3_2_float_transpose_b (42 ms)
[ RUN ] Matmul.shape_2_3_2_shape_2_3_2_uint8_transpose_a
[ OK ] Matmul.shape_2_3_2_shape_2_3_2_uint8_transpose_a (169 ms)
[----------] 3 tests from Matmul (249 ms total)
[----------] 2 tests from MaxpoolWithArgmax
[ RUN ] MaxpoolWithArgmax.shape_3_3_1_fp32_kernel_2_stride_2
[ OK ] MaxpoolWithArgmax.shape_3_3_1_fp32_kernel_2_stride_2 (49 ms)
[ RUN ] MaxpoolWithArgmax.shape_4_4_1_uint8_kernel_2_stride_2
[ OK ] MaxpoolWithArgmax.shape_4_4_1_uint8_kernel_2_stride_2 (124 ms)
[----------] 2 tests from MaxpoolWithArgmax (173 ms total)
[----------] 2 tests from MaxUnpool2d
[ RUN ] MaxUnpool2d.shape_2_2_1_fp32_kernel_2_stride_2
[ OK ] MaxUnpool2d.shape_2_2_1_fp32_kernel_2_stride_2 (52 ms)
[ RUN ] MaxUnpool2d.shape_2_2_1_uint8_kernel_2_stride_2
[ OK ] MaxUnpool2d.shape_2_2_1_uint8_kernel_2_stride_2 (150 ms)
[----------] 2 tests from MaxUnpool2d (202 ms total)
[----------] 2 tests from Moments
[ RUN ] Moments.shape_6_3_1_float_axes_0_1
[ OK ] Moments.shape_6_3_1_float_axes_0_1 (62 ms)
[ RUN ] Moments.shape_3_6_1_float_axes_1_keepdims
[ OK ] Moments.shape_3_6_1_float_axes_1_keepdims (37 ms)
[----------] 2 tests from Moments (99 ms total)
[----------] 1 test from Equal
[ RUN ] Equal.shape_1_uint8
[ OK ] Equal.shape_1_uint8 (523 ms)
[----------] 1 test from Equal (523 ms total)
[----------] 1 test from NotEqual
[ RUN ] NotEqual.shape_5_fp32
[ OK ] NotEqual.shape_5_fp32 (64 ms)
[----------] 1 test from NotEqual (64 ms total)
[----------] 1 test from Less
[ RUN ] Less.shape_5_1_fp32
[ OK ] Less.shape_5_1_fp32 (62 ms)
[----------] 1 test from Less (63 ms total)
[----------] 1 test from GreaterOrEqual
[ RUN ] GreaterOrEqual.shape_5_2_1_fp32
[ OK ] GreaterOrEqual.shape_5_2_1_fp32 (62 ms)
[----------] 1 test from GreaterOrEqual (63 ms total)
[----------] 1 test from Greater
[ RUN ] Greater.shape_5_2_1_1_fp32
[ OK ] Greater.shape_5_2_1_1_fp32 (62 ms)
[----------] 1 test from Greater (63 ms total)
[----------] 1 test from LessOrEqual
[ RUN ] LessOrEqual.shape_1_5_2_1_1_fp32
[ OK ] LessOrEqual.shape_1_5_2_1_1_fp32 (62 ms)
[----------] 1 test from LessOrEqual (62 ms total)
[----------] 2 tests from Reorg
[ RUN ] Reorg.shape_4_4_4_1_u8
[ OK ] Reorg.shape_4_4_4_1_u8 (6 ms)
[ RUN ] Reorg.shape_4_4_4_1_fp32
[ OK ] Reorg.shape_4_4_4_1_fp32 (6 ms)
[----------] 2 tests from Reorg (12 ms total)
[----------] 3 tests from Resize1d
[ RUN ] Resize1d.shape_4_2_1_float_nearest_whcn
[ OK ] Resize1d.shape_4_2_1_float_nearest_whcn (29 ms)
[ RUN ] Resize1d.shape_4_2_1_uint8_nearest_whcn
[ OK ] Resize1d.shape_4_2_1_uint8_nearest_whcn (100 ms)
[ RUN ] Resize1d.shape_5_1_1_float_bilinear_align_corners_whcn
[ OK ] Resize1d.shape_5_1_1_float_bilinear_align_corners_whcn (35 ms)
[----------] 3 tests from Resize1d (164 ms total)
[----------] 2 tests from ScatterND
[ RUN ] ScatterND.shape_4_4_4
[ OK ] ScatterND.shape_4_4_4 (41 ms)
[ RUN ] ScatterND.shape_9
[ OK ] ScatterND.shape_9 (74 ms)
[----------] 2 tests from ScatterND (115 ms total)
[----------] 1 test from Floor
[ RUN ] Floor.shape_5_1_fp32
[ OK ] Floor.shape_5_1_fp32 (5 ms)
[----------] 1 test from Floor (5 ms total)
[----------] 1 test from Cast
[ RUN ] Cast.shape_5_1_fp32_to_int32
[ OK ] Cast.shape_5_1_fp32_to_int32 (35 ms)
[----------] 1 test from Cast (35 ms total)
[----------] 1 test from SpatialTransformer
[ RUN ] SpatialTransformer.shape_1_3_3_1_u8
[ OK ] SpatialTransformer.shape_1_3_3_1_u8 (138 ms)
[----------] 1 test from SpatialTransformer (139 ms total)
[----------] 2 tests from Tile
[ RUN ] Tile.shape_3_2_float_multiples_2_1
[ OK ] Tile.shape_3_2_float_multiples_2_1 (45 ms)
[ RUN ] Tile.shape_3_2_1_int8_multiples_2_2_1
[ OK ] Tile.shape_3_2_1_int8_multiples_2_2_1 (315 ms)
[----------] 2 tests from Tile (360 ms total)
[----------] 14 tests from TransposeConv2d
[ RUN ] TransposeConv2d.shape_4_4_1_1_float32_SimpleTest
[ OK ] TransposeConv2d.shape_4_4_1_1_float32_SimpleTest (8 ms)
[ RUN ] TransposeConv2d.shape_4_4_2_1_float32_SameTest
[ OK ] TransposeConv2d.shape_4_4_2_1_float32_SameTest (9 ms)
[ RUN ] TransposeConv2d.shape_4_4_2_1_float32_ValidTest
[ OK ] TransposeConv2d.shape_4_4_2_1_float32_ValidTest (8 ms)
[ RUN ] TransposeConv2d.shape_2_2_1_1_float32_StrideTest
[ OK ] TransposeConv2d.shape_2_2_1_1_float32_StrideTest (9 ms)
[ RUN ] TransposeConv2d.shape_2_2_1_1_float32_ChannelTest
[ OK ] TransposeConv2d.shape_2_2_1_1_float32_ChannelTest (9 ms)
[ RUN ] TransposeConv2d.shape_2_1_1_1_float32_AccuracyTest
[ OK ] TransposeConv2d.shape_2_1_1_1_float32_AccuracyTest (9 ms)
[ RUN ] TransposeConv2d.shape_2_2_1_1_float32_BiasChannelTest
[ OK ] TransposeConv2d.shape_2_2_1_1_float32_BiasChannelTest (12 ms)
[ RUN ] TransposeConv2d.shape_4_4_1_1_uint8_QuantizedTest
[ OK ] TransposeConv2d.shape_4_4_1_1_uint8_QuantizedTest (6 ms)
[ RUN ] TransposeConv2d.shape_4_4_2_1_uint8_QuantizedTwoFiltersTest
[ OK ] TransposeConv2d.shape_4_4_2_1_uint8_QuantizedTwoFiltersTest (5 ms)
[ RUN ] TransposeConv2d.shape_4_4_2_1_uint8_QuantizedValidTest
[ OK ] TransposeConv2d.shape_4_4_2_1_uint8_QuantizedValidTest (5 ms)
[ RUN ] TransposeConv2d.shape_4_4_1_1_uint8_QuantizedBiasTest
[ OK ] TransposeConv2d.shape_4_4_1_1_uint8_QuantizedBiasTest (5 ms)
[ RUN ] TransposeConv2d.shape_4_4_1_1_int8_QuantizedPerChannelOneTest
Segmentation fault |
@thezha The Galcore version is 6.4.4.3.310723AAA. Is any relation with TIM-VX version? |
@leokuo725 I recommend that you get the latest driver SDK/galcore from here and push to the device. https://github.com/VeriSilicon/TIM-VX/releases/tag/v1.1.34.fix |
@sunshinemyson I build the TIM-VX v1.1.34.fix. Some errors happened. Read More
[ 94%] Building CXX object src/tim/CMakeFiles/unit_test.dir/vx/ops/activations_test.cc.o
/usr/bin/ld: ../../src/tim/vx/internal/libtim_internal.a(matrixmul_vx.c.o): in function `_matrixmulsetup':
matrixmul_vx.c:(.text+0x120): undefined reference to `vxBatchGemmNode'
collect2: error: ld returned 1 exit status
make[2]: *** [samples/benchmark_test/CMakeFiles/benchmark_test.dir/build.make:100: samples/benchmark_test/benchmark_test] Error 1
make[1]: *** [CMakeFiles/Makefile2:527: samples/benchmark_test/CMakeFiles/benchmark_test.dir/all] Error 2
make[1]: *** Waiting for unfinished jobs....
[ 94%] Building CXX object src/tim/CMakeFiles/unit_test.dir/vx/ops/addn_test.cc.o
[ 94%] Building CXX object src/tim/CMakeFiles/unit_test.dir/vx/ops/avg_pool_test.cc.o
[ 95%] Linking CXX executable lenet
[ 95%] Linking CXX executable multi_thread_test
/usr/bin/ld: ../../src/tim/vx/internal/libtim_internal.a(matrixmul_vx.c.o): in function `_matrixmulsetup':
matrixmul_vx.c:(.text+0x120): undefined reference to `vxBatchGemmNode'
collect2: error: ld returned 1 exit status
make[2]: *** [samples/lenet/CMakeFiles/lenet.dir/build.make:100: samples/lenet/lenet] Error 1
make[1]: *** [CMakeFiles/Makefile2:555: samples/lenet/CMakeFiles/lenet.dir/all] Error 2
[ 95%] Building CXX object src/tim/CMakeFiles/unit_test.dir/vx/ops/conv1d_test.cc.o
/usr/bin/ld: ../../src/tim/vx/internal/libtim_internal.a(matrixmul_vx.c.o): in function `_matrixmulsetup':
matrixmul_vx.c:(.text+0x120): undefined reference to `vxBatchGemmNode'
collect2: error: ld returned 1 exit status
make[2]: *** [samples/multi_thread_test/CMakeFiles/multi_thread_test.dir/build.make:100: samples/multi_thread_test/multi_thread_test] Error 1
make[1]: *** [CMakeFiles/Makefile2:583: samples/multi_thread_test/CMakeFiles/multi_thread_test.dir/all] Error 2
[ 95%] Building CXX object src/tim/CMakeFiles/unit_test.dir/vx/ops/conv2d_test.cc.o
[ 95%] Building CXX object src/tim/CMakeFiles/unit_test.dir/vx/ops/deconv1d_test.cc.o
[ 96%] Building CXX object src/tim/CMakeFiles/unit_test.dir/vx/ops/deconv2d_test.cc.o
[ 96%] Building CXX object src/tim/CMakeFiles/unit_test.dir/vx/ops/depthwiseConv_test.cc.o
[ 96%] Building CXX object src/tim/CMakeFiles/unit_test.dir/vx/ops/elementwise_test.cc.o
[ 96%] Building CXX object src/tim/CMakeFiles/unit_test.dir/vx/ops/groupedconv2d_test.cc.o
[ 96%] Building CXX object src/tim/CMakeFiles/unit_test.dir/vx/ops/instancenormalization_test.cc.o
[ 96%] Building CXX object src/tim/CMakeFiles/unit_test.dir/vx/ops/layernormalization_test.cc.o
[ 97%] Building CXX object src/tim/CMakeFiles/unit_test.dir/vx/ops/logsoftmax_test.cc.o
[ 97%] Building CXX object src/tim/CMakeFiles/unit_test.dir/vx/ops/matmul_test.cc.o
[ 97%] Linking CXX shared library libtim-vx.so
[ 97%] Building CXX object src/tim/CMakeFiles/unit_test.dir/vx/ops/maxpoolwithargmax_test.cc.o
[ 97%] Building CXX object src/tim/CMakeFiles/unit_test.dir/vx/ops/maxunpool2d_test.cc.o
[ 97%] Built target tim-vx
[ 97%] Building CXX object src/tim/CMakeFiles/unit_test.dir/vx/ops/moments_test.cc.o
[ 97%] Building CXX object src/tim/CMakeFiles/unit_test.dir/vx/ops/relational_operations_test.cc.o
[ 98%] Building CXX object src/tim/CMakeFiles/unit_test.dir/vx/ops/reorg_test.cc.o
[ 98%] Building CXX object src/tim/CMakeFiles/unit_test.dir/vx/ops/resize1d_test.cc.o
[ 98%] Building CXX object src/tim/CMakeFiles/unit_test.dir/vx/ops/scatternd_test.cc.o
[ 98%] Building CXX object src/tim/CMakeFiles/unit_test.dir/vx/ops/shuffle_channel_test.cc.o
[ 98%] Building CXX object src/tim/CMakeFiles/unit_test.dir/vx/ops/simple_operations_test.cc.o
[ 98%] Building CXX object src/tim/CMakeFiles/unit_test.dir/vx/ops/spatial_transformer_test.cc.o
[ 99%] Building CXX object src/tim/CMakeFiles/unit_test.dir/vx/ops/tile_test.cc.o
[ 99%] Building CXX object src/tim/CMakeFiles/unit_test.dir/vx/ops/transposeConv_test.cc.o
[ 99%] Building CXX object src/tim/CMakeFiles/unit_test.dir/vx/ops/unidirectional_sequence_lstm_test.cc.o
[ 99%] Building CXX object src/tim/CMakeFiles/unit_test.dir/vx/ops/unstack_test.cc.o
[ 99%] Building CXX object src/tim/CMakeFiles/unit_test.dir/transform/layout_inference_test.cc.o
[100%] Linking CXX executable unit_test
/usr/bin/ld: vx/internal/libtim_internal.a(matrixmul_vx.c.o): in function `_matrixmulsetup':
matrixmul_vx.c:(.text+0x120): undefined reference to `vxBatchGemmNode'
collect2: error: ld returned 1 exit status
make[2]: *** [src/tim/CMakeFiles/unit_test.dir/build.make:549: src/tim/unit_test] Error 1
make[1]: *** [CMakeFiles/Makefile2:418: src/tim/CMakeFiles/unit_test.dir/all] Error 2
make: *** [Makefile:130: all] Error 2
khadas@Khadas:~/TIM-VX-1.1.34.fix/build$ |
I try cross-compile the TIM-VX now. It can be compiled. |
Is TIM-VX unit test running OK now? |
@thezha Read More
Running main() from /media/data/home/leokuo/TIM-VX-1.1.34.fix/build_x86/_deps/googletest-src/googletest/src/gtest_main.cc
[==========] Running 122 tests from 39 test suites.
[----------] Global test environment set-up.
[----------] 1 test from Context
[ RUN ] Context.create
[ OK ] Context.create (121 ms)
[----------] 1 test from Context (121 ms total)
[----------] 2 tests from graph
[ RUN ] graph.gen_binary_graph_with_empty_graph
E [_graph_optimization_convert_int8_to_uint8:810]CHECK STATUS(-1:A generic error code, used when no other describes the error.)
E [vsi_nn_OptimizeGraph:845]CHECK STATUS(-1:A generic error code, used when no other describes the error.)
[ OK ] graph.gen_binary_graph_with_empty_graph (140 ms)
[ RUN ] graph.gen_binary_graph_with_simple_add
[ OK ] graph.gen_binary_graph_with_simple_add (294 ms)
[----------] 2 tests from graph (434 ms total)
[----------] 2 tests from Linear
[ RUN ] Linear.shape_5_1_fp32
[ OK ] Linear.shape_5_1_fp32 (180 ms)
[ RUN ] Linear.shape_5_1_fp32_omit_b
[ OK ] Linear.shape_5_1_fp32_omit_b (179 ms)
[----------] 2 tests from Linear (359 ms total)
[----------] 2 tests from Gelu
[ RUN ] Gelu.shape_5_1_fp32_approximate
W [_setup:243]Call vxTensorTableLookupLayer fail.
[ OK ] Gelu.shape_5_1_fp32_approximate (160 ms)
[ RUN ] Gelu.shape_5_1_uint8_Quantized
[ OK ] Gelu.shape_5_1_uint8_Quantized (128 ms)
[----------] 2 tests from Gelu (288 ms total)
[----------] 3 tests from AddN
[ RUN ] AddN.shape_2_2_int32
[ OK ] AddN.shape_2_2_int32 (230 ms)
[ RUN ] AddN.shape_3_1_float32
[ OK ] AddN.shape_3_1_float32 (230 ms)
[ RUN ] AddN.shape_2_2_uint8_Quantized
/media/data/home/leokuo/TIM-VX-1.1.34.fix/src/tim/vx/test_utils.h:118: Failure
The difference between expected[i] and actual[i] is 4, which exceeds abs_error, where
expected[i] evaluates to 131,
actual[i] evaluates to 127, and
abs_error evaluates to 1.
at index:0
/media/data/home/leokuo/TIM-VX-1.1.34.fix/src/tim/vx/test_utils.h:118: Failure
The difference between expected[i] and actual[i] is 11, which exceeds abs_error, where
expected[i] evaluates to 138,
actual[i] evaluates to 127, and
abs_error evaluates to 1.
at index:1
/media/data/home/leokuo/TIM-VX-1.1.34.fix/src/tim/vx/test_utils.h:118: Failure
The difference between expected[i] and actual[i] is 6, which exceeds abs_error, where
expected[i] evaluates to 133,
actual[i] evaluates to 127, and
abs_error evaluates to 1.
at index:2
/media/data/home/leokuo/TIM-VX-1.1.34.fix/src/tim/vx/test_utils.h:118: Failure
The difference between expected[i] and actual[i] is 17, which exceeds abs_error, where
expected[i] evaluates to 144,
actual[i] evaluates to 127, and
abs_error evaluates to 1.
at index:3
[ FAILED ] AddN.shape_2_2_uint8_Quantized (998 ms)
[----------] 3 tests from AddN (1458 ms total)
[----------] 4 tests from AVG
[ RUN ] AVG.shape_3_3_1_2_fp32_kernel_2_stride_1
[ OK ] AVG.shape_3_3_1_2_fp32_kernel_2_stride_1 (1055 ms)
[ RUN ] AVG.shape_3_3_1_1_fp32_kernel_2_stride_1
[ OK ] AVG.shape_3_3_1_1_fp32_kernel_2_stride_1 (1068 ms)
[ RUN ] AVG.shape_3_3_1_1_uint8_kernel_2_stride_1
[ OK ] AVG.shape_3_3_1_1_uint8_kernel_2_stride_1 (127 ms)
[ RUN ] AVG.shape_60_52_3_5_fp32_kernel_35_stride_5
[ OK ] AVG.shape_60_52_3_5_fp32_kernel_35_stride_5 (5096 ms)
[----------] 4 tests from AVG (7346 ms total)
[----------] 2 tests from AVG_ANDROID
[ RUN ] AVG_ANDROID.shape_60_52_3_5_fp32_kernel_35_stride_5
[ OK ] AVG_ANDROID.shape_60_52_3_5_fp32_kernel_35_stride_5 (5113 ms)
[ RUN ] AVG_ANDROID.shape_60_52_3_5_uint8_kernel_35_stride_5
Segmentation fault (core dumped) If I execute the old version(1.1.32) unit test with new version SDK(6.4.8) and Galcore version 6.4.6.2. Read More```console khadas@Khadas:~/TIM-VX-1.1.32/install/bin$ ./unit_test Running main() from /home/khadas/TIM-VX-1.1.32/_deps/googletest-src/googletest/src/gtest_main.cc [==========] Running 104 tests from 33 test suites. [----------] Global test environment set-up. [----------] 1 test from Context [ RUN ] Context.create [ OK ] Context.create (25 ms) [----------] 1 test from Context (25 ms total) [----------] 2 tests from graph [----------] 2 tests from Linear
|
please run ldd on the unit_test for both x86 and khadas and supply the output here.
|
@thezha ~/TIM-VX-1.1.34.fix/build_x86/src/tim$ ldd unit_test
linux-vdso.so.1 (0x00007ffcf6fbb000)
libpthread.so.0 => /lib/x86_64-linux-gnu/libpthread.so.0 (0x00007fe03d4e0000)
libOpenVX.so.1 => /usr/lib/x86_64-linux-gnu/libOpenVX.so.1 (0x00007fe03cbe0000)
libstdc++.so.6 => /usr/lib/x86_64-linux-gnu/libstdc++.so.6 (0x00007fe03c7d3000)
libm.so.6 => /lib/x86_64-linux-gnu/libm.so.6 (0x00007fe03c435000)
libgcc_s.so.1 => /lib/x86_64-linux-gnu/libgcc_s.so.1 (0x00007fe03c21d000)
libc.so.6 => /lib/x86_64-linux-gnu/libc.so.6 (0x00007fe03be2c000)
/lib64/ld-linux-x86-64.so.2 (0x00007fe03e04b000)
libVSC.so => /usr/lib/x86_64-linux-gnu/libVSC.so (0x00007fe03ac18000)
libGAL.so => /usr/lib/x86_64-linux-gnu/libGAL.so (0x00007fe03a7e1000)
libArchModelSw.so => /usr/lib/x86_64-linux-gnu/libArchModelSw.so (0x00007fe03a580000)
libNNArchPerf.so => /usr/lib/x86_64-linux-gnu/libNNArchPerf.so (0x00007fe03a314000)
libdl.so.2 => /lib/x86_64-linux-gnu/libdl.so.2 (0x00007fe03a110000)
librt.so.1 => /lib/x86_64-linux-gnu/librt.so.1 (0x00007fe039f08000)
libEmulator.so => /usr/lib/x86_64-linux-gnu/libEmulator.so (0x00007fe039aa9000)
libvdtproxy.so => /usr/lib/x86_64-linux-gnu/libvdtproxy.so (0x00007fe0398a6000)
Khadas(TIM-VX 1.1.32): ~/TIM-VX-1.1.32/src/tim$ ldd unit_test
linux-vdso.so.1 (0x0000007f7a974000)
libpthread.so.0 => /lib/aarch64-linux-gnu/libpthread.so.0 (0x0000007f7a204000)
libOpenVX.so => /lib/libOpenVX.so (0x0000007f79fe3000)
libstdc++.so.6 => /lib/aarch64-linux-gnu/libstdc++.so.6 (0x0000007f79dfe000)
libm.so.6 => /lib/aarch64-linux-gnu/libm.so.6 (0x0000007f79d51000)
libgcc_s.so.1 => /lib/aarch64-linux-gnu/libgcc_s.so.1 (0x0000007f79d2d000)
libc.so.6 => /lib/aarch64-linux-gnu/libc.so.6 (0x0000007f79bba000)
/lib/ld-linux-aarch64.so.1 (0x0000007f7a944000)
libVSC.so => /lib/libVSC.so (0x0000007f78c01000)
libGAL.so => /lib/libGAL.so (0x0000007f789ff000)
libArchModelSw.so => /lib/libArchModelSw.so (0x0000007f789b0000)
libNNArchPerf.so => /lib/libNNArchPerf.so (0x0000007f78943000)
libdl.so.2 => /lib/aarch64-linux-gnu/libdl.so.2 (0x0000007f7892f000)
librt.so.1 => /lib/aarch64-linux-gnu/librt.so.1 (0x0000007f78917000)
I cannot find unit_test in TIM-VX v1.1.34.fix(Khadas). |
unit_test is not enabled by default, it must be built with 'cmake -DTIM_VX_ENABLE_TEST=ON ..' From your LDD result, it seems that you copied the SDK libraries to system library folders, this is not advised because they are not part of the system library. You should remove them from system library path /usr/lib/x86_64-linux-gnu and use LD_LIBRARY_PATH instead. something like this.
|
If I want to cross compile for A311D, should I build with "cmake -DCONFIG=A311D -DTIM_VX_ENABLE_TEST=ON .." ?
At Khadas side, May I copy from TIM-VX/build/install/lib/* to /usr/lib?
|
I tried it. If I set both -DCONFIG=A311D and -DTIM_VX_ENABLE_TEST=ON, there is no unit_test in the src/tim/ ~/TIM-VX-1.1.34.fix/build2/src/tim$ ls
CMakeFiles libtim-vx.so Makefile vx
cmake_install.cmake libtim-vx-static.a utils
|
It is recommended to copy the entire aarch64_A311D_6.4.8/ folder onto board somewhere and set LD_LIBRARY_PATH to point to it. Something like this,
Also, inside aarch64_A311D_6.4.8/ folder there is a corresponding galcore.ko, and you should use that. |
@sunshinemyson Any idea? |
This is an issue from CMake. Because we hard-reset compiler configuration in A311D.cmake, the cmake will reconfig the project and the TIM_VX_ENABLE_TEST will be reseted. To fix this issue, you need comment out following config in the A311D.cmake, and create a toolchain config locally. Here is my config for your reference |
I followed above steps. Commented out in cmake/A311D.cmake and put toolchain-vim3.cmake.txt into cmake folder. [ 99%] Linking CXX shared library libtim-vx.so
../../aarch64_A311D_6.4.8/lib/libCLC.so: error adding symbols: File in wrong format
collect2: error: ld returned 1 exit status
src/tim/CMakeFiles/tim-vx.dir/build.make:1080: recipe for target 'src/tim/libtim-vx.so' failed
make[2]: *** [src/tim/libtim-vx.so] Error 1
CMakeFiles/Makefile2:221: recipe for target 'src/tim/CMakeFiles/tim-vx.dir/all' failed
make[1]: *** [src/tim/CMakeFiles/tim-vx.dir/all] Error 2
make[1]: *** Waiting for unfinished jobs....
[ 99%] Linking CXX executable benchmark_test
../../aarch64_A311D_6.4.8/lib/libCLC.so: error adding symbols: File in wrong format
collect2: error: ld returned 1 exit status
samples/benchmark_test/CMakeFiles/benchmark_test.dir/build.make:112: recipe for target 'samples/benchmark_test/benchmark_test' failed
make[2]: *** [samples/benchmark_test/benchmark_test] Error 1
CMakeFiles/Makefile2:300: recipe for target 'samples/benchmark_test/CMakeFiles/benchmark_test.dir/all' failed
make[1]: *** [samples/benchmark_test/CMakeFiles/benchmark_test.dir/all] Error 2
[ 99%] Linking CXX executable lenet
../../aarch64_A311D_6.4.8/lib/libCLC.so: error adding symbols: File in wrong format
collect2: error: ld returned 1 exit status
samples/lenet/CMakeFiles/lenet.dir/build.make:112: recipe for target 'samples/lenet/lenet' failed
make[2]: *** [samples/lenet/lenet] Error 1
CMakeFiles/Makefile2:327: recipe for target 'samples/lenet/CMakeFiles/lenet.dir/all' failed
make[1]: *** [samples/lenet/CMakeFiles/lenet.dir/all] Error 2
[100%] Linking CXX executable multi_thread_test
../../aarch64_A311D_6.4.8/lib/libCLC.so: error adding symbols: File in wrong format
collect2: error: ld returned 1 exit status
samples/multi_thread_test/CMakeFiles/multi_thread_test.dir/build.make:112: recipe for target 'samples/multi_thread_test/multi_thread_test' failed
make[2]: *** [samples/multi_thread_test/multi_thread_test] Error 1
CMakeFiles/Makefile2:354: recipe for target 'samples/multi_thread_test/CMakeFiles/multi_thread_test.dir/all' failed
make[1]: *** [samples/multi_thread_test/CMakeFiles/multi_thread_test.dir/all] Error 2
Makefile:135: recipe for target 'all' failed
make: *** [all] Error 2
|
it looks like you link target so with host build. did you set toolchain by -DCMAKE_TOOLCHAIN_FILE ? |
I used, and got error. cmake -DCONFIG=A311D -DTIM_VX_ENABLE_TEST=ON -DCMAKE_TOOLCHAIN_FILE=TIM-VX-1.1.34.fix/cmake/toolchain-vim3.cmake ..
-- The C compiler identification is unknown
-- The CXX compiler identification is unknown
CMake Error at CMakeLists.txt:2 (project):
The CMAKE_C_COMPILER:
/opt/test_hub/vosp/toolchain/vim3_A311D/gcc-linaro-7.3.1-2018.05-x86_64_aarch64-linux-gnu/bin/aarch64-linux-gnu-gcc
is not a full path to an existing compiler tool.
Tell CMake where to find the compiler by setting either the environment
variable "CC" or the CMake cache entry CMAKE_C_COMPILER to the full path to
the compiler, or to the compiler name if it is in the PATH.
CMake Error at CMakeLists.txt:2 (project):
The CMAKE_CXX_COMPILER:
/opt/test_hub/vosp/toolchain/vim3_A311D/gcc-linaro-7.3.1-2018.05-x86_64_aarch64-linux-gnu/bin/aarch64-linux-gnu-g++
is not a full path to an existing compiler tool.
Tell CMake where to find the compiler by setting either the environment
variable "CXX" or the CMake cache entry CMAKE_CXX_COMPILER to the full path
to the compiler, or to the compiler name if it is in the PATH.
-- Configuring incomplete, errors occurred!
I want to set CROSS_COMPILE_ENV to ${PROJECT_BINARY_DIR}/gcc-linaro-7.3.1-2018.05-x86_64_aarch64-linux-gnu. #Original
set(CROSS_COMPILE_ENV "/opt/test_hub/vosp/toolchain/vim3_A311D/gcc-linaro-7.3.1-2018.05-x86_64_aarch64-linux-gnu")
#Modified
set(CROSS_COMPILE_ENV "${PROJECT_BINARY_DIR}/gcc-linaro-7.3.1-2018.05-x86_64_aarch64-linux-gnu") Then, got another error. cmake -DCONFIG=A311D -DTIM_VX_ENABLE_TEST=ON -DCMAKE_TOOLCHAIN_FILE=/media/data/home/leokuo/TIM-VX-1.1.34.fix/cmake/toolchain-vim3.cmake ..
-- The C compiler identification is GNU 7.3.1
-- The CXX compiler identification is GNU 7.3.1
-- Detecting C compiler ABI info
-- Detecting C compiler ABI info - failed
-- Check for working C compiler: /media/data/home/leokuo/TIM-VX-1.1.34.fix/build/gcc-linaro-7.3.1-2018.05-x86_64_aarch64-linux-gnu/bin/aarch64-linux-gnu-gcc
-- Check for working C compiler: /media/data/home/leokuo/TIM-VX-1.1.34.fix/build/gcc-linaro-7.3.1-2018.05-x86_64_aarch64-linux-gnu/bin/aarch64-linux-gnu-gcc - broken
CMake Error at /media/data/shared/cmake-3.20.0-rc2-linux-x86_64/share/cmake-3.20/Modules/CMakeTestCCompiler.cmake:66 (message):
The C compiler
"/media/data/home/leokuo/TIM-VX-1.1.34.fix/build/gcc-linaro-7.3.1-2018.05-x86_64_aarch64-linux-gnu/bin/aarch64-linux-gnu-gcc"
is not able to compile a simple test program.
It fails with the following output:
Change Dir: /media/data/home/leokuo/TIM-VX-1.1.34.fix/build/CMakeFiles/CMakeTmp
Run Build Command(s):/usr/bin/make -f Makefile cmTC_7fde6/fast && /usr/bin/make -f CMakeFiles/cmTC_7fde6.dir/build.make CMakeFiles/cmTC_7fde6.dir/build
make[1]: Entering directory '/media/data/home/leokuo/TIM-VX-1.1.34.fix/build/CMakeFiles/CMakeTmp'
Building C object CMakeFiles/cmTC_7fde6.dir/testCCompiler.c.o
/media/data/home/leokuo/TIM-VX-1.1.34.fix/build/gcc-linaro-7.3.1-2018.05-x86_64_aarch64-linux-gnu/bin/aarch64-linux-gnu-gcc --sysroot=/media/data/home/leokuo/TIM-VX-1.1.34.fix/build/CMakeFiles/CMakeTmp/gcc-linaro-7.3.1-2018.05-x86_64_aarch64-linux-gnu/aarch64-linux-gnu/libc -mtune=cortex-a53 -o CMakeFiles/cmTC_7fde6.dir/testCCompiler.c.o -c /media/data/home/leokuo/TIM-VX-1.1.34.fix/build/CMakeFiles/CMakeTmp/testCCompiler.c
Linking C executable cmTC_7fde6
/media/data/shared/cmake-3.20.0-rc2-linux-x86_64/bin/cmake -E cmake_link_script CMakeFiles/cmTC_7fde6.dir/link.txt --verbose=1
/media/data/home/leokuo/TIM-VX-1.1.34.fix/build/gcc-linaro-7.3.1-2018.05-x86_64_aarch64-linux-gnu/bin/aarch64-linux-gnu-gcc --sysroot=/media/data/home/leokuo/TIM-VX-1.1.34.fix/build/CMakeFiles/CMakeTmp/gcc-linaro-7.3.1-2018.05-x86_64_aarch64-linux-gnu/aarch64-linux-gnu/libc CMakeFiles/cmTC_7fde6.dir/testCCompiler.c.o -o cmTC_7fde6
/media/data/home/leokuo/TIM-VX-1.1.34.fix/build/gcc-linaro-7.3.1-2018.05-x86_64_aarch64-linux-gnu/bin/../lib/gcc/aarch64-linux-gnu/7.3.1/../../../../aarch64-linux-gnu/bin/ld: cannot find crt1.o: No such file or directory
/media/data/home/leokuo/TIM-VX-1.1.34.fix/build/gcc-linaro-7.3.1-2018.05-x86_64_aarch64-linux-gnu/bin/../lib/gcc/aarch64-linux-gnu/7.3.1/../../../../aarch64-linux-gnu/bin/ld: cannot find crti.o: No such file or directory
/media/data/home/leokuo/TIM-VX-1.1.34.fix/build/gcc-linaro-7.3.1-2018.05-x86_64_aarch64-linux-gnu/bin/../lib/gcc/aarch64-linux-gnu/7.3.1/../../../../aarch64-linux-gnu/bin/ld: cannot find -lc
/media/data/home/leokuo/TIM-VX-1.1.34.fix/build/gcc-linaro-7.3.1-2018.05-x86_64_aarch64-linux-gnu/bin/../lib/gcc/aarch64-linux-gnu/7.3.1/../../../../aarch64-linux-gnu/bin/ld: cannot find crtn.o: No such file or directory
collect2: error: ld returned 1 exit status
CMakeFiles/cmTC_7fde6.dir/build.make:98: recipe for target 'cmTC_7fde6' failed
make[1]: *** [cmTC_7fde6] Error 1
make[1]: Leaving directory '/media/data/home/leokuo/TIM-VX-1.1.34.fix/build/CMakeFiles/CMakeTmp'
Makefile:127: recipe for target 'cmTC_7fde6/fast' failed
make: *** [cmTC_7fde6/fast] Error 2
CMake will not be able to correctly generate this project.
Call Stack (most recent call first):
CMakeLists.txt:2 (project)
-- Configuring incomplete, errors occurred!
See also "/media/data/home/leokuo/TIM-VX-1.1.34.fix/build/CMakeFiles/CMakeOutput.log".
See also "/media/data/home/leokuo/TIM-VX-1.1.34.fix/build/CMakeFiles/CMakeError.log".
Should I need to change any environment about PATH or CMAKE? |
You should download toolchain from https://cnbj1.fds.api.xiaomi.com/mace/third-party/gcc-linaro/gcc-linaro-7.3.1-2018.05-x86_64_aarch64-linux-gnu.tar.xz, and change the toolchain configuration with your local install directory. This is my local host directory, you need change it. |
@sunshinemyson cmake -DCONFIG=A311D -DTIM_VX_ENABLE_TEST=ON -DCMAKE_TOOLCHAIN_FILE=/media/data/home/leokuo/TIM-VX-1.1.34.fix/cmake/toolchain-vim3.cmake ..
make -j32
make install
ls -al src/tim/
total 31868
drwxr-xr-x 5 leokuo leokuo 4096 Oct 14 13:48 .
drwxr-xr-x 3 leokuo leokuo 4096 Oct 14 11:51 ..
drwxr-xr-x 4 leokuo leokuo 4096 Oct 14 11:51 CMakeFiles
-rw-r--r-- 1 leokuo leokuo 5960 Oct 14 11:51 cmake_install.cmake
-rwxr-xr-x 1 leokuo leokuo 11956368 Oct 14 13:48 libtim-vx.so
-rw-r--r-- 1 leokuo leokuo 20526940 Oct 14 13:48 libtim-vx-static.a
-rw-r--r-- 1 leokuo leokuo 112260 Oct 14 11:51 Makefile
drwxr-xr-x 3 leokuo leokuo 4096 Oct 14 11:51 utils
drwxr-xr-x 3 leokuo leokuo 4096 Oct 14 11:51 vx |
@sunshinemyson khadas@Khadas:~/TIM-VX-1.1.34.fix/build/src/tim$ ./unit_test
Running main() from /media/data/home/leokuo/TIM-VX-1.1.34.fix/build/_deps/googletest-src/googletest/src/gtest_main.cc
[==========] Running 122 tests from 39 test suites.
[----------] Global test environment set-up.
[----------] 1 test from Context
[ RUN ] Context.create
[ OK ] Context.create (23 ms)
[----------] 1 test from Context (23 ms total)
[----------] 2 tests from graph
[ RUN ] graph.gen_binary_graph_with_empty_graph
E [_graph_optimization_convert_int8_to_uint8:810]CHECK STATUS(-1:A generic error code, used when no other describes the error.)
E [vsi_nn_OptimizeGraph:845]CHECK STATUS(-1:A generic error code, used when no other describes the error.)
[ OK ] graph.gen_binary_graph_with_empty_graph (3 ms)
[ RUN ] graph.gen_binary_graph_with_simple_add
/media/data/home/leokuo/TIM-VX-1.1.34.fix/src/tim/vx/graph_test.cc:61: Failure
Value of: graph->CompileToBinary(nbg_buf.data(), &bin_size)
Actual: false
Expected: true
/media/data/home/leokuo/TIM-VX-1.1.34.fix/src/tim/vx/graph_test.cc:72: Failure
Expected equality of these values:
output
Which is: 0
expected_out
Which is: 2
E [compute_node:379]Create node[0] NBG fail
/media/data/home/leokuo/TIM-VX-1.1.34.fix/src/tim/vx/graph_test.cc:86: Failure
Value of: nbg_graph->Compile()
Actual: false
Expected: true
/media/data/home/leokuo/TIM-VX-1.1.34.fix/src/tim/vx/graph_test.cc:87: Failure
Value of: nbg_graph->Run()
Actual: false
Expected: true
/media/data/home/leokuo/TIM-VX-1.1.34.fix/src/tim/vx/graph_test.cc:91: Failure
Expected equality of these values:
output
Which is: 0
expected_out
Which is: 2
[ FAILED ] graph.gen_binary_graph_with_simple_add (8 ms)
[----------] 2 tests from graph (11 ms total)
[----------] 2 tests from Linear
[ RUN ] Linear.shape_5_1_fp32
/media/data/home/leokuo/TIM-VX-1.1.34.fix/src/tim/vx/ops/activations_test.cc:55: Failure
Value of: graph->Compile()
Actual: false
Expected: true
/media/data/home/leokuo/TIM-VX-1.1.34.fix/src/tim/vx/ops/activations_test.cc:56: Failure
Value of: graph->Run()
Actual: false
Expected: true
/media/data/home/leokuo/TIM-VX-1.1.34.fix/src/tim/vx/ops/activations_test.cc:59: Failure
Expected equality of these values:
golden
Which is: { -0.5, 1.9, 2, 2.55, inf }
output
Which is: { 0, 0, 0, 0, 0 }
[ FAILED ] Linear.shape_5_1_fp32 (7 ms)
[ RUN ] Linear.shape_5_1_fp32_omit_b
/media/data/home/leokuo/TIM-VX-1.1.34.fix/src/tim/vx/ops/activations_test.cc:86: Failure
Value of: graph->Compile()
Actual: false
Expected: true
/media/data/home/leokuo/TIM-VX-1.1.34.fix/src/tim/vx/ops/activations_test.cc:87: Failure
Value of: graph->Run()
Actual: false
Expected: true
/media/data/home/leokuo/TIM-VX-1.1.34.fix/src/tim/vx/ops/activations_test.cc:90: Failure
Expected equality of these values:
golden
Which is: { -5, -0.2, 0, 1.1, inf }
output
Which is: { 0, 0, 0, 0, 0 }
[ FAILED ] Linear.shape_5_1_fp32_omit_b (7 ms)
[----------] 2 tests from Linear (14 ms total)
[----------] 2 tests from Gelu
[ RUN ] Gelu.shape_5_1_fp32_approximate
W [_setup:243]Call vxTensorTableLookupLayer fail.
Segmentation fault
|
@leo, Please try to set VIV_VX_DEBUG_LEVEL=1 and share the log again. It's interesting because I can get a full pass on my side. And your graph cannot be compiled successfully all the time. |
@sunshinemyson khadas@Khadas:~/TIM-VX-1.1.34.fix/build/src/tim$ ./unit_test
Running main() from /media/data/home/leokuo/TIM-VX-1.1.34.fix/build/_deps/googletest-src/googletest/src/gtest_main.cc
[==========] Running 122 tests from 39 test suites.
[----------] Global test environment set-up.
[----------] 1 test from Context
[ RUN ] Context.create
#productname=VIPNano-QI, pid=0x88
#productname=VIPNano-QI, pid=0x88
Created VX Thread: 0x79fa81b0
Created VX Thread: 0x7ad621b0
Exit VX Thread: 0x79fa81b0
#productname=VIPNano-QI, pid=0x88
Created VX Thread: 0x79fa81b0
Exit VX Thread: 0x79fa81b0
Exit VX Thread: 0x7ad621b0
[ OK ] Context.create (30 ms)
[----------] 1 test from Context (30 ms total)
[----------] 2 tests from graph
[ RUN ] graph.gen_binary_graph_with_empty_graph
#productname=VIPNano-QI, pid=0x88
Created VX Thread: 0x7ad621b0
E [_graph_optimization_convert_int8_to_uint8:810]CHECK STATUS(-1:A generic error code, used when no other describes the error.)
E [vsi_nn_OptimizeGraph:845]CHECK STATUS(-1:A generic error code, used when no other describes the error.)
Exit VX Thread: 0x7ad621b0
[ OK ] graph.gen_binary_graph_with_empty_graph (5 ms)
[ RUN ] graph.gen_binary_graph_with_simple_add
Created VX Thread: 0x7ad621b0
#productname=VIPNano-QI, pid=0x88
prev_ptrs = 0x3cb77740
prev_ptrs = 0x3cbb07c0
prev_ptrs = 0x3cbb0fc0
---------------------------Begin VerifyTiling -------------------------
AXI-SRAM = 1048576 Bytes VIP-SRAM = 522240 Bytes SWTILING_PHASE_FEATURES[0, 1, 1]
0 SH [( 1 1 1 1, 4, 0x0x3cb77b60(0x0x3cb77b60, 0x(nil)) -> 1 1 1 1, 4, 0x0x3cbb1280(0x0x3cbb1280, 0x(nil))) k(0 0 0, 0) pad(0 0) pool(0 0, 1 1)]
id IN [ x y w h ] OUT [ x y w h ] (tx, ty, kpc) (ic, kc, kc/ks, ks/eks, kernel_type)
0 SH DD 0x(nil) [ 0 0 0 0] -> DD 0x(nil) [ 0 0 0 0] ( 0, 0, 0) ( 0, 0, 0.000000%, 0.000000%, NONE)
PreLoadWeightBiases = 1048576 100.000000%
---------------------------End VerifyTiling -------------------------
KernelStreamSize: 0x0, statesSize: 0x380, shShareMemSize: 0x0, shIntrSize: 0x0, shParaSize: 0x0, swParaSize: 0x0, lcdTensorSize: 0x0, shaderStatesSize: 0x380, tensorStatic: 0x0
NBG: operationSize: 0x78, nnSize: 0x0, tpSize: 0x0, shSize: 0x4, swSize: 0x0, layerParamSize: 0x0, lcdtSize: 0x48, patchSize: 0x364, lcdSize 0x480
NBG: entranceSize: 0x1f0, nbIOSize: 0x15c, layeSize: 0x4c, sectionsSize: 0x450, inputoutput size: 0x0, InitCommands size: 0x540
NBG: lcdSize: 0x480, headerSize : 0x7e8
Calculate NBG size : 4776 bytes
generate NBG into memory start.
vxoBinaryGraph_SaveBinaryEntrance[14907]: collect input count=0, output count=0
vxoBinaryGraph_SaveBinaryEntrance[14982]: total operation count=1
generate NBG, device count=1, core count per-device: 1,
input table address: 0x44fd9740 0x44fd67c0
output table address: 0x44fd3fc0
vxoBinaryGraph_SaveBinaryEntranceExt[14131]: graph input/output=2/1, refine input count=2, output count=1
NBG network name field : dummy_network_name
vxoBinaryGraph_SaveBinaryEntranceExt[14697]: header input count=2, output count=1
generate NGB, save initialize commands
generate NBG, map VIP-SRAM start address=0x400000
generate NBG, patch AXI-SRAM startAddress=0xff000000, endAddress=0xff100000
vxoBinaryGraph_SaveInitialOperation[10003]:fail to search AXI-SRAM address in init command buffer
Dump HEX data size 0x20
0801028A 00000011 08010E13 00000002 08010E21 00220000 3CF03630 00000000
vxoBinaryGraph_SaveBinaryEntrance[15553]: failed to save initial operation
/media/data/home/leokuo/TIM-VX-1.1.34.fix/src/tim/vx/graph_test.cc:61: Failure
Value of: graph->CompileToBinary(nbg_buf.data(), &bin_size)
Actual: false
Expected: true
prev_ptrs = 0x3cb77740
prev_ptrs = 0x3cbb07c0
prev_ptrs = 0x3cbb0fc0
/media/data/home/leokuo/TIM-VX-1.1.34.fix/src/tim/vx/graph_test.cc:72: Failure
Expected equality of these values:
output
Which is: 0
expected_out
Which is: 2
prev_ptrs = 0x3cebda00
prev_ptrs = 0x3cebe2c0
prev_ptrs = 0x3cebea80
prev_ptrs = 0x3cebda00
prev_ptrs = 0x3cebe2c0
binary graph format version, 0x1000c
readBinDynamic[1861]: lcd size if 0, error
fail in read Binary Dynamic
fail to load binary from pointer to create graph
NBG error, please provide genereating NBG logs first
fail to import kernel from VPMN
, error code: -1
E [compute_node:379]Create node[0] NBG fail
/media/data/home/leokuo/TIM-VX-1.1.34.fix/src/tim/vx/graph_test.cc:86: Failure
Value of: nbg_graph->Compile()
Actual: false
Expected: true
vxProcessGraph[15913]: Process Graph fail!
/media/data/home/leokuo/TIM-VX-1.1.34.fix/src/tim/vx/graph_test.cc:87: Failure
Value of: nbg_graph->Run()
Actual: false
Expected: true
prev_ptrs = 0x3cebea80
/media/data/home/leokuo/TIM-VX-1.1.34.fix/src/tim/vx/graph_test.cc:91: Failure
Expected equality of these values:
output
Which is: 0
expected_out
Which is: 2
prev_ptrs = 0x3cebda00
prev_ptrs = 0x3cebe2c0
prev_ptrs = 0x3cebea80
prev_ptrs = 0x3cb77740
prev_ptrs = 0x3cbb07c0
prev_ptrs = 0x3cbb0fc0
Exit VX Thread: 0x7ad621b0
[ FAILED ] graph.gen_binary_graph_with_simple_add (9 ms)
[----------] 2 tests from graph (14 ms total)
[----------] 2 tests from Linear
[ RUN ] Linear.shape_5_1_fp32
Created VX Thread: 0x7ad621b0
#productname=VIPNano-QI, pid=0x88
prev_ptrs = 0x3cebfcc0
prev_ptrs = 0x3cbb2ec0
prev_ptrs = 0x3cebfcc0
Save binary graph for VIPLite.
network binary graph file has been opened
---------------------------Begin VerifyTiling -------------------------
AXI-SRAM = 1048576 Bytes VIP-SRAM = 522240 Bytes SWTILING_PHASE_FEATURES[0, 1, 1]
0 SH [( 5 1 1 1, 20, 0x0x3cbb1280(0x0x3cbb1280, 0x(nil)) -> 5 1 1 1, 20, 0x0x3cbb0a90(0x0x3cbb0a90, 0x(nil))) k(0 0 0, 0) pad(0 0) pool(0 0, 1 1)]
id IN [ x y w h ] OUT [ x y w h ] (tx, ty, kpc) (ic, kc, kc/ks, ks/eks, kernel_type)
0 SH DD 0x(nil) [ 0 0 0 0] -> DD 0x(nil) [ 0 0 0 0] ( 0, 0, 0) ( 0, 0, 0.000000%, 0.000000%, NONE)
PreLoadWeightBiases = 1048576 100.000000%
---------------------------End VerifyTiling -------------------------
KernelStreamSize: 0x0, statesSize: 0x340, shShareMemSize: 0x0, shIntrSize: 0x0, shParaSize: 0x100, swParaSize: 0x0, lcdTensorSize: 0x0, shaderStatesSize: 0x340, tensorStatic: 0x0
NBG: operationSize: 0x78, nnSize: 0x0, tpSize: 0x0, shSize: 0x4, swSize: 0x0, layerParamSize: 0x0, lcdtSize: 0x50, patchSize: 0x380, lcdSize 0x540
NBG: entranceSize: 0x1f0, nbIOSize: 0xe8, layeSize: 0x4c, sectionsSize: 0x474, inputoutput size: 0x0, InitCommands size: 0x540
NBG: lcdSize: 0x540, headerSize : 0x798
Calculate NBG size : 4888 bytes
vxoBinaryGraph_SaveBinaryEntrance[14907]: collect input count=1, output count=1
vxoBinaryGraph_SaveBinaryEntrance[14982]: total operation count=1
generate NBG, device count=1, core count per-device: 1,
input table address: 0x44fc7cc0
output table address: 0x44fc4ec0
vxoBinaryGraph_SaveBinaryEntranceExt[14131]: graph input/output=1/1, refine input count=1, output count=1
NBG network name field : dummy_network_name
vxoBinaryGraph_SaveBinaryEntranceExt[14697]: header input count=1, output count=1
generate NGB, save initialize commands
generate NBG, map VIP-SRAM start address=0x400000
generate NBG, patch AXI-SRAM startAddress=0xff000000, endAddress=0xff100000
vxoBinaryGraph_SaveInitialOperation[10003]:fail to search AXI-SRAM address in init command buffer
Dump HEX data size 0x20
0801028A 00000011 08010E13 00000002 08010E21 00220000 3CECD780 00000000
vxoBinaryGraph_SaveErrorHandle[8965]: failed to save NBG file, remove it, name=network_binary_pid-166964_tid-2098951744.nb
vxoBinaryGraph_SaveBinaryEntrance[15553]: failed to save initial operation
vxoBinaryGraph_SaveErrorHandle[8965]: failed to save NBG file, remove it, name=network_binary_pid-166964_tid-2098951744.nb
/media/data/home/leokuo/TIM-VX-1.1.34.fix/src/tim/vx/ops/activations_test.cc:55: Failure
Value of: graph->Compile()
Actual: false
Expected: true
---------------------------Begin VerifyTiling -------------------------
AXI-SRAM = 1048576 Bytes VIP-SRAM = 522240 Bytes SWTILING_PHASE_FEATURES[0, 1, 1]
0 SH [( 5 1 1 1, 20, 0x0x3cbb1280(0x0x3cbb1280, 0x(nil)) -> 5 1 1 1, 20, 0x0x3cbb0a90(0x0x3cbb0a90, 0x(nil))) k(0 0 0, 0) pad(0 0) pool(0 0, 1 1)]
id IN [ x y w h ] OUT [ x y w h ] (tx, ty, kpc) (ic, kc, kc/ks, ks/eks, kernel_type)
0 SH DD 0x(nil) [ 0 0 0 0] -> DD 0x(nil) [ 0 0 0 0] ( 0, 0, 0) ( 0, 0, 0.000000%, 0.000000%, NONE)
PreLoadWeightBiases = 1048576 100.000000%
---------------------------End VerifyTiling -------------------------
KernelStreamSize: 0x0, statesSize: 0x340, shShareMemSize: 0x0, shIntrSize: 0x0, shParaSize: 0x100, swParaSize: 0x0, lcdTensorSize: 0x0, shaderStatesSize: 0x340, tensorStatic: 0x0
NBG: operationSize: 0x78, nnSize: 0x0, tpSize: 0x0, shSize: 0x4, swSize: 0x0, layerParamSize: 0x0, lcdtSize: 0x50, patchSize: 0x380, lcdSize 0x540
NBG: entranceSize: 0x1f0, nbIOSize: 0xe8, layeSize: 0x4c, sectionsSize: 0x474, inputoutput size: 0x0, InitCommands size: 0x540
NBG: lcdSize: 0x540, headerSize : 0x798
Calculate NBG size : 4888 bytes
vxoBinaryGraph_CollectInputAndOutput[13820]: input node param count is bigger than 1018224656 > 5
vxoBinaryGraph_SaveBinaryEntrance[14903]: failed to collect input and output of network
vxoBinaryGraph_SaveErrorHandle[8965]: failed to save NBG file, remove it, name=network_binary_pid-166964_tid-2098951744.nb
vxProcessGraph[15913]: Process Graph fail!
/media/data/home/leokuo/TIM-VX-1.1.34.fix/src/tim/vx/ops/activations_test.cc:56: Failure
Value of: graph->Run()
Actual: false
Expected: true
prev_ptrs = 0x3cbb2ec0
/media/data/home/leokuo/TIM-VX-1.1.34.fix/src/tim/vx/ops/activations_test.cc:59: Failure
Expected equality of these values:
golden
Which is: { -0.5, 1.9, 2, 2.55, inf }
output
Which is: { 0, 0, 0, 0, 0 }
prev_ptrs = 0x3cebfcc0
prev_ptrs = 0x3cbb2ec0
Exit VX Thread: 0x7ad621b0
[ FAILED ] Linear.shape_5_1_fp32 (7 ms)
[ RUN ] Linear.shape_5_1_fp32_omit_b
Created VX Thread: 0x7ad621b0
#productname=VIPNano-QI, pid=0x88
prev_ptrs = 0x3cbb2ec0
prev_ptrs = 0x3cebfcc0
prev_ptrs = 0x3cbb2ec0
Save binary graph for VIPLite.
network binary graph file has been opened
---------------------------Begin VerifyTiling -------------------------
AXI-SRAM = 1048576 Bytes VIP-SRAM = 522240 Bytes SWTILING_PHASE_FEATURES[0, 1, 1]
0 SH [( 5 1 1 1, 20, 0x0x3cbb0a90(0x0x3cbb0a90, 0x(nil)) -> 5 1 1 1, 20, 0x0x3cbb1280(0x0x3cbb1280, 0x(nil))) k(0 0 0, 0) pad(0 0) pool(0 0, 1 1)]
id IN [ x y w h ] OUT [ x y w h ] (tx, ty, kpc) (ic, kc, kc/ks, ks/eks, kernel_type)
0 SH DD 0x(nil) [ 0 0 0 0] -> DD 0x(nil) [ 0 0 0 0] ( 0, 0, 0) ( 0, 0, 0.000000%, 0.000000%, NONE)
PreLoadWeightBiases = 1048576 100.000000%
---------------------------End VerifyTiling -------------------------
KernelStreamSize: 0x0, statesSize: 0x340, shShareMemSize: 0x0, shIntrSize: 0x0, shParaSize: 0x100, swParaSize: 0x0, lcdTensorSize: 0x0, shaderStatesSize: 0x340, tensorStatic: 0x0
NBG: operationSize: 0x78, nnSize: 0x0, tpSize: 0x0, shSize: 0x4, swSize: 0x0, layerParamSize: 0x0, lcdtSize: 0x50, patchSize: 0x380, lcdSize 0x540
NBG: entranceSize: 0x1f0, nbIOSize: 0xe8, layeSize: 0x4c, sectionsSize: 0x474, inputoutput size: 0x0, InitCommands size: 0x540
NBG: lcdSize: 0x540, headerSize : 0x798
Calculate NBG size : 4888 bytes
vxoBinaryGraph_SaveBinaryEntrance[14907]: collect input count=1, output count=1
vxoBinaryGraph_SaveBinaryEntrance[14982]: total operation count=1
generate NBG, device count=1, core count per-device: 1,
input table address: 0x44fc1ec0
output table address: 0x44fbecc0
vxoBinaryGraph_SaveBinaryEntranceExt[14131]: graph input/output=1/1, refine input count=1, output count=1
NBG network name field : dummy_network_name
vxoBinaryGraph_SaveBinaryEntranceExt[14697]: header input count=1, output count=1
generate NGB, save initialize commands
generate NBG, map VIP-SRAM start address=0x400000
generate NBG, patch AXI-SRAM startAddress=0xff000000, endAddress=0xff100000
vxoBinaryGraph_SaveInitialOperation[10003]:fail to search AXI-SRAM address in init command buffer
Dump HEX data size 0x20
0801028A 00000011 08010E13 00000002 08010E21 00220000 3CBAAFB0 00000000
vxoBinaryGraph_SaveErrorHandle[8965]: failed to save NBG file, remove it, name=network_binary_pid-166964_tid-2098951744.nb
vxoBinaryGraph_SaveBinaryEntrance[15553]: failed to save initial operation
vxoBinaryGraph_SaveErrorHandle[8965]: failed to save NBG file, remove it, name=network_binary_pid-166964_tid-2098951744.nb
/media/data/home/leokuo/TIM-VX-1.1.34.fix/src/tim/vx/ops/activations_test.cc:86: Failure
Value of: graph->Compile()
Actual: false
Expected: true
---------------------------Begin VerifyTiling -------------------------
AXI-SRAM = 1048576 Bytes VIP-SRAM = 522240 Bytes SWTILING_PHASE_FEATURES[0, 1, 1]
0 SH [( 5 1 1 1, 20, 0x0x3cbb0a90(0x0x3cbb0a90, 0x(nil)) -> 5 1 1 1, 20, 0x0x3cbb1280(0x0x3cbb1280, 0x(nil))) k(0 0 0, 0) pad(0 0) pool(0 0, 1 1)]
id IN [ x y w h ] OUT [ x y w h ] (tx, ty, kpc) (ic, kc, kc/ks, ks/eks, kernel_type)
0 SH DD 0x(nil) [ 0 0 0 0] -> DD 0x(nil) [ 0 0 0 0] ( 0, 0, 0) ( 0, 0, 0.000000%, 0.000000%, NONE)
PreLoadWeightBiases = 1048576 100.000000%
---------------------------End VerifyTiling -------------------------
KernelStreamSize: 0x0, statesSize: 0x340, shShareMemSize: 0x0, shIntrSize: 0x0, shParaSize: 0x100, swParaSize: 0x0, lcdTensorSize: 0x0, shaderStatesSize: 0x340, tensorStatic: 0x0
NBG: operationSize: 0x78, nnSize: 0x0, tpSize: 0x0, shSize: 0x4, swSize: 0x0, layerParamSize: 0x0, lcdtSize: 0x50, patchSize: 0x380, lcdSize 0x540
NBG: entranceSize: 0x1f0, nbIOSize: 0xe8, layeSize: 0x4c, sectionsSize: 0x474, inputoutput size: 0x0, InitCommands size: 0x540
NBG: lcdSize: 0x540, headerSize : 0x798
Calculate NBG size : 4888 bytes
vxoBinaryGraph_CollectInputAndOutput[13820]: input node param count is bigger than 1018224656 > 5
vxoBinaryGraph_SaveBinaryEntrance[14903]: failed to collect input and output of network
vxoBinaryGraph_SaveErrorHandle[8965]: failed to save NBG file, remove it, name=network_binary_pid-166964_tid-2098951744.nb
vxProcessGraph[15913]: Process Graph fail!
/media/data/home/leokuo/TIM-VX-1.1.34.fix/src/tim/vx/ops/activations_test.cc:87: Failure
Value of: graph->Run()
Actual: false
Expected: true
prev_ptrs = 0x3cebfcc0
/media/data/home/leokuo/TIM-VX-1.1.34.fix/src/tim/vx/ops/activations_test.cc:90: Failure
Expected equality of these values:
golden
Which is: { -5, -0.2, 0, 1.1, inf }
output
Which is: { 0, 0, 0, 0, 0 }
prev_ptrs = 0x3cbb2ec0
prev_ptrs = 0x3cebfcc0
Exit VX Thread: 0x7ad621b0
[ FAILED ] Linear.shape_5_1_fp32_omit_b (9 ms)
[----------] 2 tests from Linear (16 ms total)
[----------] 2 tests from Gelu
[ RUN ] Gelu.shape_5_1_fp32_approximate
Created VX Thread: 0x7ad621b0
#productname=VIPNano-QI, pid=0x88
prev_ptrs = 0x3cebfcc0
prev_ptrs = 0x3cbb2ec0
prev_ptrs = 0x3cebfcc0
CopyArrayRange from ptr 0x3cf4f7f0 to 0x7fe7fe6a50 from 0 to 1024
CopyArrayRange from ptr 0x3cf2f460 to 0x7fe7fe5a50 from 0 to 1024
hardware doesn't support
W [_setup:243]Call vxTensorTableLookupLayer fail.
Kernel "com.vivantecorp.extension.cl.hard_gelu_F32toF32_2D" does not exist
Segmentation fault |
Did you set other env variable such as VIV_VX_ENABLE_SAVE_NETWORK_BINARY? |
@sunshinemyson No. export PYTHONPATH=/home/khadas/VeriSilicon-tvm/python:$PYTHONPATH
export LD_LIBRARY_PATH=/home/khadas/TIM-VX-1.1.34.fix/build/install:/home/khadas/VeriSilicon-tvm/build:$LD_LIBRARY_PATH
export VIVANTE_SDK_DIR=/home/khadas/TIM-VX-1.1.34.fix/build/aarch64_A311D_6.4.8 |
@sunshinemyson Thanks. Now, I can pass the unit_test. |
sdk is the root dir of our driver. should have following structure: |
@sunshinemyson [ 135.267564] npu_version: 2
[ 135.268371] galcore irq number is 36.
[ 135.268382] Galcore version 6.4.6.2
[ 627.735912] [galcore]: GPU[0] hang, automatic recovery.
[ 627.748042] ====>>>>npu hardware reset end!
[ 627.748196] [galcore]: recovery done
[ 689.175159] [galcore]: GPU[0] hang, automatic recovery.
[ 689.187353] ====>>>>npu hardware reset end!
[ 689.187525] [galcore]: recovery done
[ 750.615034] [galcore]: GPU[0] hang, automatic recovery.
[ 750.627147] ====>>>>npu hardware reset end!
[ 750.627311] [galcore]: recovery done
[ 812.054627] [galcore]: GPU[0] hang, automatic recovery.
[ 812.067103] ====>>>>npu hardware reset end!
[ 812.067282] [galcore]: recovery done
[ 873.493411] [galcore]: GPU[0] hang, automatic recovery.
[ 873.514460] ====>>>>npu hardware reset end!
[ 873.517882] [galcore]: recovery done
[ 934.932529] [galcore]: GPU[0] hang, automatic recovery.
[ 934.944641] ====>>>>npu hardware reset end!
[ 934.944811] [galcore]: recovery done
Then, I cannot rmmod galcore until rebooting. |
@leokuo725 Sorry that we can not give you suggestion about the issue in time. Please let me know if it still an issue? |
I also ran into this bug.
So what can I do then? Tengine says a "kernel version" of |
@gdh1995 You can get the galcore version from dmesg. |
@sunshinemyson |
The old galcore is
I've tried SDK from https://github.com/VeriSilicon/TIM-VX/releases/download/v1.1.37/aarch64_A311D_6.4.9.tgz and https://github.com/VeriSilicon/TIM-VX/releases/download/v1.1.37/aarch64_S905D3_6.4.9.tgz . No error messages changed. I also tried v6.4.8 (https://github.com/VeriSilicon/TIM-VX/releases/download/v1.1.34.fix/aarch64_A311D_6.4.8.tgz). The error message is:
|
Sorry it's a mistake of mine. I ran tengine's benchmark tool with a |
@sunshinemyson I saw the same error "PLS isn't existed" on my VIM3:
X86 Host
VIM3 Target ( export VIV_VX_DEBUG_LEVEL=1 )
TIM-VX Version is 1.1.37 Could you help me with it? |
@sunshinemyson
I tried the VSI NPU as TVM target, ran the test_operations.py in TVM_FOLDER/tests/python/contrib/test_vsi_npu.
It had error "PLS isn't existed" on VIM3 Pro side. I found the previous issue , I cannot solve the problem by setting "VSIMULATOR_CONFIG=VIPNANOQI_PID0X88".
The following is my environment:
Environment variable (Host)
Environment variable (VIM3 Pro)
export VIV_VX_DEBUG_LEVEL=1
Model: Khadas VIM3 Pro
SoC: Amlogic A311D with 5 TOPS Performance NPU
OS information:
Read More
NPU information:
Read More
TIM-VX Version:1.1.32
TVM Branch commit id: b822ec32702e2676dce1e430221e8efc05c98935
The output message after executing Unittest program of TIM-VX:
Read More
The output message after executing TVM test_operations.py at X86 Host side:
Read More
The output message after executing TVM test_operations.py at VIM3 Pro side:
Read More
Test Functions Passed in test_operations.py
Read More
Test Functions Failed in test_operations.py
Read More
If you need more debug messages, please let me know.
Thanks.
The text was updated successfully, but these errors were encountered: