Improve arch/cpu detection/selection on ARM and AArch64 #18100

yuyichao · 2016-08-18T02:58:54Z

Allow cpu_target to specify a generic arch, matching the behavior on x86
Detect the CPU arch version with uname
Require armv6

Close Libunwind compilation error on ARM #13270 (ARMv5 is not supported)
Fix build failure on Raspberry Pi 3 #18042
Remove warning about generic arch since it's not really useful

Fix on Raspberry Pi (ARM), WARNING: unable to determine host cpu name. #17549
Require at least the same ARM arch version the C code is compiled with

vtjnash · 2016-08-18T03:02:24Z

src/codegen.cpp

+        // This is the most reliable way I can find
+        // `/proc/cpuinfo` changes between kernel versions
+        struct utsname name;
+        if (uname(&name) >= 0) {


should we suggest this to llvm too?

Potentially by including this in the feature detection function. The LLVM cpu/feature separation seems to be inconsistent with other targets since the cpu list doesn't include any generic target (other than the "generic" target itself). This is possibly related to the difficulty to determine the base instruction set. I'll probably open an LLVM issue asking about this.

https://llvm.org/bugs/show_bug.cgi?id=29030

ViralBShah · 2016-08-18T04:12:07Z

Trying to build this branch on scaleway:

    JULIA usr/lib/julia/inference.ji
terminate called after throwing an instance of 'std::regex_error'
  what():  regex_error
Aborted

yuyichao · 2016-08-18T11:22:12Z

The regex error is a gcc bug (IIUC it's regex support isn't complete until a later version, either 4.9 or 5.0) Should be fixed now since it's not using regex anymore.

yuyichao · 2016-08-18T11:51:31Z

What platform do we want to build the ARM binary for?

According to https://build.julialang.org/builders/package_tarballarm/builds/700/steps/make/logs/stdio it's currently using the CFLAGS/CXXFLAGS -march=armv7-a so the binary is actually armv7a only and after this PR the sysimg and JIT will also require armv7 on that setup (after all the libjulia code can't run on older hardware anyway).

Personally I only care about armv7+ (possibly armv7-r in additional to armv7-a) but RPI0 and RPI1 B+ are both armv6.

ViralBShah · 2016-08-18T12:02:01Z

I am ok with armv7. At least that is what all of us have access to test on. If there is a lot of demand for armv6, we can always do it later.

yuyichao · 2016-08-18T12:34:27Z

Fair enough. I updated the readme to mention that the ARM binary we provide is armv7-a only. Also adds auto detection of known A profile so that the arm buildbot should automatically build sysimg with A profile too.

* Allow `cpu_target` to specify a generic arch, matching the behavior on x86 * Detect the CPU arch version with `uname` * Require `armv6` Close #13270 (`armv5` is not supported) Fix #18042 * Remove warning about generic arch since it's not really useful Fix #17549 * Require at least the same ARM arch version and profile the C code is compiled with

ViralBShah · 2016-08-18T13:35:17Z

Still crashes for me in the building of the system image like before now, but hopefully llvm 3.9 will fix that.

vtjnash · 2016-08-18T15:40:32Z

src/codegen.cpp

+    //   armv8-m.base, armv8-m.main
+    //
+    // Supported AArch64 arch names on LLVM 3.8:
+    //   armv8.1a, armv8.2a


it's too bad llvm felt ProcDesc needed to be private: http://llvm.org/docs/doxygen/html/MCSubtargetInfo_8h_source.html#l00033

or we could have auto-generated this list for the user on-demand and provided useful help messages.

The list is auto-generated with llc and is just listed here for better reference. We also support generating this list with julia -C help.

OTOH, not having access to this list at runtime (other than letting LLVM printing a help message) mean that some of the logic above has to be hard coded instead of going through a fallback list and printing the cpu/feature not-recognized warning once instead of once every codegen....

I didn't realize that worked. In llvm 3.3, it also called exit, but now it continues, so I guess we could intentionally create a TargetMachine("help") for the user. As it is now, we print the help several times, then error (probably because it fell back to "generic" arch)

* Allow `cpu_target` to specify a generic arch, matching the behavior on x86 * Detect the CPU arch version with `uname` * Require `armv6` Close #13270 (`armv5` is not supported) Fix #18042 * Remove warning about generic arch since it's not really useful Fix #17549 * Require at least the same ARM arch version and profile the C code is compiled with (cherry picked from commit 760bc41) ref #18100

(cherry picked from commit 5977167) ref #18100

tkelman · 2016-08-23T22:21:13Z

Bisecting on release-0.5, the backport of this is causing segfaults during bootstrap repeatably. Not always at the exact same place, but early on:

    JULIA usr/lib/julia/inference.ji
essentials.jl

signal (11): Segmentation fault
while loading essentials.jl, in expression starting on line 190
Allocations: 24337 (Pool: 24337; Big: 0); GC: 1
Segmentation fault
make[1]: *** [/home/tkelman/julia-0.5/usr/lib/julia/inference.ji] Error 139
make: *** [julia-inference] Error 2
    JULIA usr/lib/julia/inference.ji
essentials.jl
generator.jl
reflection.jl
options.jl
promotion.jl
tuple.jl
range.jl
expr.jl
error.jl
bool.jl
number.jl
int.jl

signal (11): Segmentation fault
while loading int.jl, in expression starting on line 193
Allocations: 100302 (Pool: 100302; Big: 0); GC: 1
Segmentation fault
make[1]: *** [/home/tkelman/julia-0.5/usr/lib/julia/inference.ji] Error 139
make: *** [julia-inference] Error 2
    JULIA usr/lib/julia/inference.ji
essentials.jl
generator.jl
reflection.jl
options.jl
promotion.jl
tuple.jl
range.jl
expr.jl
error.jl
bool.jl
number.jl
int.jl
operators.jl
pointer.jl
abstractarray.jl
array.jl

signal (11): Segmentation fault
while loading array.jl, in expression starting on line 167
Allocations: 206520 (Pool: 206519; Big: 1); GC: 2
Segmentation fault
make[1]: *** [/home/tkelman/julia-0.5/usr/lib/julia/inference.ji] Error 139
make: *** [julia-inference] Error 2
    JULIA usr/lib/julia/inference.ji
essentials.jl

signal (11): Segmentation fault
while loading essentials.jl, in expression starting on line 190
Allocations: 24337 (Pool: 24337; Big: 0); GC: 1
Segmentation fault
make[1]: *** [/home/tkelman/julia-0.5/usr/lib/julia/inference.ji] Error 139
make: *** [julia-inference] Error 2
    JULIA usr/lib/julia/inference.ji
essentials.jl
generator.jl
reflection.jl
options.jl
promotion.jl
tuple.jl
range.jl
expr.jl
error.jl
bool.jl
number.jl
int.jl

signal (11): Segmentation fault
while loading int.jl, in expression starting on line 193
Allocations: 100302 (Pool: 100302; Big: 0); GC: 1
Segmentation fault
make[1]: *** [/home/tkelman/julia-0.5/usr/lib/julia/inference.ji] Error 139
make: *** [julia-inference] Error 2
    JULIA usr/lib/julia/inference.ji
essentials.jl
generator.jl
reflection.jl
options.jl
promotion.jl
tuple.jl
range.jl
expr.jl
error.jl
bool.jl
number.jl
int.jl
operators.jl
pointer.jl
abstractarray.jl
array.jl
hashing.jl
nofloat_hashing.jl
reduce.jl
intset.jl
dict.jl
iterator.jl
docs/core.jl
inference.jl

signal (11): Segmentation fault
while loading inference.jl, in expression starting on line 3524
Allocations: 340809 (Pool: 340805; Big: 4); GC: 3
Segmentation fault
make[1]: *** [/home/tkelman/julia-0.5/usr/lib/julia/inference.ji] Error 139
make: *** [julia-inference] Error 2
    JULIA usr/lib/julia/inference.ji
essentials.jl

signal (11): Segmentation fault
while loading essentials.jl, in expression starting on line 190
Allocations: 24337 (Pool: 24337; Big: 0); GC: 1
Segmentation fault
make[1]: *** [/home/tkelman/julia-0.5/usr/lib/julia/inference.ji] Error 139
make: *** [julia-inference] Error 2
    JULIA usr/lib/julia/inference.ji
essentials.jl
generator.jl
reflection.jl
options.jl
promotion.jl
tuple.jl
range.jl
expr.jl
error.jl
bool.jl
number.jl
int.jl

signal (11): Segmentation fault
while loading int.jl, in expression starting on line 193
Allocations: 100469 (Pool: 100469; Big: 0); GC: 1
Segmentation fault
make[1]: *** [/home/tkelman/julia-0.5/usr/lib/julia/inference.ji] Error 139
make: *** [julia-inference] Error 2
    JULIA usr/lib/julia/inference.ji
essentials.jl
generator.jl
reflection.jl
options.jl
promotion.jl
tuple.jl
range.jl
expr.jl
error.jl
bool.jl
number.jl
int.jl

signal (11): Segmentation fault
while loading int.jl, in expression starting on line 330
Allocations: 135198 (Pool: 135197; Big: 1); GC: 1
Segmentation fault
make[1]: *** [/home/tkelman/julia-0.5/usr/lib/julia/inference.ji] Error 139
make: *** [julia-inference] Error 2
    JULIA usr/lib/julia/inference.ji
essentials.jl
generator.jl
reflection.jl
options.jl
promotion.jl
tuple.jl
range.jl
expr.jl
error.jl
bool.jl
number.jl
int.jl

signal (11): Segmentation fault
while loading int.jl, in expression starting on line 193
Allocations: 100469 (Pool: 100469; Big: 0); GC: 1
Segmentation fault

yuyichao · 2016-08-23T22:55:26Z

This only affects the flags we pass to llvm so it's most likely LLVM bug.

Backtrace?

tkelman · 2016-08-24T02:53:22Z

https://gist.github.com/13a70c583f00a5d37272aa8650eef1a4

yuyichao · 2016-08-24T03:06:40Z

Turn on KEEP_BODIES (in options.h), use julia-debug if possible

Dump the llvm ir of the segfaulting function with p jl_dump_llvm_value(jl_function_ptr_by_llvm_name("<function_name_in_gdb>"))
Dump the asm with disassemble $pc
Dump the AST and the arguments of the function call with p jl_(meth), p jl_(jl_uncompress_ast(meth, meth->code)) , p jl_(args[<0 to 2>]) in the jl_call_method_internal frame.

Last time I saw this issue on scaleway somehow passed the wrong argument for a expression splicing.

tkelman · 2016-08-24T04:07:41Z

https://gist.github.com/e2b48ee04c3e72368980f7a29b60ff6a

tkelman · 2016-08-24T04:35:08Z

here's master, segfaults earlier https://gist.github.com/92a510dad180b42779e1e29677652d67

vtjnash · 2016-08-24T04:36:14Z

llvm must have relocated jl_get_ptls_states wrong? The IR looks a bit strange there. And it looks like we shouldn't have needed to emit the ptls either, but that's a separate issue

vtjnash · 2016-08-24T04:42:02Z

Oh wait, sorry. That's supposed to be the ssp

yuyichao · 2016-08-24T06:50:29Z

As Jameson said, it seems that LLVM uses the wrong address for __stack_chk_guard, what's __stack_chk_guard and __stack_chk_fail? Also, maybe check with release build sincethat shouldn't have the stack protection....

tkelman · 2016-08-24T07:01:39Z

release build of master https://gist.github.com/6a6b96e0cfdf50407810eb614e742ad1

vtjnash · 2016-08-24T18:34:46Z

In that one, it's clearly trying to load the GOT for jl_get_ptls_states from 0x81b300. Why isn't this a PIC value?

yuyichao · 2016-08-25T03:21:18Z

This really look like the relocation/code model issue we need to workaround on ARM by turning off fastisel.

Maybe you can check what flags are we passing to LLVM?

diff --git a/src/codegen.cpp b/src/codegen.cpp
index 2789963..07406e3 100644
--- a/src/codegen.cpp
+++ b/src/codegen.cpp
@@ -5732,12 +5732,16 @@ static inline SmallVector<std::string,10> getTargetFeatures(std::string &cpu)
     }
 #endif

+    jl_safe_printf("CPU: %s\nFeatures: ", cpu.c_str());
+
     SmallVector<std::string,10> attr;
     for (StringMap<bool>::const_iterator it = HostFeatures.begin(); it != HostFeatures.end(); it++) {
         std::string att = it->getValue() ? it->getKey().str() :
                           std::string("-") + it->getKey().str();
+        jl_safe_printf("%s, ", att.c_str());
         attr.append(1, att);
     }
+    jl_safe_printf("\n");
     return attr;
 }

tkelman · 2016-08-25T05:10:00Z

    LINK usr/bin/julia
    JULIA usr/lib/julia/inference.ji
CPU: generic
Features: v7, aclass, fp16, vfp2, vfp3,
essentials.jl

yuyichao · 2016-08-25T10:37:03Z

Hmmm, why is fp16 and vfp3 in there. Isn't this the generic build on the buildbot? The auto detection shouldn't put these two options in there unless -C native is used.

yuyichao · 2016-08-25T10:41:56Z

Also, after specifying -C armv6 maybe try using sth like

        if (att == "v7")
            continue;

in front of the jl_safe_printf("%s, ", att.c_str()); to see which option is causing the issue. Don't remove vfp2 and try not remove v6 unless none of the other ones have an effect.

tkelman · 2016-08-25T10:51:16Z

-C native is being used. I don't think we have generic build options working properly on the arm buildbot yet. We definitely aren't doing the right old distro, new gcc thing that we do on x86/amd64 linux.

tkelman · 2016-08-25T10:53:10Z

Ah my mistake, the above backtraces were from just a normal make, don't think I had anything in Make.user there. The buildbot is setting -C generic from JULIA_CPU_TARGET=generic MARCH=armv7-a on the command line.

ViralBShah · 2016-08-27T02:56:29Z

Is the MARCH setting messing up the scaleway arm?

tkelman · 2016-08-31T13:58:05Z

I'm going to revert this for now on release-0.5 until we figure out why it's causing problems.

ViralBShah · 2016-09-11T21:21:20Z

I set the CPU to generic, and tried disabling aclass and also v7, but that just put the crash elsewhere.

 cd /home/viral/julia/base && /home/viral/julia/usr/bin/julia-debug -C generic --output-ji /home/viral/julia/usr/lib/julia/inference.ji --startup-file=no -g0 -O0 coreimg.jl
CPU: generic
Features: aclass, v7, vfp2, 
essentials.jl
generator.jl
reflection.jl
options.jl
promotion.jl
tuple.jl
range.jl
expr.jl
error.jl
bool.jl
number.jl
int.jl
operators.jl

signal (11): Segmentation fault
while loading operators.jl, in expression starting on line 872
Allocations: 154334 (Pool: 154333; Big: 1); GC: 2
Segmentation fault (core dumped)
Makefile:215: recipe for target '/home/viral/julia/usr/lib/julia/inference.ji' failed
make[1]: *** [/home/viral/julia/usr/lib/julia/inference.ji] Error 139

ViralBShah · 2016-09-13T08:32:39Z

I successfully built on arm with 0.5-rc4 on scaleway, but can't build on master, which seems to be due to this PR - which @tkelman reverted on release-0.5.

ViralBShah · 2016-09-13T09:49:24Z

Both, master (which has this PR) and release-0.5 (which does not) fail to build on my Cortex A15 Chromebook. The failure is always in the system image step, but at different points.

ViralBShah · 2016-09-13T10:16:24Z

On the cortex A15, using JULIA_CPU_TARGET=native with release-0.5 is trouble, and using generic gets me further. If I force -O0 I can get a bit further.

* Allow `cpu_target` to specify a generic arch, matching the behavior on x86 * Detect the CPU arch version with `uname` * Require `armv6` Close #13270 (`armv5` is not supported) Fix #18042 * Remove warning about generic arch since it's not really useful Fix #17549 * Require at least the same ARM arch version and profile the C code is compiled with (cherry picked from commit d9f5334 from PR #18100)

* Allow `cpu_target` to specify a generic arch, matching the behavior on x86 * Detect the CPU arch version with `uname` * Require `armv6` Close #13270 (`armv5` is not supported) Fix #18042 * Remove warning about generic arch since it's not really useful Fix #17549 * Require at least the same ARM arch version and profile the C code is compiled with (cherry picked from commit 760bc41 and PR #18100)

(cherry picked from commit 5977167 and PR #18100)

vtjnash reviewed Aug 18, 2016
View reviewed changes

ViralBShah added the backport pending 0.5 label Aug 18, 2016

ViralBShah added the system:arm ARMv7 and AArch64 label Aug 18, 2016

yuyichao force-pushed the yyc/threads/arm branch 2 times, most recently from dc36640 to 6e84766 Compare August 18, 2016 06:38

yuyichao force-pushed the yyc/threads/arm branch 2 times, most recently from c646bbb to 32f54b1 Compare August 18, 2016 12:29

yuyichao force-pushed the yyc/threads/arm branch from 32f54b1 to 5873beb Compare August 18, 2016 12:42

yuyichao and others added 2 commits August 18, 2016 21:04

fix "recommanded" typo

5977167

yuyichao force-pushed the yyc/threads/arm branch from 5873beb to 5977167 Compare August 18, 2016 13:04

vtjnash reviewed Aug 18, 2016
View reviewed changes

vtjnash merged commit 04047f6 into master Aug 18, 2016

vtjnash deleted the yyc/threads/arm branch August 18, 2016 21:12

tkelman added a commit that referenced this pull request Aug 20, 2016

fix "recommanded" typo

22a73dd

(cherry picked from commit 5977167) ref #18100

tkelman removed the backport pending 0.5 label Aug 22, 2016

tkelman added the backport pending 0.5 label Aug 31, 2016

tkelman mentioned this pull request Sep 16, 2016

non-x86: ensure cgmemmgr caches are consistent #18516

Merged

staticfloat pushed a commit that referenced this pull request May 5, 2017

fix "recommanded" typo

9ea7534

(cherry picked from commit 5977167 and PR #18100)

yuyichao mentioned this pull request May 6, 2017

Provide build instructions for nVidia Jetson TX2 (AArch64) #21727

Merged

Improve arch/cpu detection/selection on ARM and AArch64 #18100

Improve arch/cpu detection/selection on ARM and AArch64 #18100

Conversation

yuyichao commented Aug 18, 2016

vtjnash Aug 18, 2016

Choose a reason for hiding this comment

yuyichao Aug 18, 2016

Choose a reason for hiding this comment

yuyichao Aug 18, 2016

Choose a reason for hiding this comment

ViralBShah commented Aug 18, 2016 • edited Loading

yuyichao commented Aug 18, 2016

yuyichao commented Aug 18, 2016

ViralBShah commented Aug 18, 2016

yuyichao commented Aug 18, 2016

ViralBShah commented Aug 18, 2016

vtjnash Aug 18, 2016

Choose a reason for hiding this comment

yuyichao Aug 18, 2016

Choose a reason for hiding this comment

yuyichao Aug 18, 2016

Choose a reason for hiding this comment

vtjnash Aug 18, 2016

Choose a reason for hiding this comment

tkelman commented Aug 23, 2016

yuyichao commented Aug 23, 2016

tkelman commented Aug 24, 2016

yuyichao commented Aug 24, 2016

tkelman commented Aug 24, 2016

tkelman commented Aug 24, 2016

vtjnash commented Aug 24, 2016

vtjnash commented Aug 24, 2016

yuyichao commented Aug 24, 2016

tkelman commented Aug 24, 2016

vtjnash commented Aug 24, 2016

yuyichao commented Aug 25, 2016

tkelman commented Aug 25, 2016

yuyichao commented Aug 25, 2016

yuyichao commented Aug 25, 2016

tkelman commented Aug 25, 2016

tkelman commented Aug 25, 2016 • edited Loading

ViralBShah commented Aug 27, 2016

tkelman commented Aug 31, 2016

ViralBShah commented Sep 11, 2016

ViralBShah commented Sep 13, 2016

ViralBShah commented Sep 13, 2016

ViralBShah commented Sep 13, 2016

ViralBShah commented Aug 18, 2016 •

edited

Loading

tkelman commented Aug 25, 2016 •

edited

Loading