Skip to content

[CI] aarch64-linux-gnu test job is systematically hanging #59546

@giordano

Description

@giordano

I haven't found when this started (2f5e1d0 is the earliest commit where I've seen this, but haven't looked further back), but aarch64-linux-gnu jobs are systematically hanging, for example: https://buildkite.com/julialang/julia-master/builds/50445#0199318a-6ad9-4f3d-b7a2-ad4e0191cb97/933-1634

errorshow                                        (8) |            has been running for 1h 0m           
channels                                        (10) |            has been running for 1h 0m           
subarray                                         (9) |            has been running for 1h 15m          
SparseArrays/cholmod                             (5) |             has been running for 45m            
read                                             (6) |            has been running for 1h 15m          
broadcast                                        (4) |            has been running for 1h 15m          
errorshow                                        (8) |            has been running for 1h 15m          
channels                                        (10) |            has been running for 1h 15m          
subarray                                         (9) |            has been running for 1h 30m          
SparseArrays/cholmod                             (5) |            has been running for 1h 0m           
read                                             (6) |            has been running for 1h 30m          
broadcast                                        (4) |            has been running for 1h 30m          

Testsets like read and errorshow should take few seconds, not well over one hour (this is on an x86_64-linux-gnu machine):

julia> Base.runtests(["read", "errorshow"])
Running parallel tests with:
  getpid() = 109433
  nworkers() = 2
  nthreads(:interactive) = 1
  nthreads(:default) = 1
  Sys.CPU_THREADS = 11
  Sys.total_memory() = 30.809 GiB
  Sys.free_memory() = 20.524 GiB

Test  (Worker) | Time (s) | GC (s) | GC % | Alloc (MB) | RSS (MB)
read       (2) |        started at 2025-09-12T20:00:30.647
errorshow  (3) |        started at 2025-09-12T20:00:30.728
      From worker 3:    Warning: detected a stack overflow; program state may be corrupted, so further execution might be unreliable.
      From worker 3:    Warning: detected a stack overflow; program state may be corrupted, so further execution might be unreliable.
errorshow  (3) |     9.71 |   0.24 |  2.5 |    1108.73 |   510.00
read       (2) |    16.57 |   0.74 |  4.5 |    3047.27 |  7885.12

Test Summary: | Pass  Total   Time
  Overall     | 7080   7080  19.1s

Metadata

Metadata

Assignees

No one assigned

    Labels

    ciContinuous integrationsystem:armARMv7 and AArch64

    Type

    No type

    Projects

    No projects

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions