Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Hermes missed node #56

Open
hxu65 opened this issue Mar 26, 2024 · 0 comments
Open

Hermes missed node #56

hxu65 opened this issue Mar 26, 2024 · 0 comments
Labels
bug Something isn't working

Comments

@hxu65
Copy link
Collaborator

hxu65 commented Mar 26, 2024

Error Code:

/tmp/hxu40/spack-stage/spack-stage-hermes-master-zgd7o3lrvr3zhvr32eqh2dlriqjyg22s/spack-
src/hrun/include/hrun/network/rpc.h:133 FATAL 31389 GetIpAddressFromNodeId Attempted to get from node 0, which is out of
the range 1-5

Jarvis pipeline information:

hermes_run with name hermes_run
    adapter_mode=default
    borg_min_cap=0.0
    borg_paths=['/mnt/hdd/hxu40/hermes_data', '/mnt/nvme/hxu40/hermes_data', '/home/hxu40/hermes_data', '/mnt/ssd/hxu40/hermes_data']
    data_shm=8g
    dbg_port=4000
    devices=[]
    do_dbg=False
    domain=None
    dpe=MinimizeIoTime
    dworkers=2
    exclude=[]
    flush_mode=async
    flush_period=5000
    hide_output=False
    include=[]
    oworkers=4
    oworkers_per_core=32
    page_size=1m
    pkg_type=hermes_run
    port=8080
    pqdepth=48
    provider=sockets
    qdepth=100000
    qlanes=4
    ram=0
    rdata_shm=8g
    recency_max=1.0
    reinit=False
    shm_name=hrun_shm_${USER}
    sleep=5
    stderr=None
    stdout=None
    task_shm=0g
    threads=32

The adios2 information

 adios2_gray_scott with name adios2_gray_scott
    Du=0.2
    Dv=0.1
    F=0.01
    L=32
    adios_memory_selection=False
    adios_span=False
    checkpoint=True
    checkpoint_freq=70
    checkpoint_output=ckpt.bp
    db_path=//mnt/nvme/hxu40/metadata.db
    dbg_port=4000
    do_dbg=False
    dt=2.0
    engine=hermes_derived
    full_run=True
    hide_output=False
    k=0.05
    limit=0
    mesh_type=image
    noise=0.01
    nprocs=80
    out_file=/mnt/nvme/hxu40/gs.bp
    pkg_type=adios2_gray_scott
    plotgap=10.0
    ppn=20
    reinit=False
    restart=False
    restart_input=ckpt.bp
    sleep=0
    stderr=None
    stdout=None
    steps=100
@JaimeCernuda JaimeCernuda added the bug Something isn't working label Mar 27, 2024
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
bug Something isn't working
Projects
None yet
Development

No branches or pull requests

2 participants