WiFi Mesh unstable when parent offline (IDFGH-13875) #14720

michaelsimp · 2024-10-14T01:23:56Z

Answers checklist.

I have read the documentation ESP-IDF Programming Guide and the issue is not addressed there.
I have updated my IDF branch (master or release) to the latest version and checked that the issue is present there.
I have searched the issue tracker for a similar issue and not found a similar issue.

IDF version.

v5.3.0

Espressif SoC revision.

Chip is ESP32-S3 (QFN56) (revision v0.2)

Operating System used.

Windows

How did you build your project?

VS Code IDE

If you are using Windows, please specify command line type.

PowerShell

Development Kit.

ESP32-S3-WROOM-1

Power Supply used.

USB

What is the expected behavior?

I expect the ESP32 to continue running the application without crashing when the WiFi Mesh parent disappears.
If the MESH_ROOT is powered off, I expect a MESH_NODE to assume the role of MESH_ROOT.
If the WiFi router is powered off, I expect the mesh network to re-establish itself once the router is restored.

What is the actual behavior?

Sometimes these tests work perfectly: the Mesh network goes down, the nodes start scanning, and when I restore the WiFi router the Mesh network is re-established.
Sometimes the Mesh network goes down and can't recover. It doesn't crash, but it doesn't scan properly or re-establish the Mesh network.
Very regularly, the MESH_ROOT crashes when I power off the WiFi router, or a MESH_NODE crashes when I power off the MESH_ROOT. Both crashes are intermittent.

Steps to reproduce.

Power on the system, comprising 2 x ESP32-S3 dev boards and a WiFi router.
Connect a serial terminal (I am using PuTTY) to each serial port for monitoring.
Let the Mesh network get established and verify that the MESH_ROOT and MESH_NODE are connected.
Power off the WiFi router.
In the example (logs below) the MESH_ROOT crashed.

Debug Logs.

I (00:58:03.336) aWifiMesh: <MESH_EVENT_MESH_STARTED>ID:77:77:77:77:77:76
I (136546) mesh: <MESH_NWK_LOOK_FOR_NETWORK>need_scan:0x3, need_scan_router:0x0, look_for_nwk_count:1
I (00:58:03.336) aWifiMesh: This node MAC:48:ca:43:9b:53:d8
I (00:58:03.354) aWifiMesh: WiFi Mesh started successfully, heap:141084, root not fixed
WIN> I (140766) mesh: [S6]VONETS, 00:17:13:20:bd:74, channel:8, rssi:-12
I (140776) mesh: find router:[ssid_len:6]VONETS, rssi:-12, 00:17:13:20:bd:74(encrypted), new channel:8, old channel:0
I (140786) mesh: [FIND][ch:0]AP:11, otherID:0, MAP:1, idle:1, candidate:0, root:0[00:17:13:20:bd:74]router found
I (140796) mesh: [FIND:1]find a network, channel:8, cfg<channel:0, router:VONETS, 00:00:00:00:00:00>

I (00:58:07.590) aWifiMesh: <MESH_EVENT_FIND_NETWORK>new channel:8, router BSSID:00:00:00:00:00:00
W (140796) wifi:<MESH AP>adjust channel:1, secondary channel offset:1(40U)
W (140816) wifi:<MESH AP>adjust channel:8, secondary channel offset:1(40U)
I (141126) mesh: [SCAN][ch:8]AP:1, other(ID:0, RD:0), MAP:0, idle:0, candidate:1, root:0, topMAP:0[c:0,i:0][00:17:13:20:bd:74]router found<>
I (141126) mesh: 1330[SCAN]init rc[48:ca:43:9b:53:d9,-9], mine:0, voter:0
I (141136) mesh: 1368, vote myself, router rssi:-9 > voted rc_rssi:-120
I (141146) mesh: [SCAN:1/10]rc[128][48:ca:43:9b:53:d9,-9], self[48:ca:43:9b:53:d8,-9,reason:0,votes:1,idle][mine:1,voter:1(1.00)percent:1.00][128,1,48:ca:43:9b:53:d9]

I (141456) mesh: [SCAN][ch:8]AP:2, other(ID:0, RD:0), MAP:1, idle:1, candidate:1, root:0, topMAP:0[c:0,i:1][00:17:13:20:bd:74]router found<>
I (141466) mesh: [SCAN:2/10]rc[128][48:ca:43:9b:53:d9,-8], self[48:ca:43:9b:53:d8,-8,reason:0,votes:1,idle][mine:1,voter:2(0.50)percent:1.00][128,1,48:ca:43:9b:53:d9]

I (141776) mesh: [SCAN][ch:8]AP:2, other(ID:0, RD:0), MAP:1, idle:0, candidate:1, root:1, topMAP:0[c:0,i:0][00:17:13:20:bd:74]router found<>
I (141776) mesh: 7391[selection]try rssi_threshold:-78, backoff times:0, max:5<-78,-82,-85>
I (141796) mesh: [DONE]connect to parent:ESPM_3372B8, channel:8, rssi:-15, 30:30:f9:33:72:b9[layer:1, assoc:0], my_vote_num:0/voter_num:0, rc[00:00:00:00:00:00/-8/0]
I (141806) mesh: set router bssid:00:17:13:20:bd:74
I (142596) mesh: <MESH_NWK_MIE_CHANGE><><><><ROOT ADDR><><><>
I (142596) mesh: <MESH_NWK_ROOT_ADDR>from assoc, layer:2, root_addr:30:30:f9:33:72:b9, root_cap:1
I (142616) mesh: <MESH_NWK_ROOT_ADDR>idle, layer:2, root_addr:30:30:f9:33:72:b9, conflict_roots.num:0<>
I (00:58:09.409) aWifiMesh: <MESH_EVENT_ROOT_ADDRESS>root address:30:30:f9:33:72:b9
I (142616) mesh: [scan]new scanning time:600ms, beacon interval:300ms
I (142636) mesh: 2012<arm>parent monitor, my layer:2(cap:6)(node), interval:7286ms, retries:1<normal connected>
I (00:58:09.436) aWifiMesh: <MESH_EVENT_PARENT_CONNECTED>layer:1-->2, parent:30:30:f9:33:72:b9<layer2>, ID:77:77:77:77:77:76
I (00:58:09.451) mesh_netif: It was a wifi station removing stuff
Guru Meditation Error: Core  0 panic'ed (LoadProhibited). Exception was unhandled.

Core  0 register dump:
PC      : 0x4212753c  PS      : 0x00060030  A0      : 0x82127613  A1      : 0x3fcc1660
A2      : 0xffffffff  A3      : 0x00000000  A4      : 0xff000000  A5      : 0x00000001
A6      : 0x3fcc0a64  A7      : 0xff000000  A8      : 0x3c1505e4  A9      : 0x00000000
A10     : 0x3fcc0a64  A11     : 0x00000000  A12     : 0x00000101  A13     : 0x3c1505e4
A14     : 0x00000007  A15     : 0x3fcd8024  SAR     : 0x00000004  EXCCAUSE: 0x0000001c
EXCVADDR: 0xff00000c  LBEG    : 0x40056f5c  LEND    : 0x40056f72  LCOUNT  : 0xffffffff


Backtrace: 0x42127539:0x3fcc1660 0x42127610:0x3fcc16b0 0x4037e0aa:0x3fcc16d0

More Information.

My application integrates a number of IDF example programs including ip_internal_network
I went back to the example project ip_internal_network and built it unmodified, and can reproduce the same problems quite readily.

Also, for when the ESP32 nodes don't completely crash, I would like to know how to restart the Mesh network in software.
I have tried stopping the Mesh network with:
ESP_ERROR_CHECK(esp_mesh_stop());
ESP_ERROR_CHECK(esp_mesh_deinit());
ESP_ERROR_CHECK(mesh_netifs_destroy()); // I have tried with and without this line. Without it, the logs continually report:
I (135746) mesh: mesh is not started
E (00:58:02.547) mesh_netif: Received with err code 16388 ESP_ERR_MESH_NOT_START

I then try to restart the Mesh network with:
/* mesh initialization */
ESP_ERROR_CHECK(esp_mesh_init());
ESP_ERROR_CHECK(esp_mesh_set_max_layer(CONFIG_MESH_MAX_LAYER));
ESP_ERROR_CHECK(esp_mesh_set_vote_percentage(1));
ESP_ERROR_CHECK(esp_mesh_set_ap_assoc_expire(10));
/* set blocking time of esp_mesh_send() to 30s, to prevent the esp_mesh_send() from permanently for some reason */
ESP_ERROR_CHECK(esp_mesh_send_block_time(5000)); // was 30 seconds
mesh_cfg_t cfg = MESH_INIT_CONFIG_DEFAULT();
cfg.crypto_funcs = NULL;
/* mesh ID */
memcpy((uint8_t *) &cfg.mesh_id, MESH_ID, MAC_SIZE);
/* router */
cfg.channel = CONFIG_MESH_CHANNEL;

cfg.router.ssid_len = strlen(meshProvisionData.ssid);
memcpy((uint8_t *) &cfg.router.ssid, meshProvisionData.ssid, cfg.router.ssid_len);
memcpy((uint8_t *) &cfg.router.password, meshProvisionData.password, strlen(meshProvisionData.password));

ESP_ERROR_CHECK(esp_mesh_set_ap_authmode((wifi_auth_mode_t) CONFIG_MESH_AP_AUTHMODE));
cfg.mesh_ap.max_connection = CONFIG_MESH_AP_CONNECTIONS;
cfg.mesh_ap.nonmesh_max_connection = CONFIG_MESH_NON_MESH_AP_CONNECTIONS;
memcpy((uint8_t *) &cfg.mesh_ap.password, CONFIG_MESH_AP_PASSWD, strlen(CONFIG_MESH_AP_PASSWD));
ESP_ERROR_CHECK(esp_mesh_set_config(&cfg));
ESP_ERROR_CHECK(esp_mesh_start());

Doing the above while the system is running normally often causes the ESP32s to crash with various errors.
E.g. after start, the MESH_NODE does a scan and then crashes with Guru Meditation Error: Core 0 panic'ed:
I (00:22:02.752) aWifiMesh: <MESH_EVENT_FIND_NETWORK>new channel:8, router BSSID:00:00:00:00:00:00
W (1323864) wifi:adjust channel:1, secondary channel offset:1(40U)
W (1323874) wifi:adjust channel:8, secondary channel offset:1(40U)
I (1324184) mesh: [SCAN][ch:8]AP:2, other(ID:0, RD:0), MAP:1, idle:0, candidate:1, root:1, topMAP:0[c:0,i:0][00:17:13:20:bd:74]router found<>
I (1324184) mesh: 7391[selection]try rssi_threshold:-78, backoff times:0, max:5<-78,-82,-85>
I (1324204) mesh: [DONE]connect to parent:ESPM_3372B8, channel:8, rssi:-14, 30:30:f9:33:72:b9[layer:1, assoc:0], my_vote_num:0/voter_num:0, rc[00:00:00:00:00:00/-120/0]
I (1324214) mesh: set router bssid:00:17:13:20:bd:74
I (1324834) mesh: <MESH_NWK_MIE_CHANGE><><><><><><>
I (1324834) mesh: <MESH_NWK_ROOT_ADDR>from assoc, layer:2, root_addr:30:30:f9:33:72:b9, root_cap:1
I (1324844) mesh: <MESH_NWK_ROOT_ADDR>idle, layer:2, root_addr:30:30:f9:33:72:b9, conflict_roots.num:0<>
I (1324854) mesh: [scan]new scanning time:600ms, beacon interval:300ms
I (00:22:03.744) aWifiMesh: <MESH_EVENT_ROOT_ADDRESS>root address:30:30:f9:33:72:b9
I (1324854) mesh: 2012parent monitor, my layer:2(cap:6)(node), interval:4526ms, retries:1
I (00:22:03.771) aWifiMesh: <MESH_EVENT_PARENT_CONNECTED>layer:2-->2, parent:30:30:f9:33:72:b9, ID:77:77:77:77:77:76
I (00:22:03.785) mesh_netif: It was a wifi station removing stuff
Guru Meditation Error: Core 0 panic'ed (LoadProhibited). Exception was unhandled.

Core 0 register dump:
PC : 0x4212753c PS : 0x00060830 A0 : 0x82127613 A1 : 0x3fcc15d0
A2 : 0xffffffff A3 : 0x00000000 A4 : 0x00000278 A5 : 0x00000001
A6 : 0x3fcc09d0 A7 : 0x00000278 A8 : 0x3c1505e4 A9 : 0x3fcd778c
A10 : 0x3fcc09d0 A11 : 0x00000000 A12 : 0x00000101 A13 : 0x3c1505e4
A14 : 0x00000007 A15 : 0x3fcaa7f4 SAR : 0x00000004 EXCCAUSE: 0x0000001c
EXCVADDR: 0x00000284 LBEG : 0x40056f5c LEND : 0x40056f72 LCOUNT : 0xffffffff

Backtrace: 0x42127539:0x3fcc15d0 0x42127610:0x3fcc1620 0x4037e0aa:0x3fcc1640
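
A note on reading the two register dumps side by side: both panics report the same PC (0x4212753c) and the same backtrace addresses, and in both the faulting address EXCVADDR sits exactly 0xc above the value held in A4 (A7 holds the same value). That would be consistent with a load from a field at offset 12 of a structure reached through an invalid pointer, although which pointer that is cannot be told without decoding the backtrace against the .elf. The small host-side sketch below only checks that arithmetic; the struct and function names are made up for illustration and the values are copied from the dumps above.

```c
#include <stdint.h>
#include <stdio.h>

/* Register values copied from the two panic dumps in this report. */
typedef struct {
    const char *label;
    uint32_t a4;        /* A4 at the time of the panic */
    uint32_t excvaddr;  /* faulting address reported by the panic handler */
} crash_regs_t;

static const crash_regs_t crashes[] = {
    { "first dump",  0xff000000u, 0xff00000cu },
    { "second dump", 0x00000278u, 0x00000284u },
};

/* Distance from A4 to the faulting address (unsigned, wraps modulo 2^32). */
static uint32_t fault_offset(const crash_regs_t *r)
{
    return r->excvaddr - r->a4;
}

/* Prints one line per dump so the offsets can be compared by eye. */
static void print_fault_offsets(void)
{
    for (size_t i = 0; i < sizeof(crashes) / sizeof(crashes[0]); i++) {
        printf("%s: EXCVADDR - A4 = 0x%x\n",
               crashes[i].label, (unsigned)fault_offset(&crashes[i]));
    }
}
```

Calling print_fault_offsets() from a host program's main() prints the offset for each dump, which makes it easy to add further dumps to the table if more crashes are captured.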

I sometimes get MTX task stack overflows too when I try this, same as #13882
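
One detail in the restart snippet above that may be worth ruling out: the router SSID and password are copied with lengths taken from strlen() and no check against the size of the destination fields. This is not known to be the cause of the crashes reported here. The sketch below shows a bounds-checked copy that can be compiled and tried on a host; router_cfg_sketch_t and set_router_credentials() are hypothetical names, and the 32-byte SSID and 64-byte password sizes are assumptions standing in for the router fields of mesh_cfg_t.

```c
#include <stddef.h>
#include <stdint.h>
#include <string.h>

/* Hypothetical stand-in for the router part of the mesh config.
 * Field sizes are assumptions made for this sketch. */
typedef struct {
    uint8_t ssid[32];
    uint8_t ssid_len;
    uint8_t password[64];
} router_cfg_sketch_t;

/* Copies ssid and password into cfg only if both fit.
 * Returns 0 on success, -1 on a NULL argument, an empty SSID, or an
 * over-long string. cfg is left untouched on failure. */
static int set_router_credentials(router_cfg_sketch_t *cfg,
                                  const char *ssid, const char *password)
{
    if (cfg == NULL || ssid == NULL || password == NULL) {
        return -1;
    }
    size_t ssid_len = strlen(ssid);
    size_t pass_len = strlen(password);
    /* The SSID may fill its field completely because its length is stored
     * separately; the password must leave room for a terminating zero. */
    if (ssid_len == 0 || ssid_len > sizeof(cfg->ssid) ||
        pass_len >= sizeof(cfg->password)) {
        return -1;
    }
    memset(cfg, 0, sizeof(*cfg));
    memcpy(cfg->ssid, ssid, ssid_len);
    cfg->ssid_len = (uint8_t)ssid_len;
    memcpy(cfg->password, password, pass_len);
    return 0;
}
```

In the restart code the same checks could be applied to meshProvisionData.ssid and meshProvisionData.password before the memcpy calls, returning an error instead of starting the mesh if either string is too long.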


zhangyanjiaoesp · 2024-10-22T09:56:57Z

@michaelsimp
I have tested using the ip_internal_network example, but I couldn't reproduce your problem. Can you provide the .elf file from when the crash happens? Or can you provide the core dump decode file?

michaelsimp · 2024-10-24T04:15:21Z

Hi

Thanks for the response. I have a deadline for a project demo this week, but I will go back, reinstall from scratch, rebuild, test, and send you the files requested.

One thing I note is that I originally installed IDF into VS Code when 5.2.2 was the current build. I have since upgraded to ver 5.3.0, selected it, and did a clean build on my project. Now that I have gone back to "Show Examples" to recreate the ip_internal_network project from scratch, it asks which version of the IDF I want to use but only lists ver 5.2.2. Only once I have created the project can I change the version of IDF to 5.3.0.

Using WinMerge I compared the directory structure and files from my target folder containing ip_internal_network from ver 5.2.2 with the ...\esp\v5.3\esp-idf\examples\mesh\ip_internal_network folder, and the only change was in the partition.csv, which now must be aligned.

I also note there is now a version 5.3.1 marked as stable, and 5.3.0 has gone? I also found migration notes for going from 5.3 to 5.4, although I can't find this in the installer. Is 5.4 the master (development) branch? Which version would you recommend I use? I have found everything in IDF to be stable and bug free except for the WiFi Mesh.

Could you advise how I update IDF properly so that it lets me select the latest version of IDF when I create a project from "Show Examples"? Last time I followed the instructions from Visual Studio at: https://marketplace.visualstudio.com/items?itemName=espressif.esp-idf-extension#:~:text=In%20Visual%20Studio%20Code%2C%20select,not%20supported%20inside%20configured%20paths

Regards
Michael

[Attached image: HUQ061jBK81kEjpj.png]

michaelsimp · 2024-10-24T21:15:24Z

Hi
I did a quick set of tests today.

I created the ip_internal_network project from examples and configured as follows:
Set IDF version to 5.3.0
Set target device to ESP32-S3 with jtag integrated debugger
Set partition table Factory partition to 0x400000
Set device flash size to 16MB - matches my ESP32-S3
Set the Router SSID to "VONETS" and Password to "pass9999"
Set Panic Handler to "Print registers and halt"

Clean and Build project.
Load into 2 ESP32-S3 with serial terminals connected for monitoring.
One becomes MESH_ROOT and the other connects as MESH_NODE
Took turns at powering off the MESH_ROOT and watching the other become MESH_ROOT and then powering it back on and it connects as a MESH_NODE. This seemed to work ok today.

But what I did find easy to reproduce was:
Power off MESH_ROOT and power back on BEFORE other MESH_NODE becomes MESH_ROOT.
The original MESH_ROOT I power cycled, becomes MESH_ROOT again, but the MESH_NODE remains disconnected.

See attached files:
MESH_ROOT powered off at line 171
MESH_NODE loses connection around line 33 and never recovers

Also see the .elf and .bin files in the attachment. I don't know what or where the "Core dump decode file" is, but these tests don't show a CPU crash. I can't run under the debugger because of the power-cycle tests.

Please see attachment MESH-Testing.zip two comments down

michaelsimp · 2024-10-24T21:56:34Z

I did some more tests which can easily cause CPU crashes.

Power on both nodes, one becomes MESH_ROOT and one MESH_NODE.
Power off WIFI router.
MESH_ROOT crashes. See file attached MESH_ROOT "Router power off.txt" (crash at end)

Second test. Power up only one node, becomes MESH_ROOT
Power off Router
This time MESH_ROOT does not crash
Power on Router
MESH_ROOT does not crash
Power on a second node which connects to the first MESH_ROOT - see line 492
MESH_ROOT crashes.
See file attached "MESH_ROOT crash on MESH_NODE connect.txt"
MESH_ Testing 2.zip

michaelsimp · 2024-10-24T22:02:15Z

MESH-Testing.zip
This is the attachment for the first tests, 2 entries up. It did not upload properly before.

brianignacio5 · 2024-10-29T08:43:53Z

Hi @michaelsimp

The esp-idf vscode extension allows you to save settings in multiple places: User (global settings for vscode), Workspace, and Workspace folder (your project's .vscode/settings.json). The ESP-IDF: Show Examples command shows you the esp-idf path used in the current vscode window. You can change where settings are saved with the ESP-IDF: Select where to save configuration settings command. It sounds confusing, but it does allow you to use multiple projects, each with a different esp-idf version (even at the same time, using a vscode workspace). More information in here

It seems the example you are trying to use has some components with version-specific behavior. So building a v5.2.2 example using esp-idf v5.3 might produce some compilation problems. How about creating an example using esp-idf v5.3?

Open a vscode window.
Select esp-idf v5.3 from status bar (recommended) or the ESP-IDF: Configure ESP-IDF extension.
Run the ESP-IDF: Doctor command. Check that esp-idf is indeed using v5.3
Run the ESP-IDF: Show examples. The esp-idf path shown should be v5.3 now
Create your project from esp-idf example and try to build.

We will try to update the Show examples command to show all available esp-idf versions from esp-idf vscode extension to make this easier.

zhangyanjiaoesp · 2024-10-29T08:58:18Z

@michaelsimp The backtrace of the crash issue is here:

xtensa-esp32s3-elf-addr2line -piaf 0x4208c922:0x3fca7e60 0x4201c951:0x3fca7ee0 0x4201f96f:0x3fca7f20 0x4202345e:0x3fca7f50 0x420167bd:0x3fca7f70 -e ip_internal_network.elf

0x4208c922: parse_msg at C:\Users\micha\Documents\Cybertek\Software\ip_internal_network\build/C:/Users/micha/esp/v5.3/esp-idf/components/lwip/apps/dhcpserver/dhcpserver.c:993
 (inlined by) handle_dhcp at C:\Users\micha\Documents\Cybertek\Software\ip_internal_network\build/C:/Users/micha/esp/v5.3/esp-idf/components/lwip/apps/dhcpserver/dhcpserver.c:1190
 (inlined by) handle_dhcp at C:\Users\micha\Documents\Cybertek\Software\ip_internal_network\build/C:/Users/micha/esp/v5.3/esp-idf/components/lwip/apps/dhcpserver/dhcpserver.c:1106
0x4201c951: udp_input at C:\Users\micha\Documents\Cybertek\Software\ip_internal_network\build/C:/Users/micha/esp/v5.3/esp-idf/components/lwip/lwip/src/core/udp.c:404
0x4201f96f: ip4_input at C:\Users\micha\Documents\Cybertek\Software\ip_internal_network\build/C:/Users/micha/esp/v5.3/esp-idf/components/lwip/lwip/src/core/ipv4/ip4.c:746
0x4202345e: ethernet_input at C:\Users\micha\Documents\Cybertek\Software\ip_internal_network\build/C:/Users/micha/esp/v5.3/esp-idf/components/lwip/lwip/src/netif/ethernet.c:186
0x420167bd: tcpip_thread_handle_msg at C:\Users\micha\Documents\Cybertek\Software\ip_internal_network\build/C:/Users/micha/esp/v5.3/esp-idf/components/lwip/lwip/src/api/tcpip.c:174
 (inlined by) tcpip_thread at C:\Users\micha\Documents\Cybertek\Software\ip_internal_network\build/C:/Users/micha/esp/v5.3/esp-idf/components/lwip/lwip/src/api/tcpip.c:148

I think this issue is caused by the mismatch between your IDF version and the version in the example. Please update your version according to Brian's suggestion and test it again.

michaelsimp · 2024-10-30T00:35:35Z

I installed IDF version 5.3.1
I had a problem at step 2 of your instructions two entries up, which states: "Select esp-idf v5.3 from status bar (recommended) or the ESP-IDF: Configure ESP-IDF extension."
When I try, it reports "Open a folder first."
So I opened a folder of a project I made in version IDF 3.0. Then on the status bar I could select version 5.3.1.
When I run the ESP-IDF: Doctor command, I get the following errors:

Extension configuration report has been copied to the clipboard with errors.
Cannot open file ../report.txt. Detail: Files above 50MB cannot be synchronized with extensions.
I checked my report.txt and found it was over 181MB
I tried continuing:
When I select Show examples it only shows 5.3.1 which is good.
I created ip_internal_network, but when completed the status bar reports ESP-IDF v5.2.2 again.
So I tried deleting the large report.txt and trying again.
Same problem, it created a report.txt of 181MB again

brianignacio5 · 2024-10-30T01:10:40Z

Delete this file:

%USERPROFILE%\.vscode\extensions\espressif.esp-idf-extension-VERSION\esp_idf_vsc_ext.log

and try to run the ESP-IDF: Doctor command again. It seems that your extension log has grown very large and exceeded the vscode limit.

As for the status bar showing ESP-IDF v5.2.2 again, that is because a newly created project does not store the version setting when it is created. You can select v5.3.0 from the status bar again.

Again sorry for this issue, will work to make it easier to use in the next release of esp-idf extension.

michaelsimp · 2024-10-30T01:48:55Z

I need to make some real progress on this so I have completely uninstalled esp-idf and manually deleted all the ESP and espressif folders including all 3 IDF versions.
I have reinstalled ESP-IDF and only IDF version 5.3.1 to remove all doubt.
I will rebuild and test and report
Thanks

michaelsimp · 2024-10-30T02:17:28Z

Hi again
It's hard to tell, but it seems as if it might be a little more robust, especially with the router power off and on test.
Attachments.zip
But it still crashes, see files attached including my .elf

"Fail 1.txt" is taken from the Mesh_Root
Line 1458 Mesh_Node disconnected
Line 1495 Mesh_Node reconnects
Line 1496 crash

"Fails 2.txt" is taken from the Mesh_Node
Line 1789 Mesh_Root disconnected
Line 1822 Reconnect
Line 1832 Crash divide by zero

zhangyanjiaoesp · 2024-10-30T03:55:45Z

It's weird, I have tested it multiple times as you said (the following two cases) and it can connect normally without any crashing issues.

Power on both nodes, one becomes MESH_ROOT and one MESH_NODE. Power off WIFI router. MESH_ROOT crashes. See file attached MESH_ROOT "Router power off.txt" (crash at end)

Second test. Power up only one node, becomes MESH_ROOT Power off Router This time MESH_ROOT does not crash Power on Router MESH_ROOT does not crash Power on a second node which connects to the first MESH_ROOT - see line 492 MESH_ROOT crashes.

I'm using the Github IDF, and I will try to test with the vscode extension

michaelsimp · 2024-10-30T05:06:50Z

FYI I use vscode for coding and building, and sometimes for JTAG debugging.
Most of the time in testing I am monitoring the serial COM ports using PuTTY terminals.
Thanks

zhangyanjiaoesp · 2024-10-30T08:28:38Z

@michaelsimp

Are you using the completely unmodified example code during your testing?

michaelsimp · 2024-10-30T19:38:58Z

Yes. I did not change anything except:

set target device to ESP32-S3 - Internal JTAG debug - What device are you testing with? Could this be a factor?
change partition table, factory partition size to 0x400000
Using vscode GUI menuconfig:
- change device flash size to 16MB
- set WiFi router SSID to VONETS and password to "pass9999"

See my source files attached where you can see most are untouched with the original install date 30/10/24 02:13pm NZ time.
Source.zip

michaelsimp · 2024-10-30T21:17:06Z

FYI I use vscode for coding and building, and sometimes for JTAG debugging.
Most of the time in testing I am monitoring the serial COM ports using PuTTY terminals.

In addition to the crashing, sometimes a MESH_NODE will not reconnect to the MESH network. As a workaround, I want to stop and restart the WiFi mesh network, but no matter what I try, my nodes intermittently crash on restart. Is this perhaps related?

I am hoping you could please answer a few questions to help me.

What are the recommended steps to stop and then restart the WiFi Mesh network? I currently have :

    ESP_ERROR_CHECK(esp_mesh_stop());
    ESP_ERROR_CHECK(esp_mesh_deinit());

but this causes a lot of error logging which stops if I add ...
ESP_ERROR_CHECK(mesh_netifs_destroy());

My restart is as follow:

void wifiMeshStart() {
    ESP_LOGW(TAG, "Wifi Mesh switch on");
    /*  mesh initialization */
    ESP_ERROR_CHECK(esp_mesh_init());
    ESP_ERROR_CHECK(esp_mesh_set_max_layer(CONFIG_MESH_MAX_LAYER));
    ESP_ERROR_CHECK(esp_mesh_set_vote_percentage(1));
    ESP_ERROR_CHECK(esp_mesh_set_ap_assoc_expire(10));
    /* set blocking time of esp_mesh_send() to 30s, to prevent esp_mesh_send() from blocking permanently for some reason */
    ESP_ERROR_CHECK(esp_mesh_send_block_time(30000));
    mesh_cfg_t cfg = MESH_INIT_CONFIG_DEFAULT();
#if !MESH_IE_ENCRYPTED
    cfg.crypto_funcs = NULL;
#endif
    /* mesh ID */
    memcpy((uint8_t *) &cfg.mesh_id, MESH_ID, MAC_SIZE);
    /* router */
    cfg.channel = CONFIG_MESH_CHANNEL;

    cfg.router.ssid_len = strlen(meshProvisionData.ssid);
    memcpy((uint8_t *) &cfg.router.ssid, meshProvisionData.ssid, cfg.router.ssid_len);
    memcpy((uint8_t *) &cfg.router.password, meshProvisionData.password, strlen(meshProvisionData.password));
    
    ESP_ERROR_CHECK(esp_mesh_set_ap_authmode((wifi_auth_mode_t) CONFIG_MESH_AP_AUTHMODE));
    cfg.mesh_ap.max_connection = CONFIG_MESH_AP_CONNECTIONS;
    cfg.mesh_ap.nonmesh_max_connection = CONFIG_MESH_NON_MESH_AP_CONNECTIONS;
    memcpy((uint8_t *) &cfg.mesh_ap.password, CONFIG_MESH_AP_PASSWD, strlen(CONFIG_MESH_AP_PASSWD));
    ESP_ERROR_CHECK(esp_mesh_set_config(&cfg));
    /* mesh start */
    ESP_ERROR_CHECK(esp_mesh_start());
    ESP_LOGI(TAG, "WiFi Mesh started successfully");
}

I notice with my custom application:
I have 1 MESH_ROOT and 5 MESH_NODEs spread around the office. Mesh_Node seem to connect to parents not based on the RSSI as documented.

The IDF documentation states "To prevent nodes from forming a weak upstream connection, ESP-WIFI-MESH implements an RSSI threshold mechanism for beacon frames." Is this configurable and if so where? I can't find it in the API or in MenuConfig. What is the default RSSI threshold value?

The IDF documentation states in Preferred Parent Node "The preferred parent node is determined based on the following criteria: Which layer the parent node candidate is situated on. The number of downstream connections (child nodes) the parent node candidate currently has".
Does this mean RSSI is not part of the parent selection process?

Is it recommended to use self-organized networking, or for serious applications should I manually build the mesh network? I will only have a max of 10 mesh nodes altogether but they do a reasonable amount of MQTT5 communications to the cloud.

zhangyanjiaoesp · 2024-10-31T02:43:12Z

@michaelsimp The following are the answers for your questions:

> What are the recommended steps to stop and then restart the WiFi Mesh network? I currently have:
```
    ESP_ERROR_CHECK(esp_mesh_stop());
    ESP_ERROR_CHECK(esp_mesh_deinit());
```
Calling esp_mesh_stop() is enough.

> but this causes a lot of error logging which stops if I add ...
> ESP_ERROR_CHECK(mesh_netifs_destroy());

Where did you add the mesh_netifs_destroy() call? What does the error log look like? Can you provide an example?
Where did you call the wifiMeshStart() function?

> I have 1 MESH_ROOT and 5 MESH_NODEs spread around the office. Mesh_Node seem to connect to parents not based on the RSSI as documented.

RSSI is not the only criterion for selecting the parent node; the layer and the number of connections are also considered.

> The IDF documentation states "To prevent nodes from forming a weak upstream connection, ESP-WIFI-MESH implements an RSSI threshold mechanism for beacon frames." Is this configurable and if so where? I can't find it in the API or in MenuConfig. What is the default RSSI threshold value?

You can call this API:

esp-idf/components/esp_wifi/include/esp_mesh_internal.h

Line 216 in 9106c43

esp_err_t esp_mesh_set_rssi_threshold(const mesh_rssi_threshold_t *threshold);

> The IDF documentation states in Preferred Parent Node "The preferred parent node is determined based on the following criteria: Which layer the parent node candidate is situated on. The number of downstream connections (child nodes) the parent node candidate currently has". Does this mean RSSI is not part of the parent selection process?

As with the fourth point, parent selection considers RSSI, layer, and connections; the documentation needs to be updated.

> Is it recommended to use self-organized networking, or for serious applications should I manually build the mesh network? I will only have a max of 10 mesh nodes altogether but they do a reasonable amount of MQTT5 communications to the cloud.

You can use self-organized network.

zhangyanjiaoesp · 2024-10-31T03:21:39Z

@michaelsimp I can reproduce the crash using the vscode, I will check the difference between VSCode and standard IDF

michaelsimp · 2024-10-31T04:36:49Z

Hi Zhangyanjiaoesp
This is excellent news for me. Hopefully it is just something simple you will find soon and be able to offer me a fix.
Thank you

michaelsimp · 2024-10-31T05:26:15Z

Hi Zhangyanjiaoesp

Thanks so much for taking the time to answer all my questions.

Please note THESE tests are with MY application (NOT with example program ip_internal_network) running on a network of 6 nodes - all ESP32-S3. My application has a CLI console integrated so I can trigger actions and see the responses on the COM port.

Q1 I will go back to just calling esp_mesh_stop() and see what happens.
To restart, should I just be able to call ESP_ERROR_CHECK(esp_mesh_start()); ?

Q2 Triggered from the CLI Console I was calling:

    ESP_ERROR_CHECK(esp_mesh_stop());
    ESP_ERROR_CHECK(esp_mesh_deinit());
    ESP_ERROR_CHECK(mesh_netifs_destroy());

Q3 wifiMeshStart() is also called from my CLI console

The CLI Console is started from my Mainline as is my WiFi Mesh application (built on top of the ip_internal_network source).
Triggering the Mesh Stop and Start would be called from the CLI Console thread. I am assuming this is ok and does not need any mutex protection. Please advise how I should call it if this is a problem.

Q4 I understand this

Q5 Thanks

Q6 Thanks for clarifying this but I am not finding this to be the case. I distributed some MESH_Nodes across the office with the aim of creating a multi-hop network between the far extremes. But it does not form as expected, or at all well for healthy RSSI. I have nodes which are close to my Mesh_Root or to 2nd layer Mesh_Nodes which are not at parent capacity. When my other Mesh_Nodes do connect to these parents they report an RSSI on the child to the parent of -35dBm. But they most frequently want to connect to nodes a much longer distance away, getting an RSSI of < -70dBm.

I read somewhere the default RSSI threshold is -120dBm, but I am finding nodes with RSSIs < -70dBm often lose their MQTT connection to the broker. I have an office environment and I have located the nodes approx 10 to 20 meters apart with a max of one wall between, but they are not all line of sight. I very much doubt I could even get a connection at an RSSI less than -100dBm. I am thinking it may be a signal to noise ratio issue, so I have scanned the office for WiFi channel usage and selected channel 1 on my WIFI Router as nothing else is using this channel and no other channels overlap. I know this is not an easy question to answer with precision and I appreciate the many influences, but realistically, what is the ballpark minimum RSSI at which I can expect a node to work reliably, and at what sort of distance?

Q7 Because of my Q6 response above, I have started evaluating the example project "manual_networking" to make my MESH_NODES manually scan and select MESH_NODES with the healthiest RSSI. It sounds like you are saying the IDF framework should already be doing this?
So I am now wondering if the vscode crashing issue is also causing this to not work properly and your fix might fix both.
Should I put manual scanning and parent selection changes to one side and wait for the outcome of the vscode crashing?
I guess I would prefer to use the self-organized network as much as possible, if it works as you describe.
Please advise / confirm manual scanning and parent selection should not be required and I might need you to look at the node parent selection for healthy RSSI next.

Best regards

zhangyanjiaoesp · 2024-10-31T06:41:32Z

@michaelsimp

According to the backtrace of the crash issue, it seems related to DHCP, it won't affect the mesh networking.
I'm sorry I didn't quite understand your question regarding the selection of the parent node. Can you draw a picture to explain it? For example, where are nodes A, B, and C located? What level? How many child nodes are connected below? What is the RSSI of A, B, and C scanning each other? Do you expect A to connect to B but actually connect A to C?

michaelsimp · 2024-10-31T07:34:50Z

See attached:
MeshMaps.zip

"Target Network .png" shows the walls as black lines and nodes as circles. The blue lines are approx what I was expecting to see.

It forms very randomly but with bad choices like the file "Actual example Network.png" with links and dBm in red.

My project is configured for up to 50 nodes and 3 children per node as I wanted to force some layers.

michaelsimp · 2024-10-31T08:50:14Z

"According to the backtrace of the crash issue, it seems related to DHCP, it won't affect the mesh networking."

That may be the case with the crash issue you found, but the root cause of the vscode IDF environment may cause more than one issue. I guess you will know better when you get to the bottom of the vscode IDF environment issue

Do the diagrams I sent help you understand my issue better now?

zhangyanjiaoesp · 2024-10-31T10:04:38Z

Yes, I now understand your question. Once the ROOT node is formed, the chances for other idle nodes to connect to the root node are the same; as long as they can scan the root node within the RSSI threshold range, they can connect and become second-layer nodes. Therefore, it is reasonable for C and D to connect to A and become second-layer nodes. What is the RSSI that B, E, and F receive from A? Since each node can connect to 3 child nodes, if they are within the RSSI threshold range and root A is not yet fully connected, at least one of B, E, or F should be able to connect to A.

I think you can call esp_mesh_set_rssi_threshold() to limit the RSSI threshold for optional parents, which would allow nearby nodes to connect as much as possible. However, nodes D and E are too far from node A. If you set the same RSSI threshold for all nodes, the connection results might still not meet your expectations, unless you configure different RSSI thresholds for each node. Alternatively, you could only call the esp_mesh_set_parent() function to specify the parent of each node.
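
To make the trade-off above concrete, here is a minimal sketch of choosing a parent from scan results when RSSI, layer, and child count are all considered. It is only an illustration: the `parent_candidate_t` struct, the `pick_parent()` helper, and the tie-break order (lower layer, then fewer children, then stronger RSSI) are assumptions made for this example, not the selection logic used inside the mesh stack.

```c
#include <assert.h>
#include <stddef.h>

/* Hypothetical stand-in for one scan result; not an ESP-IDF type. */
typedef struct {
    int rssi;          /* dBm as seen by the scanning node */
    int layer;         /* candidate's layer, 1 = root */
    int child_count;   /* children the candidate already has */
    int max_children;  /* candidate's capacity */
} parent_candidate_t;

/* Returns the index of the preferred candidate, or -1 if none qualifies.
 * Candidates weaker than rssi_threshold or already full are skipped. */
int pick_parent(const parent_candidate_t *c, size_t n, int rssi_threshold)
{
    assert(c != NULL || n == 0);
    int best = -1;
    for (size_t i = 0; i < n; i++) {
        if (c[i].rssi < rssi_threshold) {
            continue;   /* link too weak for this node */
        }
        if (c[i].child_count >= c[i].max_children) {
            continue;   /* candidate cannot accept another child */
        }
        if (best < 0) {
            best = (int)i;
            continue;
        }
        const parent_candidate_t *b = &c[best];
        int better_layer = c[i].layer < b->layer;
        int same_layer = c[i].layer == b->layer;
        int fewer_children = c[i].child_count < b->child_count;
        int same_children = c[i].child_count == b->child_count;
        if (better_layer ||
            (same_layer && fewer_children) ||
            (same_layer && same_children && c[i].rssi > b->rssi)) {
            best = (int)i;
        }
    }
    return best;
}
```

With a per-node threshold of -60 dBm, a distant root at -75 dBm is filtered out and a nearby layer 2 node is chosen; loosening the threshold to -80 dBm lets the same node pick the root again, which is the behavior described above.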

michaelsimp · 2024-11-20T06:35:42Z

Hi

In response to your last post:

At present I only consider the RSSI value.
The test in findClosestParent() does consider nodes already at capacity number of children, and will not try to switch to them.

My thoughts were, I am not wanting to build the mesh network from scratch as I start with a self configured network. I am only planning to make changes to nodes with poor RSSIs. So far my tests have been successful network architecture wise (when I have a fixed ROOT so I don't get the broken mesh problem).

I appreciate what you are saying and will certainly do more testing and add more intelligence into the parent selection if necessary. I already send all my node network attributes to the MESH_ROOT where I have a table of all nodes and their parent, children, layers and RSSI. I could broadcast this to all nodes if necessary to enable smarter logic at the selection.

But I can't keep the manual scan and parent switch code while it causes my network to break which leaves me in a real predicament performance wise.

I really need a resolution to this as my priority.

Thanks for your ongoing help

zhangyanjiaoesp · 2024-11-22T03:18:57Z

The definition of reason code 100/101 is here:

esp-idf/components/esp_wifi/include/esp_mesh.h

Line 267 in f420609

} mesh_disconnect_reason_t;

zhangyanjiaoesp · 2024-11-22T03:28:39Z

> I don't feel comfortable with this solution unless it is endorsed by you guys, but anyway after several successful cycles, it failed again with lots of:

You can use the reason code to categorize the issues, but this is not entirely reliable, as different scenarios may generate the same reason code.

> I have tried this but no success.

Are you saying that using disconnected->reason == WIFI_REASON_BEACON_TIMEOUT for judgment is completely ineffective, or that it can work but not as well as (disconnected->reason != WIFI_REASON_ASSOC_LEAVE) && (disconnected->reason != WIFI_REASON_NO_AP_FOUND) && (disconnected->reason != WIFI_REASON_AP_TSF_RESET)?

zhangyanjiaoesp · 2024-11-22T03:45:17Z

Do the test 2 1/2/3/4 logs refer to the four devices in a single round of testing?
Where is test 3?

michaelsimp · 2024-11-22T05:48:55Z

Yes Test 2 1.txt through test 2 4.txt were the 4 devices on a single round of testing
Sorry here are the missing test logs from yesterday
Nov22.zip
Tests 1, 2, 3 were done on the software with:
disconnected->reason == WIFI_REASON_BEACON_TIMEOUT

Test 4 was done with:
(disconnected->reason != WIFI_REASON_ASSOC_LEAVE) && (disconnected->reason != WIFI_REASON_NO_AP_FOUND) && (disconnected->reason != WIFI_REASON_AP_TSF_RESET)

Are you saying that using disconnected->reason == WIFI_REASON_BEACON_TIMEOUT for judgment is completely ineffective, or it can work but can't work as well as (disconnected->reason != WIFI_REASON_ASSOC_LEAVE) && (disconnected->reason != WIFI_REASON_NO_AP_FOUND) && (disconnected->reason != WIFI_REASON_AP_TSF_RESET)

Neither works reliably. After 2 or 3 cycles some nodes fail, go to MESH_IDLE, and do not scan, and the network is broken.

zhangyanjiaoesp · 2024-11-25T03:33:16Z

I just reviewed the log for test3, and the device behavior is normal. The device being in the MESH_IDLE state is not permanent; it is a temporary state. Below is my analysis:

At the beginning, the self-organizing network formed the following topology:
root(53:d8) --- node A (39:d4)
|--- node B (72:b8)
|--- node C (5c:68)
Node C calls manual scan, selects A as the better parent, and becomes a layer 3 node (self-organized disabled, set parent).
Node A calls manual scan, still selects root as the better parent, and remains a layer 2 node (self-organized disabled, set parent).
Root is powered off.
Node B detects that root has left (beacon timeout, parent disconnect), enables self-organized, and becomes root.
Node A detects that root has left (beacon timeout, parent disconnect) and enables self-organized. However, at that moment it was sending data and trying to reconnect, so when you queried it, the device was shown in the MESH_IDLE state. I believe that if the device remains in an idle state and cannot recover, then this is an issue. However, if there are no subsequent logs, I don't consider it a problem. You cannot expect the device to always be in a non-idle state whenever the application layer checks the mesh status.
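
One way to tell a temporary MESH_IDLE from a node that is really stuck is to record when the idle state began and only flag it once it has lasted longer than some limit. The sketch below is a rough illustration of that idea: `idle_watch_t` and both helper functions are hypothetical names, the limit is whatever the application chooses, and the `is_idle` input is assumed to come from something like comparing `esp_mesh_get_type()` against `MESH_IDLE` in the application.

```c
#include <assert.h>
#include <stdbool.h>
#include <stddef.h>
#include <stdint.h>

/* Hypothetical tracker for how long a node has been idle. */
typedef struct {
    bool     idle;           /* last observed state */
    uint32_t idle_since_ms;  /* timestamp when idle was first observed */
} idle_watch_t;

/* Call on every status poll or mesh event with the current state. */
void idle_watch_update(idle_watch_t *w, bool is_idle, uint32_t now_ms)
{
    assert(w != NULL);
    if (is_idle && !w->idle) {
        w->idle_since_ms = now_ms;   /* idle period starts now */
    }
    w->idle = is_idle;
}

/* True only if the node has been continuously idle for limit_ms or more. */
bool idle_watch_stuck(const idle_watch_t *w, uint32_t now_ms, uint32_t limit_ms)
{
    assert(w != NULL);
    /* Unsigned subtraction stays correct across timer wrap-around. */
    return w->idle && (uint32_t)(now_ms - w->idle_since_ms) >= limit_ms;
}
```

A single idle reading right after a parent disconnect would not trip this check; only an idle period that outlasts the chosen limit would, and that is the case worth reporting with full logs.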

zhangyanjiaoesp · 2024-11-25T03:55:08Z

In the test4 log, the device eventually connected successfully.

The log you referred to is just a part of the intermediate process.

zhangyanjiaoesp · 2024-11-25T03:59:12Z

According to your test log, I think disconnected->reason == WIFI_REASON_BEACON_TIMEOUT will be better than (disconnected->reason != WIFI_REASON_ASSOC_LEAVE) && (disconnected->reason != WIFI_REASON_NO_AP_FOUND) && (disconnected->reason != WIFI_REASON_AP_TSF_RESET), because there are too many disconnect reasons, and it is unreasonable to switch to self-organized mode as soon as the reason is not equal to 8, 201, or 206.
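
The difference between the two conditions can be seen by writing them as small predicates. This is a sketch only: the enum below is a local copy of the values mentioned in this thread (8, 201, 206, and 200 for beacon timeout, as in the reason:200 log line), and the function names are made up for the example. In real code the `WIFI_REASON_*` constants from the IDF headers would be used instead.

```c
#include <assert.h>
#include <stdbool.h>

/* Local copies of the reason values discussed in this thread. */
enum {
    REASON_ASSOC_LEAVE    = 8,
    REASON_BEACON_TIMEOUT = 200,
    REASON_NO_AP_FOUND    = 201,
    REASON_AP_TSF_RESET   = 206,
};

/* Narrow policy: fall back to self-organized mode only on beacon timeout. */
bool fallback_on_beacon_timeout(int reason)
{
    assert(reason >= 0);   /* reason codes are non-negative */
    return reason == REASON_BEACON_TIMEOUT;
}

/* Broad policy: fall back on anything except three excluded reasons. */
bool fallback_unless_excluded(int reason)
{
    assert(reason >= 0);
    return reason != REASON_ASSOC_LEAVE &&
           reason != REASON_NO_AP_FOUND &&
           reason != REASON_AP_TSF_RESET;
}
```

For a reason such as 139, the broad policy switches to self-organized mode while the narrow one does not, which is why the broad form reacts to many unrelated disconnects.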

michaelsimp · 2024-11-25T04:09:41Z

How long should it take for a MESH_IDLE node to find a parent again? I am sure I waited tens of seconds and it wasn't even scanning.
I am setting up for another run of tests where I will wait longer. I am just worried about my back trace overflowing using Putty.

zhangyanjiaoesp · 2024-11-25T04:10:13Z

In the test2 logs, I cannot analyze the entire network change process as I did with test3 because the log only contains part of the information. It is curious why such a reason would occur.

It seems that in test2_2 and test2_3, there was no opportunity to switch to the self-organized network, and the device kept trying to connect to the originally configured parent, but the parent could not be detected.

michaelsimp · 2024-11-25T05:36:48Z

Hi

Today I am getting problems where nodes get stuck in a loop logging forever. I can't keep my trace open long enough as I lose the start, but take my word for it please, once in this state it never comes out no matter how long (minutes). eg

I (00:02:07.516) aWifiMesh: <MESH_EVENT_PARENT_DISCONNECTED>reason:139
W (00:02:07.528) aWifiMesh: WiFi Disconnected
>>>last layer = 4, layer = -1
W (00:02:07.529) aWifiMesh: <MESH_EVENT_ROUTING_TABLE_REMOVE>remove 1, new:3
I (128652) mesh: [wifi]disconnected reason:201(), continuous:1/max:12, non-root, vote(,stopped)<><>
I (00:02:07.644) aWifiMesh: <MESH_EVENT_PARENT_DISCONNECTED>reason:201
W (00:02:07.645) aWifiMesh: WiFi Disconnected
>>>last layer = 4, layer = -1
I (128772) mesh: [wifi]disconnected reason:201(), continuous:2/max:12, non-root, vote(,stopped)<><>
I (00:02:07.769) aWifiMesh: <MESH_EVENT_PARENT_DISCONNECTED>reason:201
W (00:02:07.769) aWifiMesh: WiFi Disconnected
>>>last layer = 4, layer = -1
I (128882) mesh: 1145[xrsp:1]the asked:19, max window:2, force to increase/decrease(up) xseqno:17 for child 48:ca:43:9b:5d:20, xrsp_seqno:14, heap:101160
I (128892) mesh: 1307[recv]cidx[0]48:ca:43:9b:5d:20 xseqno loss, current/new:15/19, in:17, out:17, pending:0
I (128892) mesh: [wifi]disconnected reason:201(), continuous:3/max:12, non-root, vote(,stopped)<><>
I (00:02:07.893) aWifiMesh: <MESH_EVENT_PARENT_DISCONNECTED>reason:201
W (00:02:07.894) aWifiMesh: WiFi Disconnected
>>>last layer = 4, layer = -1
I (129022) mesh: [wifi]disconnected reason:201(), continuous:4/max:12, non-root, vote(,stopped)<><>
I (00:02:08.018) aWifiMesh: <MESH_EVENT_PARENT_DISCONNECTED>reason:201
W (00:02:08.019) aWifiMesh: WiFi Disconnected
>>>last layer = 4, layer = -1
I (129142) mesh: [wifi]disconnected reason:201(), continuous:5/max:12, non-root, vote(,stopped)<><>
I (00:02:08.143) aWifiMesh: <MESH_EVENT_PARENT_DISCONNECTED>reason:201
W (00:02:08.144) aWifiMesh: WiFi Disconnected
>>>last layer = 4, layer = -1
I (129272) mesh: [wifi]disconnected reason:201(), continuous:6/max:12, non-root, vote(,stopped)<><>
I (00:02:08.268) aWifiMesh: <MESH_EVENT_PARENT_DISCONNECTED>reason:201
W (00:02:08.269) aWifiMesh: WiFi Disconnected
>>>last layer = 4, layer = -1

Nov 25.zip

I found I am the author of one way that this can happen, if I scan and switch parents manually. See test1 logs attached where
node A MAC 48:27:e2:18:39:80 switches to node B 48:ca:43:9b:5d:20
Node B MAC 48:ca:43:9b:5d:20 switches to node A 48:27:e2:18:39:80

This is one cause of the above. I think I can fix this by checking that I am not switching the node's parent to one of its children.
I think it probably also makes sense to not swap to a parent node which has a higher layer than this node too.
But while not ideal that I am doing this, it shouldn't result in the node getting stuck in a disconnect loop?
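
The two checks proposed above (do not pick a node from a deeper layer, and do not pick one of your own descendants) could be sketched as follows. This is an illustration under assumptions: `mac_t`, `is_descendant()`, and `parent_switch_allowed()` are hypothetical names, and the descendant list is assumed to be filled by the application, for example from the node's routing table via `esp_mesh_get_routing_table()`. The layer values are assumed to come from `esp_mesh_get_layer()` and the scanned `assoc.layer`.

```c
#include <assert.h>
#include <stdbool.h>
#include <stddef.h>
#include <stdint.h>
#include <string.h>

#define MAC_LEN 6

/* Hypothetical MAC holder; similar in spirit to a 6-byte mesh address. */
typedef struct {
    uint8_t addr[MAC_LEN];
} mac_t;

/* True if candidate appears in this node's list of descendants. */
bool is_descendant(const mac_t *descendants, size_t n, const mac_t *candidate)
{
    for (size_t i = 0; i < n; i++) {
        if (memcmp(descendants[i].addr, candidate->addr, MAC_LEN) == 0) {
            return true;
        }
    }
    return false;
}

/* Reject candidates that sit deeper in the tree than this node, and
 * candidates that are already below this node in the routing table. */
bool parent_switch_allowed(int my_layer, int candidate_layer,
                           const mac_t *descendants, size_t n,
                           const mac_t *candidate)
{
    assert(candidate != NULL);
    assert(descendants != NULL || n == 0);
    if (candidate_layer > my_layer) {
        return false;   /* deeper node: could be part of our own subtree */
    }
    return !is_descendant(descendants, n, candidate);
}
```

With this guard, node A would refuse to switch to node B while B is in A's subtree, so the two nodes could not end up pointing at each other as in the test1 logs.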

But Test 2 looks like the same problem but is not triggered by the above. MESH_ROOT is powered off. There is a panic crash on MESH_NODE 48:ca:43:9b:5d:20, which still happens from time to time, but my bigger concern is that after this, MESH_NODE MAC 48:27:e2:18:39:80 gets stuck in the disconnect loop.

Are you able to reproduce these problems with the ip_internal_network you modified a week or so back? I get lost in all of this and feel we would make better progress if you were able to test, analyze and debug directly.

michaelsimp · 2024-11-25T06:27:18Z

Hi again
Regarding your analysis of test 3 specifically node A where you said.
node A found root leave, beacon timeout, parent disconnect, enable self-organized. However, at that moment, it was sending data, and it is trying to reconnecting, when you queried, the device was shown as in the MESH_IDLE state. I believe that if the device remains in an idle state and cannot recover, then this is an issue. However, if there are no subsequent logs, I don't consider it a problem. You cannot expect the device to always be in a non-idle state whenever the application layer checks the mesh status.
The disconnect came at 00:01:18
I (00:01:18.371) aWifiMesh: <MESH_EVENT_PARENT_DISCONNECTED>reason:200
I stopped the log after 00:01:58, 40 seconds later and nothing was happening, no visible signs of scanning for a parent.
I am sure that when this occurs, no matter how long I leave it, it does not recover.
Also, when it gets stuck in the disconnect loop, it logs the repetitive sequence (above) indefinitely.

michaelsimp · 2024-11-25T06:51:56Z

Two posts back I wrote:

I found I am the author of one way that this can happen, if I scan and switch parents manually. See test1 logs attached where
node A MAC 48:27:e2:18:39:80 switches to node B 48:ca:43:9b:5d:20
Node B MAC 48:ca:43:9b:5d:20 switches to node A 48:27:e2:18:39:80

This is one cause of the above. I think I can fix this by checking that I am not switching the node's parent to one of its children.
I think it probably also makes sense to not swap to a parent node which has a higher layer than this node too.
But while not ideal that I am doing this, it shouldn't result in the node getting stuck in a disconnect loop?

When I looked at the code, I am finding it difficult to decipher the variables

parent_record is the best parent candidate found so far
assoc, I think, is each node found in the scan by esp_mesh_scan_get_ap_record(&record, &assoc); is this correct?
parent_assoc I am not clear on what this is or how it gets set.
It is initialized to: mesh_assoc_t parent_assoc = { .layer = CONFIG_MESH_MAX_LAYER, .rssi = -120 }; as a worst case record
and updated to contents of assoc when a better parent is found

the original source (taken from the example project "manual_networking") seems to be already checking :
if (assoc.layer < parent_assoc.layer || assoc.layer2_cap < parent_assoc.layer2_cap) {
But I am not sure if this stops a MESH_NODE selecting a child as a new parent, or do I need to add the line:
if (esp_mesh_get_layer() >= assoc.layer)

Could you take a look at the test 1 logs as they do appear to be setting parent to each other.

The entire routine is currently as follows if you could check and make any changes please.

void findClosestParent(int num) { // after a WiFi scan
    ESP_LOGW(TAG, "findClosestParent  Current RSSI: %d", currentRSSI);
    int i;
    int ie_len = 0;
    mesh_assoc_t assoc;
    mesh_assoc_t parent_assoc = { .layer = CONFIG_MESH_MAX_LAYER, .rssi = -120 };
    wifi_ap_record_t record;
    wifi_ap_record_t parent_record = { 0, };
    parent_record.rssi = currentRSSI; // has to be better than current RSSI to change parent
    bool parent_found = false;
    mesh_type_t my_type = MESH_IDLE;
    int my_layer = -1;
    wifi_config_t parent = { 0, };
    wifi_scan_config_t scan_config = { 0 };

    for (i = 0; i < num; i++) { // iterate through scan records looking for eligible closer parent node
        ESP_ERROR_CHECK(esp_mesh_scan_get_ap_ie_len(&ie_len));
        ESP_ERROR_CHECK(esp_mesh_scan_get_ap_record(&record, &assoc));
        ESP_LOGD(TAG, "ie_len: %d  sizeof(assoc): %d", ie_len, (int)sizeof(assoc)); // sizeof yields size_t, so cast to match %d
        if (ie_len == sizeof(assoc)) {
            ESP_LOGI(TAG,
                     "<MESH>[%d]%s, layer:%d/%d, assoc:%d/%d, %d, "MACSTR", channel:%u, rssi:%d, ID<"MACSTR"><%s>",
                     i, record.ssid, assoc.layer, assoc.layer_cap, assoc.assoc, assoc.assoc_cap, assoc.layer2_cap, MAC2STR(record.bssid),
                     record.primary, record.rssi, MAC2STR(assoc.mesh_id), assoc.encrypted ? "IE Encrypted" : "IE Unencrypted");

            // ESP_LOGI(MESH_TAG, "Type: %d  layer_cap %d:  assoc %d  assoc_cap: %d  rssi: %d", assoc.mesh_type, assoc.layer_cap, assoc.assoc, assoc.assoc_cap, record.rssi);
            if (assoc.mesh_type != MESH_IDLE && assoc.layer_cap && assoc.assoc < assoc.assoc_cap) { 
                // ESP_LOGI(MESH_TAG, "assoc.layer: %d  parent_assoc.layer %d:  assoc.layer2_cap %d  parent_assoc.layer2_cap: %d", assoc.layer, parent_assoc.layer, assoc.layer2_cap, parent_assoc.layer2_cap);
                if (assoc.layer < parent_assoc.layer || assoc.layer2_cap < parent_assoc.layer2_cap) {
                    if (record.rssi > parent_record.rssi) { // closer parent found
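                        if (memcmp(parent_record.bssid, record.bssid, MAC_SIZE) != 0) { // skips a duplicate of the best candidate so far; parent_record.bssid starts zeroed, so this is not a check against the current parent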
                            ESP_LOGW(TAG, "Closer Parent found: %s  RSSI: %d", record.ssid, record.rssi);
                            parent_found = true;
                            memcpy(&parent_record, &record, sizeof(record));
                            memcpy(&parent_assoc, &assoc, sizeof(assoc));
                            if (parent_assoc.layer_cap != 1) {
                                my_type = MESH_NODE;
                            } else {
                                my_type = MESH_LEAF;
                            }
                            my_layer = parent_assoc.layer + 1;
                            // break; // MSB removed, keep searching for the closest parent
                        }
                    }
                }
            }
        } else {
            ESP_LOGD(TAG, "[%d]%s, "MACSTR", channel:%u, rssi:%d", i, record.ssid, MAC2STR(record.bssid), record.primary, record.rssi);
        }
    }

    esp_mesh_flush_scan_result();
    if (parent_found) { // parent: Both channel and SSID of the parent are mandatory
        parent.sta.channel = parent_record.primary;
        memcpy(&parent.sta.ssid, &parent_record.ssid, sizeof(parent_record.ssid));
        parent.sta.bssid_set = 1;
        memcpy(&parent.sta.bssid, parent_record.bssid, 6);
        if ((my_type == MESH_NODE) || (my_type == MESH_LEAF) || (my_type == MESH_IDLE)) {
            ESP_ERROR_CHECK(esp_mesh_set_ap_authmode(parent_record.authmode));
            if (parent_record.authmode != WIFI_AUTH_OPEN) {
                memcpy(&parent.sta.password, CONFIG_MESH_AP_PASSWD, strlen(CONFIG_MESH_AP_PASSWD));
            }
            ESP_LOGW(TAG,
                     "<PARENT>%s, layer:%d/%d, assoc:%d/%d, %d, "MACSTR", channel:%u, rssi:%d",
                     parent_record.ssid, parent_assoc.layer,
                     parent_assoc.layer_cap, parent_assoc.assoc,
                     parent_assoc.assoc_cap, parent_assoc.layer2_cap,
                     MAC2STR(parent_record.bssid), parent_record.primary,
                     parent_record.rssi);
            esp_err_t err = esp_mesh_set_parent(&parent, (mesh_addr_t *)&parent_assoc.mesh_id, my_type, my_layer);
            switchParentTimer = currentTimeMs(); // reset timer for event <MESH_EVENT_PARENT_DISCONNECTED>
            if (err != ESP_OK) {
                ESP_LOGE(TAG, "esp_mesh_set_parent Error %d  my_type: %d  my_layer: %d", err, my_type, my_layer);
            }
            selfOrganizeReactivateTimer = SELF_ORGANIZE_REACTIVATE_TIME; // start self organize reactivation timer
        }
    } else {
        ESP_LOGE(TAG, "No eligible closer Parent found");
        if (currentRSSI == NO_RSSI) { // scan again if no connection yet
            esp_mesh_set_self_organized(false, false);
            esp_wifi_scan_stop();
            scan_config.show_hidden = 1;
            scan_config.scan_type = WIFI_SCAN_TYPE_PASSIVE;
            esp_wifi_scan_start(&scan_config, 0);
        }
    }
}
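The two extra checks asked about above (do not pick one of this node's own children, and do not pick a candidate on a deeper layer) can be kept separate from the RSSI comparison. The following is only a sketch in plain C, not code from the example project. `candidate_t` is a hypothetical stand-in for the BSSID, layer and RSSI fields read from `wifi_ap_record_t` and `mesh_assoc_t`, and the routing table is assumed to have been copied by the caller (for example with `esp_mesh_get_routing_table()`) into a flat array of station MACs. The softAP address being the station address plus one in the last byte is also an assumption and should be verified on the actual hardware.

```c
#include <assert.h>
#include <stdbool.h>
#include <stddef.h>
#include <stdint.h>
#include <string.h>

#define MAC_LEN 6

/* Hypothetical stand-in for the fields used from the scan record. */
typedef struct {
    uint8_t bssid[MAC_LEN]; /* softAP MAC reported by the scan */
    int layer;              /* layer advertised by the candidate */
    int rssi;               /* signal strength in dBm */
} candidate_t;

/* True if the candidate's BSSID belongs to a node in our own routing table.
 * table holds n station MACs of MAC_LEN bytes each.
 * Assumption: softAP MAC == station MAC with the last byte incremented. */
static bool is_descendant(const candidate_t *c, const uint8_t *table, size_t n)
{
    assert(c != NULL);
    for (size_t i = 0; i < n; i++) {
        uint8_t ap[MAC_LEN];
        memcpy(ap, table + i * MAC_LEN, MAC_LEN);
        ap[MAC_LEN - 1] = (uint8_t)(ap[MAC_LEN - 1] + 1);
        if (memcmp(ap, c->bssid, MAC_LEN) == 0) {
            return true;
        }
    }
    return false;
}

/* A candidate qualifies only if it sits strictly above this node (smaller
 * layer number) and is not one of this node's descendants. */
static bool is_eligible_parent(const candidate_t *c, int my_layer,
                               const uint8_t *table, size_t n)
{
    if (c->layer >= my_layer) {
        return false;
    }
    return !is_descendant(c, table, n);
}
```

Such a filter would run before the RSSI test in the scan loop. Relaxing the first test to allow equal layers admits same-layer parents, at the cost of moving this node one layer deeper.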

michaelsimp · 2024-11-25T22:35:08Z

Hi

By the way, all of yesterday's and today's tests and logs were made with your recommendation of only using
disconnected->reason == WIFI_REASON_BEACON_TIMEOUT

I have been testing by getting the 4 nodes stacked up across 4 layers and powering off the NODE on layer 2 rather than the MESH_ROOT, as this provides a cleaner set of logs.
Test 1 Nov 26.zip

See test 1

Layer 1 MESH_ROOT 48:ca:43:9b:53:d8
NODE A Layer 2 48:27:e2:18:39:80
NODE B Layer 3 48:ca:43:9b:54:c0
NODE C Layer 4 48:ca:43:9b:5d:20

Then power down NODE A on layer 2

NODE B switched from layer 3 to layer 2 and parent from NODE A to MESH_ROOT - perfect
NODE C stayed on layer 4 with parent 48:ca:43:9b:54:c1 which is now on layer 2, and does not show in the

Is this valid?
Node B moved from layer 3 to 2 when its parent dropped. Why did Node C not move to layer 3 ?

It stayed like this for minutes while I wrote this up

Then I powered off the MESH_ROOT; see test 1 MESH_ROOT line 469. This node 48:ca:43:9b:53:d8 now becomes MESH_NODE and child of Node B.

See test 1 Node B.txt line 2038
NODE B which was on layer 2 connected to MESH_ROOT goes to MESH_IDLE with 2 children Node C and the old MESH_ROOT 48:ca:43:9b:53:d8

Remains broken like this indefinitely.

zhangyanjiaoesp · 2024-11-27T11:49:09Z

Regarding the issue of the log infinite loop (aWifiMesh: <MESH_EVENT_PARENT_DISCONNECTED>reason:201), I have already explained it in my previous comment.

It seems that in test2_2 and test2_3, there was no opportunity to switch to the self-organized network, and the device kept trying to connect to the originally configured parent, but the parent could not be detected.

Maybe we should first investigate why the specified parent node cannot be found at this point. Is it due to a power failure, has it become idle, or is there another underlying reason?

Are you able to reproduce these problems with the ip_internal_network you modified a week or so back? I get lost in all of this and feel we would make better progress if you were able to test, analyze and debug directly.

Sorry, I can't reproduce your issue on my side.

I have already discussed this with you in my previous comment: when selecting a better parent node, what criteria do you prioritize? I believe you can completely disregard the conditions in the example and instead design your own criteria based on your specific needs. First, you can move the definitions of parent_assoc and parent_record outside,

then update the parent_assoc->layer when connecting to the parent.

Before scanning, retrieve the current parent information.

Finally, within the findClosestParent() function, design the criteria for selecting a better parent based on your requirements and the issues encountered during testing.
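The steps above (keep the parent information outside the function, update it on connect, refresh it before scanning) could be sketched as follows. This is plain C with hypothetical names (`parent_state_t`, `s_parent`, `margin_db`), not ESP-IDF API; the handlers named in the comments are where the calls would be made in an application. The `margin_db` hysteresis is an added assumption, and passing 0 reproduces the "has to be better than current RSSI" rule from findClosestParent().

```c
#include <assert.h>
#include <stdbool.h>
#include <stdint.h>
#include <string.h>

#define MAC_LEN 6

/* Hypothetical file-scope record of the parent this node is attached to. */
typedef struct {
    bool connected;
    uint8_t bssid[MAC_LEN];
    int my_layer; /* layer this node held when it connected */
    int rssi;     /* last RSSI measured towards the parent */
} parent_state_t;

static parent_state_t s_parent;

/* Call from the MESH_EVENT_PARENT_CONNECTED handler. */
static void parent_state_on_connected(const uint8_t *bssid, int my_layer)
{
    assert(bssid != NULL);
    s_parent.connected = true;
    memcpy(s_parent.bssid, bssid, MAC_LEN);
    s_parent.my_layer = my_layer;
}

/* Call from the MESH_EVENT_PARENT_DISCONNECTED handler. */
static void parent_state_on_disconnected(void)
{
    s_parent.connected = false;
}

/* Call just before starting a scan, with a freshly read parent RSSI. */
static void parent_state_refresh_rssi(int rssi)
{
    s_parent.rssi = rssi;
}

/* A scan result counts as better only if it is a different AP whose RSSI
 * beats the current parent by more than margin_db. */
static bool is_better_than_current(const uint8_t *bssid, int rssi, int margin_db)
{
    if (!s_parent.connected) {
        return true; /* nothing to compare against */
    }
    if (memcmp(bssid, s_parent.bssid, MAC_LEN) == 0) {
        return false; /* same parent, never switch to it */
    }
    return rssi > s_parent.rssi + margin_db;
}
```

Keeping this state at file scope means the "same parent" test compares against the parent actually in use rather than against a zero-initialized record.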

My thoughts were, I am not wanting to build the mesh network from scratch as I start with a self configured network. I am only planning to make changes to nodes with poor RSSIs. So far my tests have been successful network architecture wise (when I have a fixed ROOT so I don't get the broken mesh problem).

You mentioned that you don't want to rebuild the network from scratch, but instead, you want to adjust the initial network formed by the self-organizing process. However, during the actual testing, I've observed that you often call scan at the application layer while the initial network is still being formed, which forcibly interrupts the self-organizing process.

So, when you call scan at the application layer, is it completely random? Would it make sense to first check whether the initial network has been fully formed before manually triggering the scan?
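One conservative way to avoid interrupting the self-organizing process is to gate application-level scans on how long the node has stayed connected to a parent. The sketch below is a heuristic only, not an ESP-IDF facility and not a definitive test for "fully formed"; `scan_gate_t` and the settle time are hypothetical, and the millisecond timestamps are assumed to come from whatever clock the application already uses.

```c
#include <assert.h>
#include <stdbool.h>
#include <stddef.h>
#include <stdint.h>

/* Hypothetical gate: permit a manual scan only after the node has been
 * connected to a parent for settle_ms without interruption. */
typedef struct {
    bool connected;
    uint32_t connected_since_ms;
    uint32_t settle_ms;
} scan_gate_t;

static void scan_gate_init(scan_gate_t *g, uint32_t settle_ms)
{
    assert(g != NULL);
    g->connected = false;
    g->connected_since_ms = 0;
    g->settle_ms = settle_ms;
}

/* Call from the parent-connected event handler. */
static void scan_gate_on_parent_connected(scan_gate_t *g, uint32_t now_ms)
{
    g->connected = true;
    g->connected_since_ms = now_ms;
}

/* Call from the parent-disconnected event handler. */
static void scan_gate_on_parent_disconnected(scan_gate_t *g)
{
    g->connected = false;
}

static bool scan_gate_may_scan(const scan_gate_t *g, uint32_t now_ms)
{
    if (!g->connected) {
        return false;
    }
    /* Unsigned subtraction stays correct if the ms counter wraps. */
    return (uint32_t)(now_ms - g->connected_since_ms) >= g->settle_ms;
}
```

The background task would then call `scan_gate_may_scan()` in addition to its existing non-idle check before starting a scan.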

I believe we must first resolve the issues mentioned in points 3 and 4 before proceeding with further problem analysis. If the initial logic framework isn't properly established, it could lead to a range of unforeseen issues down the line, which would be quite painful for me to handle.

michaelsimp · 2024-11-28T04:35:29Z

The problem I am trying to solve all the way through this is triggered by the MESH_ROOT or the parent of a node being powered off or rebooted. So yes, I have definitely been both powering off and rebooting the parent, and both trigger this issue. So this is why the parent cannot be found at this point; the question is how to fix it. This can and will happen in the field.
The latest catch of one of these today, I powered off the parent on layer 2 and the MESH_NODE on layer 3 continued to report this see attached. This was made after the changes documented in point 3 below.
MESH_EVENT_PARENT_DISCONNECTED.txt
So the symptoms are continuous <MESH_EVENT_PARENT_DISCONNECTED> events OR nothing. Node just sits in MESH_IDLE state and does nothing.
I have made the changes you suggested, but as you commented this would stop my nodes finding a better parent. So I modified the event <MESH_EVENT_PARENT_CONNECTED> to:
parent_assoc.layer = mesh_layer; // MBS consider can switch to parent on same level or better

and function:

void findClosestParent(int num) { // after a WiFi scan
...
                if (assoc.layer <= parent_assoc.layer || assoc.layer2_cap < parent_assoc.layer2_cap) { // parents on same layer or better qualify
...

I understand this could move a node one layer deeper (numerically higher), but I think this is acceptable, as from my testing it is much better to be one layer further back with a stronger RSSI. Do you understand and accept this strategy, at least as far as not causing the broken network issue?
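The effect of the relaxed comparison on a node's position can be worked out with the same rule findClosestParent() already applies (`my_layer = parent_assoc.layer + 1`, and MESH_LEAF when the parent's `layer_cap` is 1). A small sketch in plain C, using hypothetical names (`placement_t`, `placement_under`, `layer_delta`) that are not part of ESP-IDF:

```c
#include <assert.h>

/* Hypothetical mirror of the MESH_NODE / MESH_LEAF choice. */
typedef enum { PLACE_NODE, PLACE_LEAF } place_type_t;

typedef struct {
    place_type_t type;
    int layer;
} placement_t;

/* Mirrors the branch in findClosestParent(): layer_cap == 1 gives a leaf,
 * and the child always sits one layer below its parent. */
static placement_t placement_under(int parent_layer, int parent_layer_cap)
{
    assert(parent_layer >= 1); /* the root is layer 1 */
    placement_t p;
    p.type = (parent_layer_cap != 1) ? PLACE_NODE : PLACE_LEAF;
    p.layer = parent_layer + 1;
    return p;
}

/* Signed change in this node's layer after switching:
 * +1 means one layer deeper, 0 means unchanged, negative means closer to root. */
static int layer_delta(int my_layer, int parent_layer)
{
    return (parent_layer + 1) - my_layer;
}
```

With `assoc.layer <= mesh_layer`, a candidate on the node's own layer gives a delta of +1, which is the single-layer drop described above; a candidate one layer up leaves the layer unchanged.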

My background task only scans for a better parent, if (esp_mesh_get_type() != MESH_IDLE), as I assumed this meant the network was established.
As mentioned earlier, I have been manually triggering the scan from my console (my main task) when I consider the network to be established based on the event ip_event_handler triggering which calls my printMeshInfo(); function.
Two questions:

1 You said "Would it make sense to first check whether the initial network has been fully formed before manually triggering the scan?" What is a suitable test to determine the network is fully formed?
2 Am I potentially doing too much in the event_handler functions? Is there any restriction on how long they can run (within reason of course)?
3 How long should it take for a node which is disconnected or MESH_IDLE to start to search for a parent again or scan for MESH_ROOT if the MESH_ROOT disappears? Just roughly eg 10 seconds, 30 seconds, longer? I have waited 10s of minutes while writing up logs etc...
Also once the network is broken and not able to fix itself, I sometimes execute the following commands from my console to try and fix it.
wifi scan - triggers my wifiMeshScan() function to look for a new parent. But only if the node is not MESH_IDLE.
wifi root - triggers my wifiMeshRoot() function which calls ESP_ERROR_CHECK(esp_mesh_set_self_organized(true, true));
But neither of these has ever fixed it.

zhangyanjiaoesp · 2024-11-28T10:03:08Z

For the <MESH_EVENT_PARENT_DISCONNECTED>reason:201 loop issue, is it possible to add a timer or counter, so that once the time or count exceeds a certain threshold, the function esp_mesh_set_self_organized(true, true) is called to reselect the parent?
How about directly comparing with mesh_layer ?

void findClosestParent(int num) { // after a WiFi scan
...
                if (assoc.layer <= mesh_layer || assoc.layer2_cap < parent_assoc.layer2_cap) { // parents on same layer or better qualify
...

My background task only scans for a better parent, if (esp_mesh_get_type() != MESH_IDLE), as I assumed this meant the network was established.

Why does this phenomenon occur if you only call scanning when it is not idle? It's obvious that the device hasn't been connected to WiFi yet.

4.

How long should it take for a node which is disconnected or MESH_IDLE to start to search for a parent again or scan for MESH_ROOT if the MESH_ROOT disappears?

If the device is in self-organized network, it will attempt to reconnect the original parent for at least 6 seconds before selecting a new parent.
5.

wifi root - triggers my wifiMeshRoot() function which calls ESP_ERROR_CHECK(esp_mesh_set_self_organized(true, true));

Have you ever called this command during the testing process? I don't think I saw this command in the log.

wifi scan - triggers my wifiMeshScan() function to look for a new parent. But only if the node is not MESH_IDLE.

The function of this scan cannot cover all situations because the judgment criteria are relatively single. Perhaps we can add more judgments?
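The timer-or-counter idea from the first point of this comment could look roughly like the sketch below. It is plain C with hypothetical names (`disconnect_guard_t` and its thresholds); the only ESP-IDF call involved is the `esp_mesh_set_self_organized(true, true)` already mentioned, which the caller would issue when the guard returns true.

```c
#include <assert.h>
#include <stdbool.h>
#include <stddef.h>
#include <stdint.h>

/* Hypothetical guard against an endless PARENT_DISCONNECTED loop. */
typedef struct {
    uint32_t count;     /* consecutive disconnect events in this run */
    uint32_t first_ms;  /* timestamp of the first event in this run */
    uint32_t max_count; /* trip after this many events */
    uint32_t max_ms;    /* or after the run has lasted this long */
} disconnect_guard_t;

static void disconnect_guard_init(disconnect_guard_t *g,
                                  uint32_t max_count, uint32_t max_ms)
{
    assert(g != NULL);
    g->count = 0;
    g->first_ms = 0;
    g->max_count = max_count;
    g->max_ms = max_ms;
}

/* Call from the parent-connected handler: the loop is over. */
static void disconnect_guard_reset(disconnect_guard_t *g)
{
    g->count = 0;
}

/* Call from the parent-disconnected handler. Returns true when the caller
 * should give up on the fixed parent and re-enable self-organized mode. */
static bool disconnect_guard_on_disconnect(disconnect_guard_t *g, uint32_t now_ms)
{
    if (g->count == 0) {
        g->first_ms = now_ms;
    }
    g->count++;
    bool too_many = g->count >= g->max_count;
    bool too_long = (uint32_t)(now_ms - g->first_ms) >= g->max_ms;
    if (too_many || too_long) {
        g->count = 0; /* start a fresh run after tripping */
        return true;
    }
    return false;
}
```

Suitable thresholds are not given in this thread; whatever values are chosen should sit comfortably above the reconnect window the mesh stack itself uses.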

michaelsimp · 2024-11-28T22:00:32Z

Yes I am happy to add this if it worked. But as I said for now I can invoke esp_mesh_set_self_organized(true, true) manually from my console command "mesh root". See logs attached, where I rebooted the MESH_ROOT to start the problems:
Nov 29.zip

test 1 mesh root
line 378 Reboot the mesh root

test 1 became 2nd root
Line 524 the disconnect loop started.
Line 946 invoke wifi root command
Node becomes MESH_ROOT but there already is a MESH_ROOT

test 1 became node on layer 5
Line 126 starts logging I (65677) wifi:>>>intv = 102400, max = 0. I don't know what this means
Line 437 disconnect loop starts
line 1621 run wifi root command
Node becomes MESH_NODE on layer 5 to parent 48:ca:43:9b:53:d9 which is on layer 2

test 1 node 48ca439b53d8
for your reference. Ends up mesh node on layer 2 to the second MESH_ROOT

You said "How about directly comparing with mesh_layer ?"
I don't understand what you mean here. Please elaborate
You said "Why does this phenomenon occur if you only call scanning when it is not idle? It's obvious that the device hasn't been connected to WiFi yet."
It is not obvious to me. I don't know what half the logs mean reported from "wifi" and "mesh"
I only run wifi scan when the node looks stable (not logging a whole lot of stuff) and it is reporting MESH_NODE
Can you please advise a suitable test I can implement to determine if it is safe to scan for closer parents.
My test logs clearly show I have waited much longer than 6 seconds for a node type MESH_IDLE to take some action - more like up to 60 seconds before I give up.
See notes above and logs running "mesh root" to manually trigger esp_mesh_set_self_organized(true, true)

You said "The function of this scan cannot cover all situations because the judgment criteria are relatively single. Perhaps we can add more judgments?"
Can you please propose suitable tests.

I suspect the reason you can't reproduce the problem using your modified version of "ip_internal_network" is that you are only looking for new parents on higher layers and so you don't find one to switch to. You raised this with me and your code was only looking for layer -1

Can you please send me the source file (not just the patch file) so I can play with it.
I would also appreciate it if you could try this looking for a closer parent on the same or lower (numeric) layer, so that it actually finds one and calls esp_err_t err = esp_mesh_set_parent(&parent, (mesh_addr_t *)&parent_assoc.mesh_id, my_type, my_layer);
I suspect this breaks something which triggers the problems later when the parent or MESH_ROOT either power off or reboot causing combinations of the following:

Repeating <MESH_EVENT_PARENT_DISCONNECTED> event loop
Nodes stuck indefinitely on type MESH_IDLE
Nodes sitting on layers more than one level up from the parent eg node on layer 5 with parent on layer 2
Nodes connected to parent which is MESH_IDLE
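Three of the symptoms listed above can be detected mechanically from a snapshot of a node and its parent, which may help when comparing logs across nodes. The sketch below is plain C with hypothetical types (`node_snapshot_t`, `node_type_t`, the `ANOMALY_*` flags); how the parent's type and layer are obtained (for example from the parent's entry in a scan) is left to the application and is an assumption here.

```c
#include <assert.h>
#include <stddef.h>
#include <stdint.h>

/* Hypothetical mirror of the mesh node types. */
typedef enum { T_ROOT, T_NODE, T_LEAF, T_IDLE } node_type_t;

typedef struct {
    node_type_t type;
    int layer;
    node_type_t parent_type;
    int parent_layer;
    uint32_t idle_ms; /* how long type has been T_IDLE */
} node_snapshot_t;

enum {
    ANOMALY_STUCK_IDLE  = 1 << 0, /* idle longer than the allowed limit */
    ANOMALY_LAYER_GAP   = 1 << 1, /* layer is not parent layer + 1 */
    ANOMALY_PARENT_IDLE = 1 << 2  /* attached to a parent that is idle */
};

/* Returns a bitmask of ANOMALY_* flags, or 0 if the snapshot looks sane. */
static unsigned check_node(const node_snapshot_t *s, uint32_t idle_limit_ms)
{
    assert(s != NULL);
    unsigned flags = 0;
    if (s->type == T_IDLE) {
        if (s->idle_ms >= idle_limit_ms) {
            flags |= ANOMALY_STUCK_IDLE;
        }
        return flags; /* an idle node has no parent to compare against */
    }
    if (s->type != T_ROOT) {
        if (s->layer != s->parent_layer + 1) {
            flags |= ANOMALY_LAYER_GAP;
        }
        if (s->parent_type == T_IDLE) {
            flags |= ANOMALY_PARENT_IDLE;
        }
    }
    return flags;
}
```

For instance, the node reported on layer 5 with a parent on layer 2 would be flagged with `ANOMALY_LAYER_GAP`.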

zhangyanjiaoesp · 2024-12-03T03:47:47Z

From the Nov 29.zip log, running "mesh root" to manually trigger esp_mesh_set_self_organized(true, true) seems to work well. Then you can use it to stop the 201 loop.
I (65677) wifi:>>>intv = 102400, max = 0 this log was added as a debug log while addressing the crash issue. The solution to the crash has already been merged into release v5.3, but it has not yet been synced to GitHub. Once it is synced, you can update your IDF version.

You said "How about directly comparing with mesh_layer ?"
I don't understand what you mean here. Please elaborate

In your last comment, you let parent_assoc.layer = mesh_layer and in findClosestParent() function, you use assoc.layer <= parent_assoc.layer for comparison. So I suggest using mesh_layer for the comparison instead.

void findClosestParent(int num) { // after a WiFi scan
...
                if (assoc.layer <= mesh_layer || assoc.layer2_cap < parent_assoc.layer2_cap) { // parents on same layer or better qualify
...

The mesh network by default allows multiple roots. However, users can call the esp_mesh_allow_root_conflicts(false) to disable this feature, and once configured, only a single root will be allowed to exist.
Here is my mesh_main.c
mesh_main_1203.zip

michaelsimp · 2024-12-03T06:37:54Z

Hi

Thanks for your ongoing work on this. Very much appreciated as I am in deep with this contract.

Yes it works in terms of breaking out of the 201 loop, but sometimes it results in the software locking up.
Are you saying you have made some more changes to address the crash issue, which I don't have yet? I am currently running ESP IDF ver 5.3.1. Will the update be ver 5.3.2 or later so I can recognize it? Do you have an approx date for release?
Thank you for clarifying I will correct this
I have added esp_mesh_allow_root_conflicts(false) to my Wifi Mesh configuration.
Thanks for this, but if there are more updates which fix the crash (per item 2), I might wait for them to be released in interest of saving time.

Can I ask another unrelated question please.
I want to use a 4G modem using the USB bus on the ESP32-S3. I found an example in an Espressif Git repository https://github.com/espressif/esp-iot-solution called usb_cdc_4g_module. But I don't know how esp-iot-solution relates to IDF.
https://docs.espressif.com/projects/esp-iot-solution/en/latest/ states, "ESP-IoT-Solution contains device drivers and code frameworks for the development of IoT system, which works as extra components of ESP-IDF and much easier to start."
But I have found that esp-iot-solution seems to have a parallel overlapping directory structure of components and examples to IDF.
I copied usb_cdc_4g_module project and I tried to build it using IDF 5.3.1 but the USB modem component was missing, so I manually installed this. Now it builds but crashes soon after startup.
I don't want to enter a debugging session for this in this ticket, but I would appreciate it if you could explain how esp-iot-solution and IDF relate to each other?
Will the USB modem components be released under IDF? If so, roughly when?
If not, should I
The "Getting started" page states the latest version "master" corresponds to IDF ver >= 4.4, so it looks like it is not updated as much as IDF. So for me switching from IDF v5.3.1 to esp-iot-solution does not look viable.
I would appreciate any advice on how I can connect to a 4G modem using USB.

zhangyanjiaoesp · 2024-12-03T07:19:27Z

2. Are you saying you have made some more changes to address the crash issue, which I don't have yet? I am currently running ESP IDF ver 5.3.1. Will the update be ver 5.3.2 or later so I can recognize it? Do you have an approx date for release?

Yes, the v5.3.2 will contain the fix. (The fix in v5.4 is here). For the I (65677) wifi:>>>intv = 102400, max = 0, please refer to the previous comment, it solved the node crash issue, hope you have an impression of this.

I have added esp_mesh_allow_root_conflicts(false) to my Wifi Mesh configuration.

In a self-organized mesh network, if multiple roots are not allowed, when more than one root is created, the two roots will compare their RSSI and capacity values. The one with the better metrics will continue to act as the root, while the other will relinquish its root role and reconnect to the network.

zhangyanjiaoesp · 2024-12-03T07:22:21Z

For the ESP-IoT-Solution issue, please create a new issue under the esp-iot-solution project, the colleague in charge of this issue will reply to you.

Xiehanxin · 2024-12-03T07:39:24Z

Hi @michaelsimp, after you set IDF_PATH, you can directly build the iot-solution example under the iot-solution path.

michaelsimp · 2024-12-04T02:59:57Z

Hi

I don't think we understand each other's position.

Are you saying you have made some more changes to address the crash issue, which I don't have yet? I am currently running ESP IDF ver 5.3.1. Will the update be ver 5.3.2 or later so I can recognize it? Do you have an approx date for release?

Yes, the v5.3.2 will contain the fix. (The fix in v5.4 is here). For the I (65677) wifi:>>>intv = 102400, max = 0, please refer to the #14720 (comment), it solved the node crash issue, hope you have an impression of this.

You refer back to a post you made a couple of weeks ago, which said:

@michaelsimp
Yes, I added a fix to the wifi lib I provided to you. wifi:>>>intv = 102400, max = 0 this log was added by me to locate the problem. I'm glad to hear that the problem has been solved.

Have you tested the router reboot case? Does the root node crash still exist?

But I am confused as since then I have posted heaps of logs which still show multiple problems with the mesh network when the parent is powered off or rebooted, including:
A) the 201 loop. esp_mesh_set_self_organized(true, true) works in terms of breaking out of the 201 loop, but sometimes it results in the software locking up. This is still a serious problem for me.
B) nodes locking up
C) the child going to MESH_IDLE and not recovering or attempting to find a new parent. My attempts to trigger a scan for a new parent using

                esp_mesh_set_self_organized(false, false);
                esp_wifi_scan_stop();
                scan_config.show_hidden = 1;
                scan_config.scan_type = WIFI_SCAN_TYPE_PASSIVE;
                esp_wifi_scan_start(&scan_config, 0);

... result in lock up.

My posts continue through to the end of last week and then yesterday you stated

"The solution to the crash has already been merged into release v5.3, but it has not yet been synced to GitHub. Once it is synced, you can update your IDF version."

So if the latest update you have made was back before #14720 (comment), you need to understand my problem is most certainly not fixed.
I think you reported that this was done when looking for a parent on a lower numeric layer which with the tests both you and I did, no nodes were found so it never ran the code
esp_err_t err = esp_mesh_set_parent(&parent, (mesh_addr_t *)&parent_assoc.mesh_id, my_type, my_layer);
which I think is the trigger to breaking the mesh network.

Please confirm again if you have made more changes since this date. I tried to check the referenced "fix in v5.4 is here" link but I can't view it, as I just get "action not available". (I am not very familiar with GIT).

zhangyanjiaoesp · 2024-12-04T03:39:02Z

You said in the comment that you couldn't understand what this log meant.

Line 126 starts logging I (65677) wifi:>>>intv = 102400, max = 0. I don't know what this means

Then I replied to you that this log was added to solve the previous crash issue. I quoted my previous comment just to let you know that there have been similar logs before, and this fix has not been updated to GitHub yet, that's all. It has nothing to do with the other issues you reported later !!!

michaelsimp · 2024-12-04T05:23:18Z

Ok thanks for clarifying. I didn't understand the context and got excited at the prospects of a fix.
So is my case still open for action from your side, or do you need something more from me?

michaelsimp · 2024-12-11T20:54:29Z

Can I have an update please.

zhangyanjiaoesp · 2024-12-12T08:40:39Z

@michaelsimp

The IDF v5.3.2 has been released.
Regarding the other issues, could you provide a simpler and reproducible demo? First, during my testing, I am unable to set up the mesh devices in the same way as you. Additionally, the demo you previously provided involves operations such as QR code-based provisioning, console control, and others, while my local testing only uses the ip_internal_network example and performs periodic scanning. Our testing methods also differ. If you can simplify the steps and methods for reproducing the issue, I can conduct local tests and work on resolving it.

michaelsimp · 2024-12-12T19:39:21Z

Ok, but I will need to park it and come back as soon as I can. My project is far behind now and I have to make progress in other areas. Please keep this ticket open

zhangyanjiaoesp · 2024-12-13T03:35:58Z

sure.

michaelsimp added the Type: Bug bugs in IDF label Oct 14, 2024

espressif-bot added the Status: Opened Issue is new label Oct 14, 2024

github-actions bot changed the title ~~WiFi Mesh unstable when parent offline~~ WiFi Mesh unstable when parent offline (IDFGH-13875) Oct 14, 2024

espressif-bot assigned zhangyanjiaoesp Oct 14, 2024

espressif-bot added Status: In Progress Work is in progress and removed Status: Opened Issue is new labels Oct 29, 2024
