Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

WiFi based ESPHome BT Proxies error message - "Connection error occurred: EOF received" #87158

Closed
Anto79-ops opened this issue Feb 2, 2023 · 71 comments

Comments

@Anto79-ops
Copy link

The problem

Hi!

Since updating to 2023.2, I have been receiving these messages in my log file for each of my BT Proxies running on ESPHome 2022.12.8

Logger: aioesphomeapi.connection
Source: runner.py:128
First occurred: February 1, 2023 at 9:00:00 PM (170 occurrences)
Last logged: 6:32:30 AM

esp32-bluetooth-proxy-4ca0c4 @ 192.168.1.169: Connection error occurred: EOF received
esp32-bluetooth-proxy-4ca434 @ 192.168.1.115: Connection error occurred: EOF received
esp32-bluetooth-proxy-5f189c @ 192.168.1.168: Connection error occurred: EOF received
esp32-bluetooth-proxy-5a63a8 @ 192.168.1.180: Connection error occurred: EOF received
esp32-bluetooth-proxy-5e9b38 @ 192.168.1.126: Connection error occurred: EOF received

there is a second message that may be related, but it is not as often as the first above:

Logger: aioesphomeapi.connection
Source: /usr/local/lib/python3.10/site-packages/aioesphomeapi/connection.py:560
First occurred: 12:18:21 AM (2 occurrences)
Last logged: 5:51:33 AM

esp32-bluetooth-proxy-5a63a8 @ 192.168.1.180: Connection error occurred:
Traceback (most recent call last):
  File "/usr/local/lib/python3.10/site-packages/aioesphomeapi/connection.py", line 520, in send_message_await_response_complex
    await fut
asyncio.exceptions.CancelledError

During handling of the above exception, another exception occurred:

Traceback (most recent call last):
  File "/usr/local/lib/python3.10/site-packages/aioesphomeapi/connection.py", line 519, in send_message_await_response_complex
    async with async_timeout.timeout(timeout):
  File "/usr/local/lib/python3.10/site-packages/async_timeout/__init__.py", line 129, in __aexit__
    self._do_exit(exc_type)
  File "/usr/local/lib/python3.10/site-packages/async_timeout/__init__.py", line 212, in _do_exit
    raise asyncio.TimeoutError
asyncio.exceptions.TimeoutError

The above exception was the direct cause of the following exception:

Traceback (most recent call last):
  File "/usr/local/lib/python3.10/site-packages/aioesphomeapi/connection.py", line 303, in _keep_alive_loop
    await self._ping()
  File "/usr/local/lib/python3.10/site-packages/aioesphomeapi/connection.py", line 628, in _ping
    await self.send_message_await_response(PingRequest(), PingResponse)
  File "/usr/local/lib/python3.10/site-packages/aioesphomeapi/connection.py", line 537, in send_message_await_response
    res = await self.send_message_await_response_complex(
  File "/usr/local/lib/python3.10/site-packages/aioesphomeapi/connection.py", line 522, in send_message_await_response_complex
    raise TimeoutAPIError(
aioesphomeapi.core.TimeoutAPIError: Timeout waiting for response for <class 'api_pb2.PingRequest'> after 10.0s

What version of Home Assistant Core has the issue?

2023.2.0

What was the last working version of Home Assistant Core?

2023.1.7

What type of installation are you running?

Home Assistant OS

Integration causing the issue

ESPHome

Link to integration documentation on our website

https://www.home-assistant.io/integrations/esphome/

Diagnostics information

No response

Example YAML snippet

No response

Anything in the logs that might be useful for us?

No response

Additional information

No response

@home-assistant
Copy link

home-assistant bot commented Feb 2, 2023

Hey there @OttoWinter, @jesserockz, mind taking a look at this issue as it has been labeled with an integration (esphome) you are listed as a code owner for? Thanks!

Code owner commands

Code owners of esphome can trigger bot actions by commenting:

  • @home-assistant close Closes the issue.
  • @home-assistant rename Awesome new title Change the title of the issue.
  • @home-assistant reopen Reopen the issue.
  • @home-assistant unassign esphome Removes the current integration label and assignees on the issue, add the integration domain after the command.

(message by CodeOwnersMention)


esphome documentation
esphome source
(message by IssueLinks)

@bdraco
Copy link
Member

bdraco commented Feb 2, 2023

There is a chance this is related to #85432

@bdraco
Copy link
Member

bdraco commented Feb 2, 2023

Did the problem go away once you reflashed the device?

@bdraco
Copy link
Member

bdraco commented Feb 2, 2023

If not try connecting the device with a cable and collect logs from it when the connection drops.

Its possible its running out of ram and resetting over and over again

@Anto79-ops
Copy link
Author

@bdraco Maybe, for sure! I'm going to re-flash tonight (MST) with your suggestions from the other issue. Should have an answer for your in 6 to 8 hrs.

With regards to uptime and ram. I do have uptime and memory sensors on each BTProxy. Here are some overlays for the last week.

Uptime:

image

Memory for the same time period:

image

Why my core logs filling up with this EOF message, not sure what has happened only thing thing changed was 2023.2..., unless its referring to this:

image

IF you noticed in the logbook entries here, it becomes unavailable often, but this has been happening since day 1 of the BT integration, so unless the log level detail l has changed that it throws it into the core logs now?

@bdraco
Copy link
Member

bdraco commented Feb 3, 2023

We changed the logging a bit in this version so I'm guessing based on your comment above it's been going on for a while but we didn't have a good log message about why it was disconnecting.

I fixed a problem with sending empty packets via esphome/esphome#4284 that could result in the eof but its only in the dev version of esphome.

Are you using noise encryption?

@Anto79-ops
Copy link
Author

The EOF errors are continuing to log, 5 minutes in since re-flashing as per suggestion here.

I first heard of noise encryption on the last release party, so basically no I'm not using that encryption, just whatever is default.

There was a fellow who mentioned in the ESPHome discord channel (not Jesse) that the constant available/unavailable is "normal" because the ESP32 cannot share the BT and WiFi radio, so its constantly switching between BT/WiFi, hence the disconnects. This was months ago, don't know if things have changed or if there is any truth to this....does that make sense to you?

@Anto79-ops
Copy link
Author

I just realized I didn't catch logs while on serial as you asked above.... I'll get the logs to you tomorrow after the inkbird test....cheers

@Anto79-ops
Copy link
Author

Anto79-ops commented Feb 3, 2023

with regards to the logs, I was monitoring the logs via serial connection and in another window, watching the log book entries here:

image

In the 45 minutes that I was monitoring it, it become unavailable a few times, briefly, as shown in the log book entries, but no logs were shown (just a black window from the ESPHome website interface). Not sure why that is happening.

Hopefully the new ESPHome update will solve the EOF issue, its counting up the core log entries.

@Anto79-ops
Copy link
Author

just wondering, should I try 1 update for ESPHome Dev? the eof messages are approaching the thousands now in my core logs.

If you recommend a dev update, do I delete this entry in yaml

esp32:
  board: esp32dev
  framework:
    type: esp-idf
    version: 4.4.3
    platform_version: 5.3.0

press save, then install and THEN update to dev?

thanks

@bdraco
Copy link
Member

bdraco commented Feb 4, 2023

Keep the yaml regardless of version. That change isn't in dev yet

@bdraco
Copy link
Member

bdraco commented Feb 4, 2023

Trying out dev is a good idea though

@Anto79-ops
Copy link
Author

Awesome, I will update tonight!

@Anto79-ops
Copy link
Author

Hey, just wanted to update you. Last night I updated to Dev, 2023.2.0-dev Feb 4 2023, 18:28:40 (MST), and it was successful. However, I'm still see the logging.

Logger: aioesphomeapi.connection
Source: runner.py:128
First occurred: February 4, 2023 at 9:37:19 PM (236 occurrences)
Last logged: 9:14:20 AM

esp32-bluetooth-proxy-5f189c @ 192.168.1.168: Connection error occurred: EOF received
esp32-bluetooth-proxy-4ca434 @ 192.168.1.115: Connection error occurred: EOF received
esp32-bluetooth-proxy-5a63a8 @ 192.168.1.180: Connection error occurred: EOF received
esp32-bluetooth-proxy-4ca0c4 @ 192.168.1.169: Connection error occurred: EOF received
esp32-bluetooth-proxy-299fcc @ 192.168.1.137: Connection error occurred: EOF received

and

Logger: aioesphomeapi.connection
Source: /usr/local/lib/python3.10/site-packages/aioesphomeapi/connection.py:560
First occurred: 1:45:15 AM (4 occurrences)
Last logged: 9:10:01 AM

esp32-bluetooth-proxy-299fcc @ 192.168.1.137: Connection error occurred:
esp32-bluetooth-proxy-4ca0c4 @ 192.168.1.169: Connection error occurred:
esp32-bluetooth-proxy-4cae58 @ 192.168.1.170: Connection error occurred:
Traceback (most recent call last):
  File "/usr/local/lib/python3.10/site-packages/aioesphomeapi/connection.py", line 520, in send_message_await_response_complex
    await fut
asyncio.exceptions.CancelledError

During handling of the above exception, another exception occurred:

Traceback (most recent call last):
  File "/usr/local/lib/python3.10/site-packages/aioesphomeapi/connection.py", line 519, in send_message_await_response_complex
    async with async_timeout.timeout(timeout):
  File "/usr/local/lib/python3.10/site-packages/async_timeout/__init__.py", line 129, in __aexit__
    self._do_exit(exc_type)
  File "/usr/local/lib/python3.10/site-packages/async_timeout/__init__.py", line 212, in _do_exit
    raise asyncio.TimeoutError
asyncio.exceptions.TimeoutError

The above exception was the direct cause of the following exception:

Traceback (most recent call last):
  File "/usr/local/lib/python3.10/site-packages/aioesphomeapi/connection.py", line 303, in _keep_alive_loop
    await self._ping()
  File "/usr/local/lib/python3.10/site-packages/aioesphomeapi/connection.py", line 628, in _ping
    await self.send_message_await_response(PingRequest(), PingResponse)
  File "/usr/local/lib/python3.10/site-packages/aioesphomeapi/connection.py", line 537, in send_message_await_response
    res = await self.send_message_await_response_complex(
  File "/usr/local/lib/python3.10/site-packages/aioesphomeapi/connection.py", line 522, in send_message_await_response_complex
    raise TimeoutAPIError(
aioesphomeapi.core.TimeoutAPIError: Timeout waiting for response for <class 'api_pb2.PingRequest'> after 10.0s

I do want to say that, my BT proxies are operating just fine. I have 10 switchbots, 4 temp sensors, 6 ibeacons and I'm using my phone as a room presence detector (and it updates decently fast). The systems works really well (has been for months), the logs would suggest otherwise, but it is all good! Is it possible the the logging is just to sensitive?

@Anto79-ops
Copy link
Author

Anto79-ops commented Feb 9, 2023

Hi, just wanted to capture this comment here from another fellow is experiencing the same issue as me.

Also, ESPHome 2023.2.0b3 is available. Is this an update to my current Dev version of 2023.2.0-dev (Feb 5 2023) and if so, should I update?

thank you!

@Anto79-ops
Copy link
Author

Anto79-ops commented Feb 10, 2023

just wanted to provide an update that ESPHome 2023.2.0b3 still gives the error.

However, I'm starting to wonder if these error messages above are related to another issue that I'm also affected by since updating to 2023.2 here:

#87230

which has also made similar issues with my ZHA system

#87312

seems I'm breaking the 10 second rule, left right and center!

@bdraco
Copy link
Member

bdraco commented Feb 10, 2023

The profiler will only show whats running in the event loop.

To see everything you need py-spy: https://community.home-assistant.io/t/instructions-to-install-py-spy-on-haos/480473

@Anto79-ops
Copy link
Author

Ok, I'm going to try and install py-spy. I actually tried to do this before but found the instructions a bit too technical....I'll figure this out....or I'd be happy to provide you a chrome remote desktop connection session to you and you could directly connect to my computer while I do the physical parts (usb insert-removal etc). ...please let me know

@Anto79-ops
Copy link
Author

Ok, I got the key on my USB named CONFIG plugged into my HAOS instance:

I'm uncertain about the next:

"ha os import on an attached usb

Do I simply type ha os import on the command line?

@bdraco
Copy link
Member

bdraco commented Feb 10, 2023

ha os import in the terminal/ssh addon should work

@Anto79-ops
Copy link
Author

Anto79-ops commented Feb 10, 2023

ok thanks!

When I type

ssh root@hassio.local -p 22222

I get the message ssh: Could not resolve hostname hassio.local: Try again

Protection mode is off for the SSH and Web terminal addon. What did I do wrong?

@Anto79-ops
Copy link
Author

Anto79-ops commented Feb 10, 2023

I tried this instead:

ssh root@192.168.1.136 -p 22222

got me a bit further, but then it says

root@192.168.1.136: Permission denied (publickey).

my key is not good i guess?

@Anto79-ops
Copy link
Author

Anto79-ops commented Feb 10, 2023

ok, still hitting a road block here:

I'm using Notepad++ and puttygen

generating the key by moving my mouse randomly on puttygen. copied the generate key from the "Public key for pasting into OpenSSH authorized_keys file" text box, pasted into Notepad++

Its an RSA key with 2048 bits.

check that is ANSI (check)
check that is it LF EOL (check)

save file as authorized_keys (no extension)

USB is NTFS 8GB

Place the usb into the RPi 4 port

in terminal addon (from community repo) I type

ha os import

say it imports successfully,

then type

ssh root@homeassistant.local -p 22222
ssh root@hassio.local -p 22222
ssh root@192.168.1.136 -p 22222

all of which leads to

root@[ as X option above]: permission denied (publickey).

EDIT: it says "Use the CLI (eg. SSH to the SSH add-on on port 22)"...does this just mean to use the terminal add-on in HA UI?

@Anto79-ops
Copy link
Author

Anto79-ops commented Feb 11, 2023

Made some progress!

I followed this video, didn't understand 1 word he was saying but I was able to log in as root

Ok, I believe py-spy is now installed or (copied at least into the correct directory)

Install

instructions then say

"There should be a py-spy binary once you call unzip in one of the directories it creates"

How do I call unzip?

and for the PID process, it is 61 in my case? this top in bash 5.1

image

but there is a previous directory, i,.e. the one before this command was sent:

docker exec -it homeassistant /bin/bash

baby steps!

@Anto79-ops
Copy link
Author

ok some more progress here, think I unzipped correctly here:

bash-5.1# unzip py_spy-0.3.12-py2.py3-none-manylinux_2_17_armv7l.manylinux2014_a                                                                              rmv7l.whl
Archive:  py_spy-0.3.12-py2.py3-none-manylinux_2_17_armv7l.manylinux2014_armv7l.                                                                              whl
replace py_spy-0.3.12.dist-info/METADATA? [y]es, [n]o, [A]ll, [N]one, [r]ename:y
  inflating: py_spy-0.3.12.dist-info/METADATA
replace py_spy-0.3.12.dist-info/WHEEL? [y]es, [n]o, [A]ll, [N]one, [r]ename: a
error: invalid response [a]
replace py_spy-0.3.12.dist-info/WHEEL? [y]es, [n]o, [A]ll, [N]one, [r]ename: A
  inflating: py_spy-0.3.12.dist-info/WHEEL
  inflating: py_spy-0.3.12.dist-info/license_files/LICENSE
  inflating: py_spy-0.3.12.data/scripts/py-spy
  inflating: py_spy-0.3.12.dist-info/RECORD

I basically said yes to all

Then, I typed this:

cd py_spy-0.3.12.data
cd scripts

while in scripts directory

I typed a few commands:

bash-5.1# pwd
/config/py_spy-0.3.12.data/scripts
bash-5.1# ./py-spy top --pid 61
thread '<unnamed>' panicked at 'called `Result::unwrap()` on an `Err` value: ParseIntError { kind: PosOverflow }', /root/.cargo/registry/src/github.com-1ecc62                                                                99db9ec823/proc-maps-0.2.1/src/linux_maps.rs:81:65
note: run with `RUST_BACKTRACE=1` environment variable to display a backtrace
Error: receiving on a closed channel

bash-5.1# ./py-spy top --pid 60
thread '<unnamed>' panicked at 'called `Result::unwrap()` on an `Err` value: ParseIntError { kind: PosOverflow }', /root/.cargo/registry/src/github.com-1ecc62                                                                99db9ec823/proc-maps-0.2.1/src/linux_maps.rs:81:65
note: run with `RUST_BACKTRACE=1` environment variable to display a backtrace
Error: receiving on a closed channel

The instructions say to use PID 60, but I think mine is 61, so I tried both.

They both lead to errors above.

Do you know what the issue is here?

@bdraco
Copy link
Member

bdraco commented Feb 11, 2023

.12 is an older version. Try .14

https://github.com/benfred/py-spy/releases/tag/v0.3.14

@bdraco bdraco changed the title ESPHome BT Proxies error message - "Connection error occurred: EOF received" WiFi based ESPHome BT Proxies error message - "Connection error occurred: EOF received" Feb 22, 2023
@bdraco
Copy link
Member

bdraco commented Feb 22, 2023

@Anto79-ops assuming the defaults fix it, can you try the values in this PR as well?

esphome/bluetooth-proxies#71

@Anto79-ops
Copy link
Author

Some good news to report! with these settings:

esp32_ble_tracker:
  scan_parameters:
    interval: 320ms
    window: 30ms
    active: true

There has been a significant improvement, in fact, since changing the parameters, I've seen only 1 EOF warning message in the logs. Typically, I would would have seen 40 to 60 occurrences by now.

Trying the PR suggested values now, as below:

esp32_ble_tracker:
  scan_parameters:
    interval: 450ms
    window: 160ms
    active: true

I'll post an update in a couple of hours.

May I ask, would this change affect the performance (good or bad) of my proxy network?

Thanks

@Anto79-ops
Copy link
Author

ok, I think its fair to say that the PR suggested values are not as good as the default ones in terms of improving the EOF issue.

In 7 minutes, I've already got 7 EOF occurrences since changing the interval and window times.

@bdraco
Copy link
Member

bdraco commented Feb 23, 2023

Thanks for the report. Glad the defaults are working. We probably need to do a bit more experimenting to get values that work best for the Wi-Fi ones.

The downside is it means less time listening for Bluetooth and more time preferring Wi-Fi but likely that isn't an issue unless you have devices that broadcast infrequently

@Anto79-ops
Copy link
Author

Anytime.

BTW, the generic yaml has them at 1100 ms...so this is where it came from in the first place.

https://github.com/esphome/bluetooth-proxies/blob/main/esp32-generic.yaml

@Anto79-ops
Copy link
Author

I went back to default and things are going well. In about 10 hrs, I only recieved 12 EOF messages and about 7 api ping messages. Massive reduction of by almost 97%

One other thing I noticed since changing the intervals, is a reduction in CPU usage, by almost 5%. Does this make sense to you?

You can set here, green diamond was when the change to default was made... Maybe because it's logging less, it helps the CPU a bit.

_20230223_034756.JPG

@bdraco
Copy link
Member

bdraco commented Feb 23, 2023

Rebuilding the connection isn't cheap but I wouldn't expect it to be that expensive. Also wouldn't be incredibly surprised if it was that expensive though either.

@nagyrobi
Copy link
Contributor

nagyrobi commented Mar 3, 2023

@Anto79-ops were did you fail with py-spy?

  • did you try the latest release?
  • did you restart the ssh addon after disabling protection mode?
  • are you sure you're quering it against the corredt pid?

@Anto79-ops
Copy link
Author

@nagyrobi thanks, it fails when I try to start it.

  1. I did try the latest release v0.3.14
  2. I don't think I did this, but I did restart the whole system (not just core)....and protection mode is disabled

image

  1. yes, I believe I am querying the right one, the error message would be different if I did not. This what happens when I try to start it with py-spy
bash-5.1# pwd
/config/py_spy-0.3.12.data/scripts
bash-5.1# ./py-spy top --pid 61
thread '<unnamed>' panicked at 'called `Result::unwrap()` on an `Err` value: ParseIntError { kind: PosOverflow }', /root/.cargo/registry/src/github.com-1ecc62                                                                99db9ec823/proc-maps-0.2.1/src/linux_maps.rs:81:65
note: run with `RUST_BACKTRACE=1` environment variable to display a backtrace
Error: receiving on a closed channel

bdraco mentioned to update rust, what version do you have? and how do I check my version and/or update if necessary.

thanks

@marsp88
Copy link

marsp88 commented Mar 3, 2023

Hey!
I found this issue a while back and have been following it along. Unfortunately I ran into every issue described here and nothing fixed the problem.
That was until the other day, I turned WMM Mode on on my OpenWrt access point. And for some reason I have very little to no reconnects now.
I know there is a lot of discussion out there around Wifi settings and issues in connection with ESP devices. And I used to have problems with WMM turned on too. A lot of devices did not connect or lost connection all the time, hence I had turned it off on my 2.4 GHz network.
Now, with WMM turned on again, the only devices disconnecting every once in a while is a tasmota switch. And I can totally live with that.
I am not sure this will help you, but figured I'd share, since I was surprised that that solved the problem for me.

@Anto79-ops
Copy link
Author

@marsp88 thanks for sharing

I actually do have a OpenWRT router (Linksys WRT3200 ACM), but its running default firmware (from Linksys what it had out of the box. and unless WMM is disabled by default, it should still be disabled...because I have no where that setting even is!) AND i actually have the router wifi disabled because I'm using 4 access points scattered throughout my home, which is hardwired to the router via ethernet, all my wifi devices connect to these access points, which then communicates to the router that way. The router actually thinks I have only ethernet wired devices, as a result. These wifi access points are marketed as wifi boosters from my ISP (they are not mesh networks).

Would this issue still apply?

@nagyrobi
Copy link
Contributor

nagyrobi commented Mar 6, 2023

I actually do have a OpenWRT router (Linksys WRT3200 ACM), but its running default firmware (from Linksys what it had out of the box.

If that's the case, that doesn't qualify as an OpenWRT router. You can only name it like that if the firmware running on it is a real OpenWRT build. Otherwise, it's just a simple Linksys WRT...

Besides, that wouldn't matter anyway, especially when addressing WMM, because you just said its WiFi is disabled, and you use 4 other, separate APs. In that case, is WMM enabled on those? That would matter.

What kind of WiFi boosters do you use? Are they broadcasting the same SSID?

@Anto79-ops
Copy link
Author

Anto79-ops commented Mar 6, 2023

Its mostly a mystery what these boosters are, other than they provided by my ISP and are called "Telus Wifi Boosters" Everything is dumbed down from the ISP, but its the only thing I use from them.

However, if you look a little deeper they have model number on them

Arcadyan Technology WE410443 Wi-Fi Repeater

They are all broadcasting the same SSID.

EDIT: checking the booster settings, there is no such setting for WMM.

@nagyrobi
Copy link
Contributor

nagyrobi commented Mar 6, 2023

Is it this one:
https://www.singtel.com/content/dam/singtel/business/sb/Support/connectivity/routers/singnet/Singtel_Mesh_Extender_WE410443-ST_v1.2.pdf

There's something fishy imho according to pages 9-10:

  • Seamless roaming Clients - moving from AP to AP with better signal strength without user intervention.
  • Band Steering - is a function on the APs which kicks clients from 2.4GHz frequency trying to force them to reconnect on 5GHz
  • Self-Healing?? What could that be? Maybe one of your APs is constantly healing itself?
  • Support Arcadyan OWL, a proprietary Mesh solution which can NOT migrate...

These are not ESP-friendly functions...

How's your Home Assistant connected to the network? Wired or WiFi?
How many total wifi client devices you have around the house?

@Anto79-ops
Copy link
Author

Anto79-ops commented Mar 7, 2023

Yeah I wonder myself these are great points you mention.

But I'm not going to lie Wi-Fi in my home is actually pretty good. Some of our laptops can hit 500 Mbps on wifi, but usually it's 80 to 300 Mbps. Wired devices devices hit up to 1.2 Gbps (fiber optic internet).

With about 96 devices on my wifi/ethernet network, about 30 are on ethernet, including HAOS, and my Ubuntu mqtt broker.

It's clear though, ethernet has has the advantage.

I saw the option to turn off the band steering on the boosters..... but honestly if it's just the ESP devices that are causing issues I might as well just upgrade them to olimex ones.

These are my actual booster devices, doesn't look exactly the same as the photo but probably the insides are the same because it's the same model.

DSC_0003.JPG

DSC_0004.JPG

@nagyrobi
Copy link
Contributor

nagyrobi commented Mar 7, 2023

To isolate if it's just this magic AP system being the culprit, how about trying:

  • check with a wifi scanner app, how many APs (MAC addresses broadcasting) do you see behind the same SSID? 1, 4, 8 or more? Maybe a rogue AP not part of this mesh but using the same SSID (and key), interfering? Your neighbour having fun, perhaps?
  • what's the channel width on 2.4GHz? ESP only supports HT20 and HT40, but I feel that ESPHome defaults to HT20. Now if your AP wants to communicate at HT40 while your ESP wants to use HT20 that will lead to performance degradation on both sides.
  • disable band steering. It usually causes problems with other stuff too. Implementation quality differs at vendors, and there's a reason that the "dumbed down" AP exposes this switch while other settings are hidden...
  • turn off all (pull the power) APs except one, to make sure there's no magic meshing, wait 30 mins, see if ESP is still producing the error
  • eventually spin up the Linksys's own WiFi on a different IoT SSID only on 2.4GHz HT20, and connect the ESP to that instead, see if ESP is still producing the error

And yes, you have a lot of WiFi devices sharing the airtime of only 4 APs, it may simply be the issue of overloading them. Strangely the manufacturer doesn't give in the specs the max supported clients number...
Wouldn't even be surprised it's something totally different - with all your devices in the same broadcast domain, a lot of useless chatter is being distributed to all clients (wired and wireless). At this level, some VLAN and SSID separation would be beneficial...

Given your environment, I'd be sure to only use Ethernet based ESP devices (with ext antenna) for Bluetooth proxying. It's all maxed out...

@Anto79-ops
Copy link
Author

Anto79-ops commented Mar 7, 2023

awesome information here, thanks! Let me look into these and see what I find. I should've disabled band steering long time ago! Thankfully, in the advanced options of these APs, I can see which and how many devices are connected to them.

There seems to be something choking my event loops (as I'm getting many messages in my logs, that...xx is taking more than 10 seconds even long after HA has restarted, and even things not related to ESPHome are saying this). I'm hoping the issue will be obvious once I get py-spy running on my instance.

@Anto79-ops
Copy link
Author

ok, I went in to the settings for these boosters and I can answer some questions:

  1. The 2.4 GHz is set to HT20 only. But I do I have the option to set it to HT20 and HT40 (both).

These AP actually put out some detailed logs! So I downloaded the logs and searched for 1 BT Proxy to see its behavior at a given time.

This proxy, 40:22:d8:4c:a0:c4, is connected to one of my AP (it does not hop, because this mac address does not show up on other AP. This is only a sample of what I see in the Family Room log.

Does this say anything? not sure what IF or OWL., but notice it says "connected" a lot. Signal for this BT proxy is between -40 to -60 dBm, so its not a signal strength issue.

==============================Family Room_log.log==============================
06.03.2023 18:45:02 [4 4 1] IF[2.4G]:STA(40:22:d8:4c:a0:c4) send group Msg1
06.03.2023 18:45:02 [4 4 1] IF[2.4G]:STA(40:22:d8:4c:a0:c4) receive group Msg2
06.03.2023 18:46:06 [2 4 2] OWL-(Family Room)IF[2.4G](EC:F4:51:AA:85:58):STA(40:22:D8:4C:A0:C4) connected.
06.03.2023 19:45:04 [4 4 1] IF[2.4G]:STA(40:22:d8:4c:a0:c4) send group Msg1
06.03.2023 19:45:04 [4 4 1] IF[2.4G]:STA(40:22:d8:4c:a0:c4) receive group Msg2
06.03.2023 19:45:51 [2 4 2] OWL-(Family Room)IF[2.4G](EC:F4:51:AA:85:58):STA(40:22:D8:4C:A0:C4) connected.
06.03.2023 20:45:06 [4 4 1] IF[2.4G]:STA(40:22:d8:4c:a0:c4) send group Msg1
06.03.2023 20:45:06 [4 4 1] IF[2.4G]:STA(40:22:d8:4c:a0:c4) receive group Msg2
06.03.2023 20:45:31 [2 4 2] OWL-(Family Room)IF[2.4G](EC:F4:51:AA:85:58):STA(40:22:D8:4C:A0:C4) connected.
06.03.2023 21:45:08 [4 4 1] IF[2.4G]:STA(40:22:d8:4c:a0:c4) send group Msg1
06.03.2023 21:45:08 [4 4 1] IF[2.4G]:STA(40:22:d8:4c:a0:c4) receive group Msg2
06.03.2023 21:46:01 [2 4 2] OWL-(Family Room)IF[2.4G](EC:F4:51:AA:85:58):STA(40:22:D8:4C:A0:C4) connected.
06.03.2023 22:45:10 [4 4 1] IF[2.4G]:STA(40:22:d8:4c:a0:c4) send group Msg1
06.03.2023 22:45:10 [4 4 1] IF[2.4G]:STA(40:22:d8:4c:a0:c4) receive group Msg2
06.03.2023 22:46:06 [2 4 2] OWL-(Family Room)IF[2.4G](EC:F4:51:AA:85:58):STA(40:22:D8:4C:A0:C4) connected.
06.03.2023 23:45:12 [4 4 1] IF[2.4G]:STA(40:22:d8:4c:a0:c4) send group Msg1
06.03.2023 23:45:12 [4 4 1] IF[2.4G]:STA(40:22:d8:4c:a0:c4) receive group Msg2
06.03.2023 23:45:58 [2 4 2] OWL-(Family Room)IF[2.4G](EC:F4:51:AA:85:58):STA(40:22:D8:4C:A0:C4) connected.
07.03.2023 00:45:14 [4 4 1] IF[2.4G]:STA(40:22:d8:4c:a0:c4) send group Msg1
07.03.2023 00:45:14 [4 4 1] IF[2.4G]:STA(40:22:d8:4c:a0:c4) receive group Msg2
07.03.2023 00:45:35 [2 4 2] OWL-(Family Room)IF[2.4G](EC:F4:51:AA:85:58):STA(40:22:D8:4C:A0:C4) connected.
07.03.2023 01:45:16 [4 4 1] IF[2.4G]:STA(40:22:d8:4c:a0:c4) send group Msg1
07.03.2023 01:45:16 [4 4 1] IF[2.4G]:STA(40:22:d8:4c:a0:c4) receive group Msg2
07.03.2023 01:45:32 [2 4 2] OWL-(Family Room)IF[2.4G](EC:F4:51:AA:85:58):STA(40:22:D8:4C:A0:C4) connected.
07.03.2023 02:45:18 [4 4 1] IF[2.4G]:STA(40:22:d8:4c:a0:c4) send group Msg1
07.03.2023 02:45:18 [4 4 1] IF[2.4G]:STA(40:22:d8:4c:a0:c4) receive group Msg2
07.03.2023 02:45:18 [2 4 2] OWL-(Family Room)IF[2.4G](EC:F4:51:AA:85:58):STA(40:22:D8:4C:A0:C4) connected.
07.03.2023 03:45:20 [4 4 1] IF[2.4G]:STA(40:22:d8:4c:a0:c4) send group Msg1
07.03.2023 03:45:20 [4 4 1] IF[2.4G]:STA(40:22:d8:4c:a0:c4) receive group Msg2
07.03.2023 03:45:43 [2 4 2] OWL-(Family Room)IF[2.4G](EC:F4:51:AA:85:58):STA(40:22:D8:4C:A0:C4) connected.
07.03.2023 04:45:22 [4 4 1] IF[2.4G]:STA(40:22:d8:4c:a0:c4) send group Msg1
07.03.2023 04:45:22 [4 4 1] IF[2.4G]:STA(40:22:d8:4c:a0:c4) receive group Msg2
07.03.2023 04:45:31 [2 4 2] OWL-(Family Room)IF[2.4G](EC:F4:51:AA:85:58):STA(40:22:D8:4C:A0:C4) connected.
07.03.2023 05:45:24 [4 4 1] IF[2.4G]:STA(40:22:d8:4c:a0:c4) send group Msg1
07.03.2023 05:45:24 [4 4 1] IF[2.4G]:STA(40:22:d8:4c:a0:c4) receive group Msg2
07.03.2023 05:46:21 [2 4 2] OWL-(Family Room)IF[2.4G](EC:F4:51:AA:85:58):STA(40:22:D8:4C:A0:C4) connected.

there is more I just stopped here

@Anto79-ops
Copy link
Author

looking into my logs more, I really think this not a wifi issue. Here's what I found.

Here's an example of a disconnect that occurred in HA as reported in my HA Core logs

2023-03-06 12:36:19.241 WARNING (MainThread) [aioesphomeapi.connection] esp32-bluetooth-proxy-5e9b38 @ 192.168.1.126: Connection error occurred: EOF received

this particular ESP32 has a mac address of 0c:dc:7e:5e:9b:38

In my booster APs logs, this particular device only shows up on one of 4 of booster APs in my home (that's the one that it is nearest to), and shown below is 1 hr before and 1 hr after that EOF event as reported by HA. I've placed asterisk (****) every time an event as logged for this particular BT proxy's mac address:

06.03.2023 11:44:54 [4 4 1] IF[2.4G]:STA(b8:3a:9d:21:bc:34) send group Msg1
06.03.2023 11:44:54 [4 4 1] IF[2.4G]:STA(c8:ff:77:db:d7:90) send group Msg1
06.03.2023 11:44:54 [4 4 1] IF[2.4G]:STA(38:1a:52:0c:6f:3a) send group Msg1
06.03.2023 11:44:54 [4 4 1] IF[2.4G]:STA(b8:d6:1a:87:d8:b8) send group Msg1
06.03.2023 11:44:54 [4 4 1] IF[2.4G]:STA(c4:5b:be:6d:e2:1e) send group Msg1
06.03.2023 11:44:54 [4 4 1] IF[2.4G]:STA(24:f5:a2:f9:d6:a1) send group Msg1
06.03.2023 11:44:54 [4 4 1] IF[2.4G]:STA(0c:dc:7e:5e:9b:38) send group Msg1 ****
06.03.2023 11:44:54 [4 4 1] IF[2.4G]:STA(a0:cc:2b:4e:61:98) send group Msg1
06.03.2023 11:44:54 [4 4 1] IF[2.4G]:STA(c8:c9:a3:5b:df:28) send group Msg1
06.03.2023 11:44:54 [4 4 1] IF[2.4G]:STA(24:18:c6:19:1f:13) send group Msg1
06.03.2023 11:44:54 [4 4 1] IF[2.4G]:STA(c8:ff:77:db:d7:90) receive group Msg2
06.03.2023 11:44:54 [4 4 1] IF[2.4G]:STA(b8:d6:1a:87:d8:b8) receive group Msg2
06.03.2023 11:44:54 [4 4 1] IF[2.4G]:STA(c4:5b:be:6d:e2:1e) receive group Msg2
06.03.2023 11:44:55 [4 4 1] IF[2.4G]:STA(24:18:c6:19:1f:13) receive group Msg2
06.03.2023 11:44:55 [4 4 1] IF[2.4G]:STA(b8:3a:9d:21:bc:34) receive group Msg2
06.03.2023 11:44:55 [4 4 1] IF[2.4G]:STA(24:f5:a2:f9:d6:a1) receive group Msg2
06.03.2023 11:44:55 [4 4 1] IF[2.4G]:STA(0c:dc:7e:5e:9b:38) receive group Msg2 ****
06.03.2023 11:44:55 [4 4 1] IF[2.4G]:STA(c8:c9:a3:5b:df:28) receive group Msg2
06.03.2023 11:44:55 [4 4 1] IF[2.4G]:STA(38:1a:52:0c:6f:3a) receive group Msg2
06.03.2023 11:44:55 [4 4 1] IF[2.4G]:STA(a0:cc:2b:4e:61:98) receive group Msg2
06.03.2023 11:45:03 [4 4 1] IF[5G]:STA(c8:ff:77:35:0e:13) send group Msg1
06.03.2023 11:45:03 [4 4 1] IF[5G]:STA(30:fd:38:95:6e:f3) send group Msg1
06.03.2023 11:45:03 [4 4 1] IF[5G]:STA(d4:f5:47:22:d2:c3) send group Msg1
06.03.2023 11:45:03 [4 4 1] IF[5G]:STA(00:02:d1:76:7a:e6) send group Msg1
06.03.2023 11:45:03 [4 4 1] IF[5G]:STA(d4:9c:dd:11:45:6a) send group Msg1
06.03.2023 11:45:03 [4 4 1] IF[5G]:STA(54:60:09:3b:c8:14) send group Msg1
06.03.2023 11:45:03 [4 4 1] IF[5G]:STA(20:df:b9:60:80:14) send group Msg1
06.03.2023 11:45:03 [4 4 1] IF[5G]:STA(14:75:5b:60:e4:ba) send group Msg1
06.03.2023 11:45:03 [4 4 1] IF[5G]:STA(3c:31:78:01:4e:bb) send group Msg1 
06.03.2023 11:45:03 [4 4 1] IF[5G]:STA(c8:ff:77:35:0e:13) receive group Msg2
06.03.2023 11:45:03 [4 4 1] IF[5G]:STA(d4:f5:47:22:d2:c3) receive group Msg2
06.03.2023 11:45:03 [4 4 1] IF[5G]:STA(54:60:09:3b:c8:14) receive group Msg2
06.03.2023 11:45:03 [4 4 1] IF[5G]:STA(30:fd:38:95:6e:f3) receive group Msg2
06.03.2023 11:45:03 [4 4 1] IF[5G]:STA(14:75:5b:60:e4:ba) receive group Msg2
06.03.2023 11:45:03 [4 4 1] IF[5G]:STA(20:df:b9:60:80:14) receive group Msg2
06.03.2023 11:45:03 [4 4 1] IF[5G]:STA(00:02:d1:76:7a:e6) receive group Msg2
06.03.2023 11:45:04 [4 4 1] IF[5G]:STA(3c:31:78:01:4e:bb) receive group Msg2
06.03.2023 11:45:04 [4 4 1] IF[5G]:STA(d4:9c:dd:11:45:6a) receive group Msg2
06.03.2023 11:45:53 [2 2 2] OWL-Sync config failed for Found a configured RE with different group id connects to current group
06.03.2023 12:44:56 [4 4 1] IF[2.4G]:STA(b8:3a:9d:21:bc:34) send group Msg1
06.03.2023 12:44:56 [4 4 1] IF[2.4G]:STA(c8:ff:77:db:d7:90) send group Msg1
06.03.2023 12:44:56 [4 4 1] IF[2.4G]:STA(38:1a:52:0c:6f:3a) send group Msg1
06.03.2023 12:44:56 [4 4 1] IF[2.4G]:STA(b8:d6:1a:87:d8:b8) send group Msg1
06.03.2023 12:44:56 [4 4 1] IF[2.4G]:STA(c4:5b:be:6d:e2:1e) send group Msg1
06.03.2023 12:44:56 [4 4 1] IF[2.4G]:STA(24:f5:a2:f9:d6:a1) send group Msg1
06.03.2023 12:44:56 [4 4 1] IF[2.4G]:STA(0c:dc:7e:5e:9b:38) send group Msg1 ****
06.03.2023 12:44:56 [4 4 1] IF[2.4G]:STA(a0:cc:2b:4e:61:98) send group Msg1
06.03.2023 12:44:56 [4 4 1] IF[2.4G]:STA(c8:c9:a3:5b:df:28) send group Msg1
06.03.2023 12:44:56 [4 4 1] IF[2.4G]:STA(24:18:c6:19:1f:13) send group Msg1
06.03.2023 12:44:56 [4 4 1] IF[2.4G]:STA(38:1a:52:0c:6f:3a) receive group Msg2
06.03.2023 12:44:57 [4 4 1] IF[2.4G]:STA(b8:3a:9d:21:bc:34) receive group Msg2
06.03.2023 12:44:57 [4 4 1] IF[2.4G]:STA(24:f5:a2:f9:d6:a1) receive group Msg2
06.03.2023 12:44:57 [4 4 1] IF[2.4G]:STA(c8:c9:a3:5b:df:28) receive group Msg2
06.03.2023 12:44:57 [4 4 1] IF[2.4G]:STA(c4:5b:be:6d:e2:1e) receive group Msg2
06.03.2023 12:44:57 [4 4 1] IF[2.4G]:STA(24:18:c6:19:1f:13) receive group Msg2
06.03.2023 12:44:57 [4 4 1] IF[2.4G]:STA(c8:ff:77:db:d7:90) receive group Msg2
06.03.2023 12:44:57 [4 4 1] IF[2.4G]:STA(0c:dc:7e:5e:9b:38) receive group Msg2 ****
06.03.2023 12:44:57 [4 4 1] IF[2.4G]:STA(b8:d6:1a:87:d8:b8) receive group Msg2
06.03.2023 12:44:57 [4 4 1] IF[2.4G]:STA(a0:cc:2b:4e:61:98) receive group Msg2
06.03.2023 12:45:05 [4 4 1] IF[5G]:STA(c8:ff:77:35:0e:13) send group Msg1
06.03.2023 12:45:05 [4 4 1] IF[5G]:STA(30:fd:38:95:6e:f3) send group Msg1
06.03.2023 12:45:05 [4 4 1] IF[5G]:STA(d4:f5:47:22:d2:c3) send group Msg1
06.03.2023 12:45:05 [4 4 1] IF[5G]:STA(00:02:d1:76:7a:e6) send group Msg1
06.03.2023 12:45:05 [4 4 1] IF[5G]:STA(d4:9c:dd:11:45:6a) send group Msg1
06.03.2023 12:45:05 [4 4 1] IF[5G]:STA(54:60:09:3b:c8:14) send group Msg1
06.03.2023 12:45:05 [4 4 1] IF[5G]:STA(20:df:b9:60:80:14) send group Msg1
06.03.2023 12:45:05 [4 4 1] IF[5G]:STA(14:75:5b:60:e4:ba) send group Msg1
06.03.2023 12:45:05 [4 4 1] IF[5G]:STA(3c:31:78:01:4e:bb) send group Msg1
06.03.2023 12:45:05 [4 4 1] IF[5G]:STA(d4:f5:47:22:d2:c3) receive group Msg2
06.03.2023 12:45:05 [4 4 1] IF[5G]:STA(20:df:b9:60:80:14) receive group Msg2
06.03.2023 12:45:05 [4 4 1] IF[5G]:STA(14:75:5b:60:e4:ba) receive group Msg2
06.03.2023 12:45:05 [4 4 1] IF[5G]:STA(30:fd:38:95:6e:f3) receive group Msg2
06.03.2023 12:45:05 [4 4 1] IF[5G]:STA(54:60:09:3b:c8:14) receive group Msg2
06.03.2023 12:45:05 [4 4 1] IF[5G]:STA(c8:ff:77:35:0e:13) receive group Msg2
06.03.2023 12:45:06 [4 4 1] IF[5G]:STA(00:02:d1:76:7a:e6) receive group Msg2
06.03.2023 12:45:06 [4 4 1] IF[5G]:STA(3c:31:78:01:4e:bb) receive group Msg2
06.03.2023 12:45:06 [4 4 1] IF[5G]:STA(d4:9c:dd:11:45:6a) receive group Msg2
06.03.2023 12:57:38 [2 4 2] IF[2.4G]:STA(38:78:62:05:51:55) had associated successfully,rssi is -63,channel load is 53
06.03.2023 12:57:38 [4 4 1] IF[2.4G]:STA(38:78:62:05:51:55) send Msg1 of 4-way
06.03.2023 12:57:38 [4 4 1] IF[2.4G]:STA(38:78:62:05:51:55) receive Msg2 of 4-way
06.03.2023 12:57:38 [4 4 1] IF[2.4G]:STA(38:78:62:05:51:55) send Msg3 of 4-way
06.03.2023 12:57:38 [4 4 1] IF[2.4G]:STA(38:78:62:05:51:55) receive Msg4 of 4-way
06.03.2023 13:28:08 [2 4 2] OWL-Sync syncntp done.
06.03.2023 13:28:08 [2 4 6] NTP-Time Server successfully connected and time synchronisation achieved.
06.03.2023 13:35:52 [4 4 1] IF[2.4G]:STA(38:78:62:05:51:55) had been aged-out and disassociated
06.03.2023 13:44:58 [4 4 1] IF[2.4G]:STA(b8:3a:9d:21:bc:34) send group Msg1
06.03.2023 13:44:58 [4 4 1] IF[2.4G]:STA(c8:ff:77:db:d7:90) send group Msg1
06.03.2023 13:44:58 [4 4 1] IF[2.4G]:STA(38:1a:52:0c:6f:3a) send group Msg1
06.03.2023 13:44:58 [4 4 1] IF[2.4G]:STA(b8:d6:1a:87:d8:b8) send group Msg1
06.03.2023 13:44:58 [4 4 1] IF[2.4G]:STA(c4:5b:be:6d:e2:1e) send group Msg1
06.03.2023 13:44:58 [4 4 1] IF[2.4G]:STA(24:f5:a2:f9:d6:a1) send group Msg1
06.03.2023 13:44:58 [4 4 1] IF[2.4G]:STA(0c:dc:7e:5e:9b:38) send group Msg1 ****
06.03.2023 13:44:58 [4 4 1] IF[2.4G]:STA(a0:cc:2b:4e:61:98) send group Msg1
06.03.2023 13:44:58 [4 4 1] IF[2.4G]:STA(c8:c9:a3:5b:df:28) send group Msg1
06.03.2023 13:44:58 [4 4 1] IF[2.4G]:STA(24:18:c6:19:1f:13) send group Msg1
06.03.2023 13:44:58 [4 4 1] IF[2.4G]:STA(c8:ff:77:db:d7:90) receive group Msg2
06.03.2023 13:44:58 [4 4 1] IF[2.4G]:STA(b8:3a:9d:21:bc:34) receive group Msg2
06.03.2023 13:44:58 [4 4 1] IF[2.4G]:STA(b8:d6:1a:87:d8:b8) receive group Msg2
06.03.2023 13:44:58 [4 4 1] IF[2.4G]:STA(c8:c9:a3:5b:df:28) receive group Msg2
06.03.2023 13:44:58 [4 4 1] IF[2.4G]:STA(c4:5b:be:6d:e2:1e) receive group Msg2
06.03.2023 13:44:58 [4 4 1] IF[2.4G]:STA(38:1a:52:0c:6f:3a) receive group Msg2
06.03.2023 13:44:58 [4 4 1] IF[2.4G]:STA(24:18:c6:19:1f:13) receive group Msg2
06.03.2023 13:44:58 [4 4 1] IF[2.4G]:STA(0c:dc:7e:5e:9b:38) receive group Msg2 ****
06.03.2023 13:44:58 [4 4 1] IF[2.4G]:STA(24:f5:a2:f9:d6:a1) receive group Msg2
06.03.2023 13:44:58 [4 4 1] IF[2.4G]:STA(a0:cc:2b:4e:61:98) receive group Msg2
06.03.2023 13:45:07 [4 4 1] IF[5G]:STA(c8:ff:77:35:0e:13) send group Msg1
06.03.2023 13:45:07 [4 4 1] IF[5G]:STA(30:fd:38:95:6e:f3) send group Msg1
06.03.2023 13:45:07 [4 4 1] IF[5G]:STA(d4:f5:47:22:d2:c3) send group Msg1
06.03.2023 13:45:07 [4 4 1] IF[5G]:STA(00:02:d1:76:7a:e6) send group Msg1
06.03.2023 13:45:07 [4 4 1] IF[5G]:STA(d4:9c:dd:11:45:6a) send group Msg1
06.03.2023 13:45:07 [4 4 1] IF[5G]:STA(54:60:09:3b:c8:14) send group Msg1
06.03.2023 13:45:07 [4 4 1] IF[5G]:STA(20:df:b9:60:80:14) send group Msg1
06.03.2023 13:45:07 [4 4 1] IF[5G]:STA(14:75:5b:60:e4:ba) send group Msg1
06.03.2023 13:45:07 [4 4 1] IF[5G]:STA(3c:31:78:01:4e:bb) send group Msg1
06.03.2023 13:45:07 [4 4 1] IF[5G]:STA(d4:f5:47:22:d2:c3) receive group Msg2
06.03.2023 13:45:07 [4 4 1] IF[5G]:STA(30:fd:38:95:6e:f3) receive group Msg2
06.03.2023 13:45:07 [4 4 1] IF[5G]:STA(54:60:09:3b:c8:14) receive group Msg2
06.03.2023 13:45:07 [4 4 1] IF[5G]:STA(c8:ff:77:35:0e:13) receive group Msg2
06.03.2023 13:45:07 [4 4 1] IF[5G]:STA(14:75:5b:60:e4:ba) receive group Msg2
06.03.2023 13:45:07 [4 4 1] IF[5G]:STA(20:df:b9:60:80:14) receive group Msg2
06.03.2023 13:45:07 [4 4 1] IF[5G]:STA(00:02:d1:76:7a:e6) receive group Msg2
06.03.2023 13:45:07 [4 4 1] IF[5G]:STA(d4:9c:dd:11:45:6a) receive group Msg2
06.03.2023 13:45:07 [4 4 1] IF[5G]:STA(3c:31:78:01:4e:bb) receive group Msg2
06.03.2023 14:45:00 [4 4 1] IF[2.4G]:STA(b8:3a:9d:21:bc:34) send group Msg1
06.03.2023 14:45:00 [4 4 1] IF[2.4G]:STA(c8:ff:77:db:d7:90) send group Msg1
06.03.2023 14:45:00 [4 4 1] IF[2.4G]:STA(38:1a:52:0c:6f:3a) send group Msg1
06.03.2023 14:45:00 [4 4 1] IF[2.4G]:STA(b8:d6:1a:87:d8:b8) send group Msg1
06.03.2023 14:45:00 [4 4 1] IF[2.4G]:STA(c4:5b:be:6d:e2:1e) send group Msg1
06.03.2023 14:45:00 [4 4 1] IF[2.4G]:STA(24:f5:a2:f9:d6:a1) send group Msg1
06.03.2023 14:45:00 [4 4 1] IF[2.4G]:STA(0c:dc:7e:5e:9b:38) send group Msg1 ****
06.03.2023 14:45:00 [4 4 1] IF[2.4G]:STA(a0:cc:2b:4e:61:98) send group Msg1
06.03.2023 14:45:00 [4 4 1] IF[2.4G]:STA(c8:c9:a3:5b:df:28) send group Msg1
06.03.2023 14:45:00 [4 4 1] IF[2.4G]:STA(24:18:c6:19:1f:13) send group Msg1
06.03.2023 14:45:00 [4 4 1] IF[2.4G]:STA(c8:ff:77:db:d7:90) receive group Msg2
06.03.2023 14:45:00 [4 4 1] IF[2.4G]:STA(c4:5b:be:6d:e2:1e) receive group Msg2
06.03.2023 14:45:00 [4 4 1] IF[2.4G]:STA(24:f5:a2:f9:d6:a1) receive group Msg2
06.03.2023 14:45:00 [4 4 1] IF[2.4G]:STA(b8:3a:9d:21:bc:34) receive group Msg2
06.03.2023 14:45:00 [4 4 1] IF[2.4G]:STA(24:18:c6:19:1f:13) receive group Msg2
06.03.2023 14:45:00 [4 4 1] IF[2.4G]:STA(c8:c9:a3:5b:df:28) receive group Msg2
06.03.2023 14:45:00 [4 4 1] IF[2.4G]:STA(b8:d6:1a:87:d8:b8) receive group Msg2
06.03.2023 14:45:00 [4 4 1] IF[2.4G]:STA(38:1a:52:0c:6f:3a) receive group Msg2
06.03.2023 14:45:00 [4 4 1] IF[2.4G]:STA(0c:dc:7e:5e:9b:38) receive group Msg2 ****
06.03.2023 14:45:00 [4 4 1] IF[2.4G]:STA(a0:cc:2b:4e:61:98) receive group Msg2

If the boosters AP log a connect event (I don't know of they do), it didn't happen during the time that HA reported an EOF event. Which may suggest that the device is not disconnecting from wifi, but disconnecting from HA, instead....maybe?

@MikeDeltaHH
Copy link

I have the same problem with one of my two BT proxies - every few minutes this error appears in my HA log. But the connection is only disconnected to HA, not to the WLAN.

@Anto79-ops
Copy link
Author

Anto79-ops commented Apr 18, 2023

Thanks @MikeDeltaHH for reporting.

At the moment, we're stuck at getting py-spy installed on my system to check for event loop blocking. Py-spy unfortunately still has a bug of some sort that prevents me from running it. Hopefully soon it will be fixed. You may have better luck getting it installed if required.

I still have the EOF messages in my logs, hundreds of them over days, but my BT Proxies and BT network seem to be working well.

@bdraco
Copy link
Member

bdraco commented Jun 9, 2023

esphome/esphome#4924 should reduce the load on the esphome device

@bdraco
Copy link
Member

bdraco commented Jun 9, 2023

Core needs #94138 as well to it be able to us the new functionality

Currently scheduled for 2023.7.x

@Anto79-ops
Copy link
Author

Thanks bdraco! I will follow these.

@bdraco
Copy link
Member

bdraco commented Jun 29, 2023

Closing per discussion in #beta. Considered solved in 2023.7.x. Along with esphome 2023.6.x

@bdraco bdraco closed this as completed Jun 29, 2023
@github-actions github-actions bot locked and limited conversation to collaborators Jul 29, 2023
Sign up for free to subscribe to this conversation on GitHub. Already have an account? Sign in.
Projects
None yet
Development

No branches or pull requests

8 participants