Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

ZHA - application controller is not running #130548

Open
erkr opened this issue Nov 13, 2024 · 62 comments
Open

ZHA - application controller is not running #130548

erkr opened this issue Nov 13, 2024 · 62 comments

Comments

@erkr
Copy link

erkr commented Nov 13, 2024

The problem

My ZHA entities couldn’t be controlled anymore although all ZHA entities seemed available and fine. No sensor updates and errors logged when sending commands (example logging included was a turn on service) .
Even reloading ZHA suggested that ZHA was fine, but the failure state remained. Shortly Un-plugging the ZBDongle-E (Sonoff stock Firmware: 6.10.3.0 build 297), resolved the issue.
So for some reason the ZHA watchdog didn’t notice this hanging dongle state for hours!

What version of Home Assistant Core has the issue?

Core-2024.11.1

What was the last working version of Home Assistant Core?

No response

What type of installation are you running?

Home Assistant OS

Integration causing the issue

ZHA

Link to integration documentation on our website

https://www.home-assistant.io/integrations/zha/

Diagnostics information

No response

Example YAML snippet

No response

Anything in the logs that might be useful for us?

Logger: homeassistant.components.websocket_api.http.connection
Source: components/websocket_api/commands.py:245
integration: Home Assistant WebSocket API (documentation, issues)
First occurred: 19:14:03 (2 occurrences)
Last logged: 19:14:08

[546888653200] Unexpected exception
Traceback (most recent call last):
  File "/usr/local/lib/python3.12/site-packages/zha/zigbee/cluster_handlers/__init__.py", line 67, in wrap_zigpy_exceptions
    yield
  File "/usr/local/lib/python3.12/site-packages/zha/zigbee/cluster_handlers/__init__.py", line 85, in wrapper
    return await RETRYABLE_REQUEST_DECORATOR(func)(*args, **kwargs)
           ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
  File "/usr/local/lib/python3.12/site-packages/zigpy/util.py", line 136, in retry
    return await func()
           ^^^^^^^^^^^^
  File "/usr/local/lib/python3.12/site-packages/zigpy/zcl/__init__.py", line 375, in request
    return await self._endpoint.request(
           ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
  File "/usr/local/lib/python3.12/site-packages/zigpy/endpoint.py", line 270, in request
    return await self.device.request(
           ^^^^^^^^^^^^^^^^^^^^^^^^^^
  File "/usr/local/lib/python3.12/site-packages/zigpy/device.py", line 378, in request
    await send_request()
  File "/usr/local/lib/python3.12/site-packages/zigpy/application.py", line 841, in request
    await self.send_packet(
  File "/usr/local/lib/python3.12/site-packages/bellows/zigbee/application.py", line 700, in send_packet
    raise ControllerError("ApplicationController is not running")
bellows.exception.ControllerError: ApplicationController is not running

The above exception was the direct cause of the following exception:

Traceback (most recent call last):
  File "/usr/src/homeassistant/homeassistant/components/zha/helpers.py", line 1335, in handler
    return await func(self, *args, **kwargs)
           ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
  File "/usr/src/homeassistant/homeassistant/components/zha/switch.py", line 57, in async_turn_on
    await self.entity_data.entity.async_turn_on()
  File "/usr/local/lib/python3.12/site-packages/zha/application/platforms/switch.py", line 95, in async_turn_on
    await self._on_off_cluster_handler.turn_on()
  File "/usr/local/lib/python3.12/site-packages/zha/zigbee/cluster_handlers/general.py", line 465, in turn_on
    result = await self.on()
             ^^^^^^^^^^^^^^^
  File "/usr/local/lib/python3.12/site-packages/zha/zigbee/cluster_handlers/__init__.py", line 84, in wrapper
    with wrap_zigpy_exceptions():
  File "/usr/local/lib/python3.12/contextlib.py", line 158, in __exit__
    self.gen.throw(value)
  File "/usr/local/lib/python3.12/site-packages/zha/zigbee/cluster_handlers/__init__.py", line 76, in wrap_zigpy_exceptions
    raise ZHAException(message) from exc
zha.exceptions.ZHAException: Failed to send request: ApplicationController is not running

The above exception was the direct cause of the following exception:

Traceback (most recent call last):
  File "/usr/src/homeassistant/homeassistant/components/websocket_api/commands.py", line 245, in handle_call_service
    response = await hass.services.async_call(
               ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
  File "/usr/src/homeassistant/homeassistant/core.py", line 2802, in async_call
    response_data = await coro
                    ^^^^^^^^^^
  File "/usr/src/homeassistant/homeassistant/core.py", line 2845, in _execute_service
    return await target(service_call)
           ^^^^^^^^^^^^^^^^^^^^^^^^^^
  File "/usr/src/homeassistant/homeassistant/helpers/service.py", line 1007, in entity_service_call
    single_response = await _handle_entity_call(
                      ^^^^^^^^^^^^^^^^^^^^^^^^^^
  File "/usr/src/homeassistant/homeassistant/helpers/service.py", line 1079, in _handle_entity_call
    result = await task
             ^^^^^^^^^^
  File "/usr/src/homeassistant/homeassistant/components/light/__init__.py", line 628, in async_handle_light_on_service
    await light.async_turn_on(**filter_turn_on_params(light, params))
  File "/usr/src/homeassistant/homeassistant/components/switch_as_x/entity.py", line 152, in async_turn_on
    await self.hass.services.async_call(
  File "/usr/src/homeassistant/homeassistant/core.py", line 2802, in async_call
    response_data = await coro
                    ^^^^^^^^^^
  File "/usr/src/homeassistant/homeassistant/core.py", line 2845, in _execute_service
    return await target(service_call)
           ^^^^^^^^^^^^^^^^^^^^^^^^^^
  File "/usr/src/homeassistant/homeassistant/helpers/service.py", line 1007, in entity_service_call
    single_response = await _handle_entity_call(
                      ^^^^^^^^^^^^^^^^^^^^^^^^^^
  File "/usr/src/homeassistant/homeassistant/helpers/service.py", line 1079, in _handle_entity_call
    result = await task
             ^^^^^^^^^^
  File "/usr/src/homeassistant/homeassistant/components/zha/helpers.py", line 1337, in handler
    raise HomeAssistantError(err) from err
homeassistant.exceptions.HomeAssistantError: Failed to send request: ApplicationController is not running

Additional information

All last seen timestamps where not updated (3 hours old when I noticed). So all mains devices should have been made unavailable by ZHA (my timeout for mains devices is 5 minutes)

@home-assistant
Copy link

Hey there @dmulcahey, @Adminiuga, @puddly, @TheJulianJES, mind taking a look at this issue as it has been labeled with an integration (zha) you are listed as a code owner for? Thanks!

Code owner commands

Code owners of zha can trigger bot actions by commenting:

  • @home-assistant close Closes the issue.
  • @home-assistant rename Awesome new title Renames the issue.
  • @home-assistant reopen Reopen the issue.
  • @home-assistant unassign zha Removes the current integration label and assignees on the issue, add the integration domain after the command.
  • @home-assistant add-label needs-more-information Add a label (needs-more-information, problem in dependency, problem in custom component) to the issue.
  • @home-assistant remove-label needs-more-information Remove a label (needs-more-information, problem in dependency, problem in custom component) on the issue.

(message by CodeOwnersMention)


zha documentation
zha source
(message by IssueLinks)

@erkr erkr changed the title ZHA - application controller not running ZHA - application controller is not running Nov 13, 2024
@Nerwyn
Copy link

Nerwyn commented Nov 14, 2024

My ZHA network has gone down twice now with similar issues, also starting with core 2024.11.1.

@erkr
Copy link
Author

erkr commented Nov 15, 2024

@Nerwyn also using a zbdongle-e?!
In my case I recently upgraded from a conbee2 to a zbdongle-e. Better coverage but stability issues in return 🥹
So I can't confirm it's core2024.11.1 induced or a dongle related issue

@Nerwyn
Copy link

Nerwyn commented Nov 15, 2024

Nope, using the SONOFF Zigbee 3.0 USB Dongle Plus-E Gateway with Silicon Labs Multiprotocol.

@puddly
Copy link
Contributor

puddly commented Nov 15, 2024

@Nerwyn Multiprotocol isn't really a supported configuration anymore. I strongly suggest you migrate back to Zigbee firmware and explore alternative options for Thread border routers (if you use Thread).

@erkr
Copy link
Author

erkr commented Nov 15, 2024

@puddly
I’m running the Itead stock Firmware: 6.10.3.0 build 297 (coordinator). Is that recommended?!

For some reason I see ZHA resetting the dongle as it is reported in fail state. That happens a few times per week.

Additionally I noticed this issue reported here after upgrading to core-2024.11.1 (from core-2024.10.4). But I can't really tell it's related to new core version as I upgraded my dongle as well two weeks ago

Best Eric

@nayefalebrahim
Copy link

on core-2024.11.2

also have a SONOFF ZB Dongle-P, have the same error as the OP

@puddly
Copy link
Contributor

puddly commented Nov 17, 2024

@nayefalebrahim please attach a ZHA debug log

@christabone
Copy link

I'm seeing the same with a S31 Lite zb by SONOFF. Here's the logs:

Logger: homeassistant
Source: /usr/src/homeassistant/homeassistant/runner.py:147
First occurred: 9:13:05 PM (2 occurrences)
Last logged: 9:13:51 PM

Error doing job: Task exception was never retrieved (None)
Traceback (most recent call last):
  File "/usr/local/lib/python3.12/site-packages/zha/zigbee/cluster_handlers/__init__.py", line 67, in wrap_zigpy_exceptions
    yield
  File "/usr/local/lib/python3.12/site-packages/zha/zigbee/cluster_handlers/__init__.py", line 85, in wrapper
    return await RETRYABLE_REQUEST_DECORATOR(func)(*args, **kwargs)
           ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
  File "/usr/local/lib/python3.12/site-packages/zigpy/util.py", line 136, in retry
    return await func()
           ^^^^^^^^^^^^
  File "/usr/local/lib/python3.12/site-packages/zigpy/zcl/__init__.py", line 375, in request
    return await self._endpoint.request(
           ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
  File "/usr/local/lib/python3.12/site-packages/zigpy/endpoint.py", line 270, in request
    return await self.device.request(
           ^^^^^^^^^^^^^^^^^^^^^^^^^^
  File "/usr/local/lib/python3.12/site-packages/zigpy/device.py", line 378, in request
    await send_request()
  File "/usr/local/lib/python3.12/site-packages/zigpy/application.py", line 825, in request
    await self.send_packet(
  File "/usr/local/lib/python3.12/site-packages/zigpy_znp/zigbee/application.py", line 1100, in send_packet
    raise DeliveryError(
zigpy.exceptions.DeliveryError: Request failed after 5 attempts: <Status.MAC_CHANNEL_ACCESS_FAILURE: 225>

The above exception was the direct cause of the following exception:

Traceback (most recent call last):
  File "/usr/local/lib/python3.12/site-packages/zha/zigbee/device.py", line 764, in async_configure
    await self.identify_ch.trigger_effect(
  File "/usr/local/lib/python3.12/site-packages/zha/zigbee/cluster_handlers/__init__.py", line 84, in wrapper
    with wrap_zigpy_exceptions():
  File "/usr/local/lib/python3.12/contextlib.py", line 158, in __exit__
    self.gen.throw(value)
  File "/usr/local/lib/python3.12/site-packages/zha/zigbee/cluster_handlers/__init__.py", line 76, in wrap_zigpy_exceptions
    raise ZHAException(message) from exc
zha.exceptions.ZHAException: Failed to send request: Request failed after 5 attempts: <Status.MAC_CHANNEL_ACCESS_FAILURE: 225>

Here's where it fails in the system logs with the Zigbee debug enabled:

2024-11-19 21:27:20.372 DEBUG (Thread-35) [aiosqlite] executing functools.partial(<built-in method commit of sqlite3.Connection object at 0x************>)
2024-11-19 21:27:20.372 DEBUG (Thread-35) [aiosqlite] operation functools.partial(<built-in method commit of sqlite3.Connection object at 0x************>) completed
2024-11-19 21:27:21.392 DEBUG (MainThread) [zha.zigbee.cluster_handlers] [0x****:1:0x0006]: Failed to bind 'on_off' cluster: 
Traceback (most recent call last):
  File "/usr/local/lib/python3.12/site-packages/zigpy/device.py", line 381, in request
    return await req.result
           ^^^^^^^^^^^^^^^^
asyncio.exceptions.CancelledError
The above exception was the direct cause of the following exception:
Traceback (most recent call last):
  File "/usr/local/lib/python3.12/site-packages/zha/zigbee/cluster_handlers/__init__.py", line 280, in bind
    res = await self.cluster.bind()
          ^^^^^^^^^^^^^^^^^^^^^^^^^
  File "/usr/local/lib/python3.12/site-packages/zigpy/device.py", line 380, in request
    async with asyncio_timeout(timeout):
  File "/usr/local/lib/python3.12/asyncio/timeouts.py", line 115, in __aexit__
    raise TimeoutError from exc_val
TimeoutError
2024-11-19 21:27:21.393 DEBUG (MainThread) [zha] Emitting event zha_channel_bind with data ClusterBindEvent(cluster_name='On/Off', cluster_id=6, success=False, cluster_handler_unique_id='**:**:**:**:**:**:**:**:*:0x****', event_type='zha_channel_message', event='zha_channel_bind') (1 listeners)
2024-11-19 21:27:21.393 DEBUG (MainThread) [zha] (ZHADeviceProxy) handling event protocol for event: ClusterBindEvent(cluster_name='On/Off', cluster_id=6, success=False, cluster_handler_unique_id='**:**:**:**:**:**:**:**:*:0x****', event_type='zha_channel_message', event='zha_channel_bind')

@puddly
Copy link
Contributor

puddly commented Nov 20, 2024

@christabone MAC_CHANNEL_ACCESS_FAILURE means that your coordinator is refusing to transmit because there is too much RF interference. Make sure your coordinator is on a USB extension cable and away from USB 3.0 ports, SSDs, 2.4GHz WiFi APs, and so on.

@christabone
Copy link

christabone commented Nov 20, 2024

@puddly Thank you for the insight on that issue, I'll be sure to check the coordinator!

EDIT: Moving it further away with an extension cable fixed the issue, thanks again.

@erkr
Copy link
Author

erkr commented Nov 22, 2024

I have only misery with the ZBDongle-E.
Every time I physically unplug and reseat the dongle, it works flawless for 1-2 days. Often without any ZHA warnings or errors in the log
Then it stops working without any warning as described in this issue. ZHA suggests it's running, and still no errors logged.

I now upgraded to the latest ITead version 7.4.4.0 build 0 and will report back if that makes a difference

@erkr
Copy link
Author

erkr commented Nov 24, 2024

Hi @puddly

I'm now testing with the updated Firmware (iTead 7.4.4) and ZHA ran for almost 2 days without errors or warnings.
Till today, it just restarted with this logging:

Logger: homeassistant.components.websocket_api.http.connection
Source: components/zha/helpers.py:1039
integration: Home Assistant WebSocket API (documentation, issues)
First occurred: 14:42:00 (1 occurrences)
Last logged: 14:42:00

[546399264848] Error handling message: Unknown error (unknown_error) Eric Kreuwels from 127.0.0.1 (Python/3.12 websockets/13.1)
Traceback (most recent call last):
  File "/usr/src/homeassistant/homeassistant/components/websocket_api/decorators.py", line 28, in _handle_async_response
    await func(hass, connection, msg)
  File "/usr/src/homeassistant/homeassistant/components/zha/websocket_api.py", line 374, in websocket_get_devices
    zha_gateway_proxy: ZHAGatewayProxy = get_zha_gateway_proxy(hass)
                                         ^^^^^^^^^^^^^^^^^^^^^^^^^^^
  File "/usr/src/homeassistant/homeassistant/components/zha/helpers.py", line 1039, in get_zha_gateway_proxy
    raise ValueError("No gateway object exists")
ValueError: No gateway object exists

It was running again without reseating the dongle 🤞

@erkr
Copy link
Author

erkr commented Nov 25, 2024

@puddly This realy an issue in ZHA!
Today it failed again. Now I only see a failed delivery, followed by a heartbeat watchdog warning.
After that ZHA suggests it is running again (all entities report available, but ZHA is not updating them anymore). It is most likely a dongle issue, BUT this issue is that ZHA doesn't notice it:

image

This was found in the log

2024-11-25 05:12:55.040 ERROR (MainThread) [zigpy.zcl] [0x6F1D:1:0x0020] Traceback (most recent call last):
  File "/usr/local/lib/python3.12/site-packages/zigpy/endpoint.py", line 270, in request
    return await self.device.request(
           ^^^^^^^^^^^^^^^^^^^^^^^^^^
  File "/usr/local/lib/python3.12/site-packages/zigpy/device.py", line 378, in request
    await send_request()
  File "/usr/local/lib/python3.12/site-packages/zigpy/application.py", line 825, in request
    await self.send_packet(
  File "/usr/local/lib/python3.12/site-packages/bellows/zigbee/application.py", line 824, in send_packet
    raise zigpy.exceptions.DeliveryError(
zigpy.exceptions.DeliveryError: Failed to deliver message: <sl_Status.ZIGBEE_DELIVERY_FAILED: 3074>
The above exception was the direct cause of the following exception:
Traceback (most recent call last):
  File "/usr/local/lib/python3.12/site-packages/zha/zigbee/cluster_handlers/general.py", line 633, in check_in_response
    await self.fast_poll_stop()
  File "/usr/local/lib/python3.12/site-packages/zha/zigbee/cluster_handlers/__init__.py", line 84, in wrapper
    with wrap_zigpy_exceptions():
  File "/usr/local/lib/python3.12/contextlib.py", line 158, in __exit__
    self.gen.throw(value)
  File "/usr/local/lib/python3.12/site-packages/zha/zigbee/cluster_handlers/__init__.py", line 76, in wrap_zigpy_exceptions
    raise ZHAException(message) from exc
zha.exceptions.ZHAException: Failed to send request: Failed to deliver message: <sl_Status.ZIGBEE_DELIVERY_FAILED: 3074>
2024-11-25 06:09:33.819 ERROR (MainThread) [zigpy.zcl] [0x6F1D:1:0x0020] Traceback (most recent call last):
  File "/usr/local/lib/python3.12/site-packages/zigpy/endpoint.py", line 270, in request
    return await self.device.request(
           ^^^^^^^^^^^^^^^^^^^^^^^^^^
  File "/usr/local/lib/python3.12/site-packages/zigpy/device.py", line 378, in request
    await send_request()
  File "/usr/local/lib/python3.12/site-packages/zigpy/application.py", line 825, in request
    await self.send_packet(
  File "/usr/local/lib/python3.12/site-packages/bellows/zigbee/application.py", line 824, in send_packet
    raise zigpy.exceptions.DeliveryError(
zigpy.exceptions.DeliveryError: Failed to deliver message: <sl_Status.ZIGBEE_DELIVERY_FAILED: 3074>
The above exception was the direct cause of the following exception:
Traceback (most recent call last):
  File "/usr/local/lib/python3.12/site-packages/zha/zigbee/cluster_handlers/general.py", line 630, in check_in_response
    await self.checkin_response(True, self.CHECKIN_FAST_POLL_TIMEOUT, tsn=tsn)
  File "/usr/local/lib/python3.12/site-packages/zha/zigbee/cluster_handlers/__init__.py", line 84, in wrapper
    with wrap_zigpy_exceptions():
  File "/usr/local/lib/python3.12/contextlib.py", line 158, in __exit__
    self.gen.throw(value)
  File "/usr/local/lib/python3.12/site-packages/zha/zigbee/cluster_handlers/__init__.py", line 76, in wrap_zigpy_exceptions
    raise ZHAException(message) from exc
zha.exceptions.ZHAException: Failed to send request: Failed to deliver message: <sl_Status.ZIGBEE_DELIVERY_FAILED: 3074>
PING 192.168.178.105 (192.168.178.105): 56 data bytes
--- 192.168.178.105 ping statistics ---
1 packets transmitted, 0 packets received, 100% packet loss
2024-11-25 10:52:53.874 ERROR (MainThread) [zigpy.zcl] [0x6F1D:1:0x0020] Traceback (most recent call last):
  File "/usr/local/lib/python3.12/site-packages/zigpy/endpoint.py", line 270, in request
    return await self.device.request(
           ^^^^^^^^^^^^^^^^^^^^^^^^^^
  File "/usr/local/lib/python3.12/site-packages/zigpy/device.py", line 378, in request
    await send_request()
  File "/usr/local/lib/python3.12/site-packages/zigpy/application.py", line 825, in request
    await self.send_packet(
  File "/usr/local/lib/python3.12/site-packages/bellows/zigbee/application.py", line 824, in send_packet
    raise zigpy.exceptions.DeliveryError(
zigpy.exceptions.DeliveryError: Failed to deliver message: <sl_Status.ZIGBEE_DELIVERY_FAILED: 3074>
The above exception was the direct cause of the following exception:
Traceback (most recent call last):
  File "/usr/local/lib/python3.12/site-packages/zha/zigbee/cluster_handlers/general.py", line 633, in check_in_response
    await self.fast_poll_stop()
  File "/usr/local/lib/python3.12/site-packages/zha/zigbee/cluster_handlers/__init__.py", line 84, in wrapper
    with wrap_zigpy_exceptions():
  File "/usr/local/lib/python3.12/contextlib.py", line 158, in __exit__
    self.gen.throw(value)
  File "/usr/local/lib/python3.12/site-packages/zha/zigbee/cluster_handlers/__init__.py", line 76, in wrap_zigpy_exceptions
    raise ZHAException(message) from exc
zha.exceptions.ZHAException: Failed to send request: Failed to deliver message: <sl_Status.ZIGBEE_DELIVERY_FAILED: 3074>
2024-11-25 11:39:31.645 WARNING (MainThread) [bellows.zigbee.application] Watchdog heartbeat timeout: TimeoutError()

@puddly
Copy link
Contributor

puddly commented Nov 25, 2024

@erkr Enable debug logging for a few minutes and post the full log when ZHA is in this bad state.

@erkr
Copy link
Author

erkr commented Nov 25, 2024

@puddly 👌

@erkr
Copy link
Author

erkr commented Nov 25, 2024

@puddly
As far I can seen there is nothing logged by homeassistant.components.zha. Logging started when also setting the bellows part in debug!
These setting where finally applied for logging ZHA:

  action: logger.set_level
  data:
    homeassistant.core: debug
    homeassistant.components.zha: debug
    zigpy: debug
    bellows.zigbee.application: debug
    bellows.ezsp: debug
    zhaquirks: debug

I added the log ending with disabling the debug logging. Some timestamps:

  1. ZHA stopped working at 20:57 (nothing logged)
  2. Started debug log for homeassistant.core, homeassistant.components.zha and zigpy at 21:14:25
  3. Full debug log incl bellows was started at 21:23:58
  4. ZHA revived itself at 21:26:45 (due starting debug log?)
  5. Debug log was stopped at 21:31:56

zha log.txt

@ryanwinter
Copy link

ryanwinter commented Nov 26, 2024

Also seeing this issue sometime after upgrading to 2024.11. I'm runnning home assistant in an LXC. I'll try and grab some logs.

@ryanwinter
Copy link

This is what I saw. Everything is great, and then suddenly I get a "no such device" error. Looking in the container the device is still there.

2024-11-26 14:57:24.676 DEBUG (MainThread) [bellows.zigbee.application] Sending packet ZigbeePacket(timestamp=datetime.datetime(2024, 11, 26, 22, 57, 24, 676509, tzinfo=datetime.timezone.utc), priority=<PacketPriority.NORMAL: 0>, src=AddrModeAddress(addr_mode=<AddrMode.NWK: 2>, address=0x0000), src_ep=1, dst=AddrModeAddress(addr_mode=<AddrMode.NWK: 2>, address=0xD31D), dst_ep=1, source_route=None, extended_timeout=False, tsn=1, profile_id=260, cluster_id=6, data=Serialized[b'\x00\x01\x00\x00\x00'], tx_options=<TransmitOptions.NONE: 0>, radius=0, non_member_radius=0, lqi=None, rssi=None)
2024-11-26 14:57:24.676 DEBUG (MainThread) [bellows.ezsp.protocol] Sending command  sendUnicast: () {'type': <EmberOutgoingMessageType.OUTGOING_DIRECT: 0>, 'indexOrDestination': 0xD31D, 'apsFrame': EmberApsFrame(profileId=260, clusterId=6, sourceEndpoint=1, destinationEndpoint=1, options=<EmberApsOption.APS_OPTION_RETRY|APS_OPTION_ENABLE_ROUTE_DISCOVERY: 320>, groupId=0, sequence=1), 'messageTag': 10, 'messageContents': b'\x00\x01\x00\x00\x00'}
2024-11-26 14:57:24.732 DEBUG (MainThread) [bellows.ezsp.protocol] Received command sendUnicast: {'status': <EmberStatus.SUCCESS: 0>, 'sequence': 175}
2024-11-26 14:57:24.733 DEBUG (MainThread) [bellows.ezsp.protocol] Received command messageSentHandler: {'type': <EmberOutgoingMessageType.OUTGOING_DIRECT: 0>, 'indexOrDestination': 27057, 'apsFrame': EmberApsFrame(profileId=260, clusterId=6, sourceEndpoint=1, destinationEndpoint=1, options=<EmberApsOption.APS_OPTION_RETRY: 64>, groupId=0, sequence=170), 'messageTag': 5, 'status': <EmberStatus.SUCCESS: 0>, 'messageContents': b''}
2024-11-26 14:57:24.733 DEBUG (MainThread) [bellows.zigbee.application] Received messageSentHandler frame with [<EmberOutgoingMessageType.OUTGOING_DIRECT: 0>, 27057, EmberApsFrame(profileId=260, clusterId=6, sourceEndpoint=1, destinationEndpoint=1, options=<EmberApsOption.APS_OPTION_RETRY: 64>, groupId=0, sequence=170), 5, <EmberStatus.SUCCESS: 0>, b'']
2024-11-26 14:57:24.734 ERROR (bellows.thread_0) [homeassistant] Error doing job: Fatal write error on serial transport (None)
Traceback (most recent call last):
  File "/usr/local/lib/python3.12/site-packages/serial/serialposix.py", line 621, in write
    n = os.write(self.fd, d)
        ^^^^^^^^^^^^^^^^^^^^
OSError: [Errno 19] No such device

During handling of the above exception, another exception occurred:

Traceback (most recent call last):
  File "/usr/local/lib/python3.12/site-packages/serial_asyncio_fast/__init__.py", line 310, in _write_data
    n = self._serial.write(data)
        ^^^^^^^^^^^^^^^^^^^^^^^^
  File "/usr/local/lib/python3.12/site-packages/serial/serialposix.py", line 655, in write
    raise SerialException('write failed: {}'.format(e))
serial.serialutil.SerialException: write failed: [Errno 19] No such device
2024-11-26 14:57:24.743 DEBUG (MainThread) [bellows.ezsp.protocol] Received command incomingMessageHandler: {'type': <EmberIncomingMessageType.INCOMING_UNICAST: 0>, 'apsFrame': EmberApsFrame(profileId=260, clusterId=6, sourceEndpoint=1, destinationEndpoint=1, options=<EmberApsOption.APS_OPTION_RETRY|APS_OPTION_ENABLE_ROUTE_DISCOVERY: 320>, groupId=0, sequence=50), 'lastHopLqi': 255, 'lastHopRssi': -80, 'sender': 0x69B1, 'bindingIndex': 255, 'addressIndex': 255, 'messageContents': b'\x18\x01\x01\x00\x00\x00\x10\x00'}
2024-11-26 14:57:24.743 DEBUG (MainThread) [bellows.zigbee.application] Received incomingMessageHandler frame with [<EmberIncomingMessageType.INCOMING_UNICAST: 0>, EmberApsFrame(profileId=260, clusterId=6, sourceEndpoint=1, destinationEndpoint=1, options=<EmberApsOption.APS_OPTION_RETRY|APS_OPTION_ENABLE_ROUTE_DISCOVERY: 320>, groupId=0, sequence=50), 255, -80, 0x69B1, 255, 255, b'\x18\x01\x01\x00\x00\x00\x10\x00']
2024-11-26 14:57:24.744 DEBUG (MainThread) [zigpy.application] Received a packet: ZigbeePacket(timestamp=datetime.datetime(2024, 11, 26, 22, 57, 24, 744046, tzinfo=datetime.timezone.utc), priority=0, src=AddrModeAddress(addr_mode=<AddrMode.NWK: 2>, address=0x69B1), src_ep=1, dst=AddrModeAddress(addr_mode=<AddrMode.NWK: 2>, address=0x0000), dst_ep=1, source_route=None, extended_timeout=False, tsn=50, profile_id=260, cluster_id=6, data=Serialized[b'\x18\x01\x01\x00\x00\x00\x10\x00'], tx_options=<TransmitOptions.NONE: 0>, radius=0, non_member_radius=0, lqi=255, rssi=-80)
2024-11-26 14:57:24.744 DEBUG (MainThread) [zigpy.zcl] [0x69B1:1:0x0006] Received ZCL frame: b'\x18\x01\x01\x00\x00\x00\x10\x00'
2024-11-26 14:57:24.744 DEBUG (MainThread) [zigpy.zcl] [0x69B1:1:0x0006] Decoded ZCL frame header: ZCLHeader(frame_control=FrameControl<0x18>(frame_type=<FrameType.GLOBAL_COMMAND: 0>, is_manufacturer_specific=0, direction=<Direction.Server_to_Client: 1>, disable_default_response=1, reserved=0, *is_cluster=False, *is_general=True), tsn=1, command_id=1, *direction=<Direction.Server_to_Client: 1>)
2024-11-26 14:57:24.744 DEBUG (MainThread) [zigpy.zcl] [0x69B1:1:0x0006] Decoded ZCL frame: OnOff:Read_Attributes_rsp(status_records=[ReadAttributeRecord(attrid=0, status=<Status.SUCCESS: 0>, value=TypeValue(type=Bool, value=<Bool.false: 0>))])
2024-11-26 14:57:24.745 DEBUG (MainThread) [homeassistant.components.zha.entity] light.living_room_2: Handling event from entity: EntityStateChangedEvent(event_type='entity', event='state_changed', platform=<Platform.LIGHT: 'light'>, unique_id='a4:c1:38:b9:4f:8f:58:2d-1', device_ieee=a4:c1:38:b9:4f:8f:58:2d, endpoint_id=1, group_id=None)
2024-11-26 14:57:24.746 DEBUG (MainThread) [zigpy.zcl] [0x69B1:1:0x0008] Sending request header: ZCLHeader(frame_control=FrameControl<0x00>(frame_type=<FrameType.GLOBAL_COMMAND: 0>, is_manufacturer_specific=False, direction=<Direction.Client_to_Server: 0>, disable_default_response=0, reserved=0, *is_cluster=False, *is_general=True), tsn=2, command_id=<GeneralCommand.Read_Attributes: 0>, *direction=<Direction.Client_to_Server: 0>)
2024-11-26 14:57:24.746 DEBUG (MainThread) [zigpy.zcl] [0x69B1:1:0x0008] Sending request: Read_Attributes(attribute_ids=[0])
2024-11-26 14:57:24.746 DEBUG (MainThread) [bellows.zigbee.application] Sending packet ZigbeePacket(timestamp=datetime.datetime(2024, 11, 26, 22, 57, 24, 746515, tzinfo=datetime.timezone.utc), priority=<PacketPriority.NORMAL: 0>, src=AddrModeAddress(addr_mode=<AddrMode.NWK: 2>, address=0x0000), src_ep=1, dst=AddrModeAddress(addr_mode=<AddrMode.NWK: 2>, address=0x69B1), dst_ep=1, source_route=None, extended_timeout=False, tsn=2, profile_id=260, cluster_id=8, data=Serialized[b'\x00\x02\x00\x00\x00'], tx_options=<TransmitOptions.NONE: 0>, radius=0, non_member_radius=0, lqi=None, rssi=None)
2024-11-26 14:57:24.744 ERROR (bellows.thread_0) [homeassistant] Error doing job: Exception in callback None() (None)
Traceback (most recent call last):
  File "/usr/local/lib/python3.12/asyncio/events.py", line 88, in _run
    self._context.run(self._callback, *self._args)
  File "/usr/local/lib/python3.12/site-packages/serial_asyncio_fast/__init__.py", line 146, in _read_ready
    self._protocol.data_received(data)
  File "/usr/local/lib/python3.12/site-packages/bellows/ash.py", line 485, in data_received
    self.frame_received(frame)
  File "/usr/local/lib/python3.12/site-packages/bellows/ash.py", line 527, in frame_received
    self.data_frame_received(frame)
  File "/usr/local/lib/python3.12/site-packages/bellows/ash.py", line 548, in data_frame_received
    self._write_frame(AckFrame(res=0, ncp_ready=0, ack_num=self._rx_seq))
  File "/usr/local/lib/python3.12/site-packages/bellows/ash.py", line 599, in _write_frame
    raise NcpFailure("Transport is closed, cannot send frame")
bellows.ash.NcpFailure: Transport is closed, cannot send frame
2024-11-26 14:57:24.748 DEBUG (MainThread) [bellows.ezsp.protocol] Sending command  sendUnicast: () {'type': <EmberOutgoingMessageType.OUTGOING_DIRECT: 0>, 'indexOrDestination': 0x69B1, 'apsFrame': EmberApsFrame(profileId=260, clusterId=8, sourceEndpoint=1, destinationEndpoint=1, options=<EmberApsOption.APS_OPTION_RETRY|APS_OPTION_ENABLE_ROUTE_DISCOVERY: 320>, groupId=0, sequence=2), 'messageTag': 11, 'messageContents': b'\x00\x02\x00\x00\x00'}
2024-11-26 14:57:24.748 DEBUG (bellows.thread_0) [zigpy.serial] Connection lost: SerialException('write failed: [Errno 19] No such device')

@erkr
Copy link
Author

erkr commented Nov 29, 2024

@puddly
Another log, this is really devastating for my automations (lights, alarms, climate control). ZHA just doesn't work anymore, no errors logged whatsoever. Integration suggests everything is OK:
image
The overview custom:zha-network-card stays populated, but no updates.

I created an automation that notified me when the last seen's stop updating (logging: Integration not responsive: Check zha)
Timestamps log file:

  • Last seen updates stopped at 2024-11-29T09:41:03
  • Started debug logging at 2024-11-29T09:48:40
  • Stopped debug logging at 2024-11-29T09:52:11

The log doesn't contain a single ZHA related logging!

home-assistant (8).log

@erkr
Copy link
Author

erkr commented Nov 29, 2024

Two days ago, something slightly different happened. The symptoms were identical, except that there was 1 error logged: Watchdog heartbeat timeout. This raw log reveals a lot of ZHA related debug information, even before debug log was enabled.

  • Watchdog Error logged at 18:41:59
  • Debug started at 18:49:49
    • Debug logging didn't reveal extra ZHA loggings!
  • Debug stopped at 18:57:53
  • Dongle reseated at 18:58:54
    • That gave some ZHA related debug loggings

Zha 1855.log

@erkr
Copy link
Author

erkr commented Nov 29, 2024

@puddly Are the loggings provided of any use? Need something else?

This seems the same issue: #106185

@ryanwinter
Copy link

I'm going to try and will back to an older version I think and see if this problem goes away. Otherwise perhaps it's time to upgrade my controller.

@ryanwinter
Copy link

Ive rolled back to 2024.10.3, I'll update later this week.

Grasping at straws, but I wonder if its related to this change? The logic for a lost serial connection has changed.

@HomeAssistantPim
Copy link

Little update: as I was desperate I found a configure zigbee button under system/hardware.
Press it, it failed after minutes without description nor logging.
Zigbee could not longer initialize, after a reboot everything seems to keep working for several hours #fingers-crossed

@erkr
Copy link
Author

erkr commented Dec 8, 2024

I also encountered that ZHA was up and running until I restarted core. After that ZHA failed to initialise

@HomeAssistantPim
Copy link

So no luck and I also noticed that as of 2024.12 the backup feature also no longer works.
So 2024.12.x is a nightmare.
I found some guides to manually restore a backup, hopefully I can succeed.

@HomeAssistantPim
Copy link

@puddly I downgrade core back to 10.4 and the issues were resolved.
Now updating to 11.0 (I would like to have sections).
For me ZHA in 12.0 is a no go with SkyConnect v1.0

@Ithrial
Copy link

Ithrial commented Dec 23, 2024

I'm on 2024.12 with the SONOFF Zigbee 3.0 USB Dongle Plus-E on 7.4.5 firmware and I'm having issues as well. Running on dedicated x86 hardware

@HomeAssistantPim
Copy link

@puddly little update here...
Still suffering from this issue and it is related to either HAOS 14.0 or core 11.x.
My bluetooth dongle (rtl8761) seems to affect the operations with my SkyConnect v1.0 dongle.
This is not RF interference but something caused by the updates either with kernel, drivers or HA itself.
Currently have removed my bluetooth dongle to at least have ZHA operational.
Reported this to HAOS repo as well.

@Ithrial
Copy link

Ithrial commented Dec 23, 2024

Whats interesting is that I can get ZHA to crash almost on command during firmware updates to these new Third Reality plugs I'm installing for better network coverage.

@puddly
Copy link
Contributor

puddly commented Dec 23, 2024

@Ithrial Can you enable debug logging and post a log that includes the crash?

@Ithrial
Copy link

Ithrial commented Dec 23, 2024

yep - debugging enabled - as soon as my Third Realty plugs prompt me for an update, I'll clear my logs, kick off the update and watch it crash. The update is from firmware 0x10013050 to 0x10013058

Though now that I look at it, the steps to enable the Debug Logging is straight forward but I'm not 100% where to get the logs from unless they're in the main HA logs I can pull from the File Editor. I looked at the steps for adding logging and it said do some steps in the config.yaml file - my config.yaml has hardly anything in it so hopefully I can get some info that'll be helpful

@puddly
Copy link
Contributor

puddly commented Dec 23, 2024

A log will immediately download when you click the "Disable debug logging" button.

You can speed up the OTA checks by explicitly clicking the refresh button, it'll ask that all devices check in immediately:
Screenshot_20241223-164237_Home Assistant~2.png

@Ithrial
Copy link

Ithrial commented Dec 23, 2024

Got it - Update on "Livingroom Zigbee Switch 2" started where it had left off (around 20%) and ran to about 40ish % then crashed. Double checked if I could restart and gave me errors and failure - then I tried just toggling one of the other switches that was within ear shot to hear if it toggled (with up to date firmware) and thats where I get the error "Failed to perform the action switch/turn_off. Failed to send request: ApplicationController is not running"

See log

home-assistant_zha_2024-12-23T22-01-05.588Z.log

@luca-mg
Copy link

luca-mg commented Dec 29, 2024

I was away for the holidays and Santa apparently gave me the same bug. Throwing:
File "/usr/local/lib/python3.13/site-packages/zha/zigbee/cluster_handlers/__init__.py", line 67, in wrap_zigpy_exceptions yield File "/usr/local/lib/python3.13/site-packages/zha/zigbee/cluster_handlers/__init__.py", line 85, in wrapper return await RETRYABLE_REQUEST_DECORATOR(func)(*args, **kwargs) ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
yada yada

@ryanwinter I'm also on 2024.10.3 now

@petepall
Copy link

Same issue here with sonof dongle P. I replaced this with a connect ZBT-1 and it’s still giving me the same issue.
None of my automations or switches are working anymore. This is happening since 2024.11.

Logger: homeassistant.components.websocket_api.http.connection
Bron: components/websocket_api/commands.py:245
integratie: Home Assistant WebSocket API (documentatie, problemen)
Eerst voorgekomen: 21:17:37 (2 gebeurtenissen)
Laatst gelogd: 21:29:27

[281472362328800] Unexpected exception
[281472326271184] Unexpected exception
Traceback (most recent call last):
File "/usr/local/lib/python3.13/site-packages/zha/zigbee/cluster_handlers/init.py", line 67, in wrap_zigpy_exceptions
yield
File "/usr/local/lib/python3.13/site-packages/zha/zigbee/cluster_handlers/init.py", line 85, in wrapper
return await RETRYABLE_REQUEST_DECORATOR(func)(*args, **kwargs)
^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
File "/usr/local/lib/python3.13/site-packages/zigpy/util.py", line 136, in retry
return await func()
^^^^^^^^^^^^
File "/usr/local/lib/python3.13/site-packages/zigpy/zcl/init.py", line 375, in request
return await self._endpoint.request(
^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
...<9 lines>...
)
^
File "/usr/local/lib/python3.13/site-packages/zigpy/endpoint.py", line 270, in request
return await self.device.request(
^^^^^^^^^^^^^^^^^^^^^^^^^^
...<11 lines>...
)
^
File "/usr/local/lib/python3.13/site-packages/zigpy/device.py", line 378, in request
await send_request()
File "/usr/local/lib/python3.13/site-packages/zigpy/application.py", line 835, in request
await self.send_packet(
...<14 lines>...
)
File "/usr/local/lib/python3.13/site-packages/bellows/zigbee/application.py", line 716, in send_packet
raise ControllerError("ApplicationController is not running")
bellows.exception.ControllerError: ApplicationController is not running

The above exception was the direct cause of the following exception:

Traceback (most recent call last):
File "/usr/src/homeassistant/homeassistant/components/zha/helpers.py", line 1335, in handler
return await func(self, *args, **kwargs)
^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
File "/usr/src/homeassistant/homeassistant/components/zha/switch.py", line 63, in async_turn_off
await self.entity_data.entity.async_turn_off()
File "/usr/local/lib/python3.13/site-packages/zha/application/platforms/switch.py", line 100, in async_turn_off
await self._on_off_cluster_handler.turn_off()
File "/usr/local/lib/python3.13/site-packages/zha/zigbee/cluster_handlers/general.py", line 472, in turn_off
result = await self.off()
^^^^^^^^^^^^^^^^
File "/usr/local/lib/python3.13/site-packages/zha/zigbee/cluster_handlers/init.py", line 84, in wrapper
with wrap_zigpy_exceptions():
~~~~~~~~~~~~~~~~~~~~~^^
File "/usr/local/lib/python3.13/contextlib.py", line 162, in exit
self.gen.throw(value)
~~~~~~~~~~~~~~^^^^^^^
File "/usr/local/lib/python3.13/site-packages/zha/zigbee/cluster_handlers/init.py", line 76, in wrap_zigpy_exceptions
raise ZHAException(message) from exc
zha.exceptions.ZHAException: Failed to send request: ApplicationController is not running

The above exception was the direct cause of the following exception:

Traceback (most recent call last):
File "/usr/src/homeassistant/homeassistant/components/websocket_api/commands.py", line 245, in handle_call_service
response = await hass.services.async_call(
^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
...<7 lines>...
)
^
File "/usr/src/homeassistant/homeassistant/core.py", line 2802, in async_call
response_data = await coro
^^^^^^^^^^
File "/usr/src/homeassistant/homeassistant/core.py", line 2845, in _execute_service
return await target(service_call)
^^^^^^^^^^^^^^^^^^^^^^^^^^
File "/usr/src/homeassistant/homeassistant/helpers/service.py", line 1007, in entity_service_call
single_response = await _handle_entity_call(
^^^^^^^^^^^^^^^^^^^^^^^^^^
hass, entity, func, data, call.context
^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
)
^
File "/usr/src/homeassistant/homeassistant/helpers/service.py", line 1079, in _handle_entity_call
result = await task
^^^^^^^^^^
File "/usr/src/homeassistant/homeassistant/components/zha/helpers.py", line 1337, in handler
raise HomeAssistantError(err) from err
homeassistant.exceptions.HomeAssistantError: Failed to send request: ApplicationController is not running

@petepall
Copy link

Also getting the following in the logbook

Logger: homeassistant.components.websocket_api.http.connection
Bron: components/websocket_api/commands.py:537
integratie: Home Assistant WebSocket API (documentatie, problemen)
Eerst voorgekomen: 21:30:40 (4 gebeurtenissen)
Laatst gelogd: 21:35:08

[281472362322608] Error handling message: Unknown error (unknown_error) Peter from 192.168.0.123 (Mozilla/5.0 (iPad; CPU OS 18_2 like Mac OS X) AppleWebKit/605.1.15 (KHTML, like Gecko) Home Assistant/2024.12.2 (io.robbie.HomeAssistant; build:2024.1058; iPadOS 18.2.0) Mobile/HomeAssistant, like Safari)
[281472527399904] Error handling message: Unknown error (unknown_error) Peter from 192.168.0.123 (Mozilla/5.0 (iPad; CPU OS 18_2 like Mac OS X) AppleWebKit/605.1.15 (KHTML, like Gecko) Home Assistant/2024.12.2 (io.robbie.HomeAssistant; build:2024.1058; iPadOS 18.2.0) Mobile/HomeAssistant, like Safari)
Traceback (most recent call last):
File "/usr/src/homeassistant/homeassistant/components/websocket_api/decorators.py", line 28, in _handle_async_response
await func(hass, connection, msg)
File "/usr/src/homeassistant/homeassistant/components/websocket_api/commands.py", line 537, in handle_manifest_list
raise int_or_exc
homeassistant.loader.IntegrationNotFound: Integration 'miele' not found.

@petepall
Copy link

petepall commented Dec 29, 2024

Attached is a debug log
home-assistant_zha_2024-12-29T20-36-57.561Z.log

what I also noticed is that when I unplug the controller and plug it back in or when I do a full restart of the home assistant green device the zigbee switches work for about 10 minutes after which none of my zigbee devices are working anymore until I do the same thing again. This makes home assistant useless for me at this point as none of my automations that I need are working anymore.

@puddly
Copy link
Contributor

puddly commented Dec 29, 2024

@petepall Can you enable ZHA debug logging, let it run for however long it takes to get ApplicationController is not running, and then disable it once ZHA is working again (if it recovers)? Unfortunately, your log doesn't have much context, all I can see is that your radio on /dev/ttyUSB1 is not responding when ZHA tries to connect to it.

How are you running Home Assistant? Docker? A VM?

@petepall
Copy link

@petepall Can you enable ZHA debug logging, let it run for however long it takes to get ApplicationController is not running, and then disable it once ZHA is working again (if it recovers)? Unfortunately, your log doesn't have much context, all I can see is that your radio on /dev/ttyUSB1 is not responding when ZHA tries to connect to it.

How are you running Home Assistant? Docker? A VM?

Hi puddly, my Zhang’s is not recovering anymore until I disconnect and connect the device or do a full system reboot.
I’m running home assistant on a home assistant green device.

@petepall
Copy link

I tested connecting the controller directly to the home assistant green (with extension cable) and using a powered usb hub. Issue happens in both cases.
Running a log now after I disconnected and re-connected the device

@petepall
Copy link

Here is a new log that spans from the device working to it getting disabled and working again.
I also tried a new usb extension cable to check if that might be causing an issue

home-assistant_zha_2024-12-29T21-19-22.257Z.log

@puddly
Copy link
Contributor

puddly commented Dec 29, 2024

@Ithrial It looks like the firmware on your stick partially crashes during OTA and the stick refuses to acknowledge any data sent to it. At the same time, it isn't hearing back any acknowledgements from ZHA. It's as if the serial connection's TX line disconnects at some point. This is all only happening when there is a lot of data flowing back and forth (i.e. OTA + controlling a device at the same time).

I also see that you're running a firmware build with hardware flow control disabled. Can you try an alternative firmware for it that's configured like the one we use for the SkyConnect? sonoff-e_zigbee_ncp_hw_flow_7.4.4.0.gbl.zip (make sure to unzip it first). You can flash it the same way you probably upgraded your stick originally (via the web flasher), just pick this GBL file. Once that's done, re-configure ZHA to change the serial port's flow control to hardware and try things once more.

@puddly
Copy link
Contributor

puddly commented Dec 29, 2024

@petepall I see you're also running EmberZNet 7.4.4.0 on your ZBT-1. Do you remember when you installed it? How did you do so?

@petepall
Copy link

@puddly I received the stick with this software on it. I today unboxed it and start using it as I thought the issue was with my sonoff dongle P

@Ithrial
Copy link

Ithrial commented Dec 29, 2024

@puddly - Thank you for looking at the logs.
Install Firmware

I'm installing the firmware that you linked now and I have some firmware updates needed on the plugs - will change to hardware flow control and reply back

@Ithrial
Copy link

Ithrial commented Dec 29, 2024

@puddly First follow-up: Hardware: reconfigure with hardware flow control hung on initializing. Rebooted HA after ZHA hung at Initializing. ZHA started and everything is detected but now the Third Reality plugs are showing as up to date, even after a manual updates check. Can confirm network is up by toggling the TR Plugs power on and off. Will report back with next firmware update

@Ithrial
Copy link

Ithrial commented Dec 30, 2024

@puddly: 2nd Follow-up: Just got the prompt for my 4x Third Reality Zigbee Plugs. I updated the 1st plug thats closest to the Coordinator. Completely 100% successfully with 0 crashes. I forgot to enable Debugging mode so thankfully it completed on the first click. I enabled Debugging mode and continued with the next closest which is about 12-15ft away. in the Kitchen - it too completed 100% in the first go. Next I tried upgrading the 2 farthest away (both equidistant from the Kitchen Plug). They're slowly but surely updating (update much slower when multiple devices are updating at the same time which makes sense in a network with limited bandwidth). The update of one of the 2 simultaneous updates did crash around 80ish percent, however I was able to resume it without having to restart HA, which is a first. The 2nd of the 2 simultaneous updates completed successfully with no issues.

I'm not sure if hardware data flow control is enabled or no but I'm perfectly content in the state that its in right now, from a stability perspective

The log file from the 3 updates is 155MB for some reason so it wont let me upload the thread so I had to upload to Google Drive - See Log

@puddly
Copy link
Contributor

puddly commented Dec 30, 2024

Thanks! So as far as I can tell, the firmware doesn't crash at all once you use hardware flow control. OTA failing with concurrent updates is a different issue but I think we have the firmware problem figured out. What hardware and software are you running Home Assistant OS/container on?

@petepall The ZBT-1 shouldn't ship with 7.4.4.0, it should come with 7.3.1.0. Was your ZBT-1 used? If not, which distributor did you buy it from?

@Ithrial
Copy link

Ithrial commented Dec 30, 2024

@puddly Thanks for looking at the logs. I'm running the HAOS via the generic x86 install options on a dedicated N97 based mini pc.

@petepall
Copy link

petepall commented Dec 30, 2024

@puddly the ZBT-1 was not used as far as I know. I bought it from HA shop.
There was mentioned on their website that for usage with zigbee2MQTT ember would de needed so I selected the option for ember upgrade as I would potentially use the stick with Zigbee2MQTT

Today I changed again the extension cable and removed my powered usb hub and till now the devices are now working.

@puddly
Copy link
Contributor

puddly commented Dec 30, 2024

@petepall Can you link to the store?

@petepall
Copy link

@petepall Can you link to the store?

Here is the link https://www.hashop.nl/Home-Assistant-SkyConnect#ProductReviewText

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

No branches or pull requests