You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
{{ message }}
This repository has been archived by the owner on Jan 30, 2020. It is now read-only.
Launch fleet units by running sudo docker run --net=host tleyden5iwx/couchbase-cluster-go update-wrapper couchbase-fleet launch-cbs --version 3.0.1 --num-nodes 3 --userpass "user:passw0rd" -- this dynamically generates fleet units based on templates, then submits them via the fleet api.
At this point, my journalctl -b -u fleet.service --no-pager logs are:
Run the same command as earlier to kick things off: sudo docker run --net=host tleyden5iwx/couchbase-cluster-go update-wrapper couchbase-fleet launch-cbs --version 3.0.1 --num-nodes 3 --userpass "user:passw0rd"
Didn't reproduce the bug this time, but I repeated the Stop + destroy units and Restart units steps three times (third time a charm!) and was able to reproduce it.
Fleet units:
$ fleetctl list-units
UNIT MACHINE ACTIVE SUB
couchbase_node@1.service 8995d6d7.../10.156.7.12 active running
couchbase_node@2.service ad8cb97d.../10.239.174.35 active running
couchbase_node@3.service cc2b61a5.../10.141.247.11 active running
couchbase_sidekick@1.service 8995d6d7.../10.156.7.12 failed failed
couchbase_sidekick@2.service ad8cb97d.../10.239.174.35 active running
couchbase_sidekick@3.service cc2b61a5.../10.141.247.11 failed failed
On machine 11, which has one of the failed units, there is an error:
ERROR manager.go:136: Failed to trigger systemd unit couchbase_sidekick@3.service start: Unit couchbase_sidekick@3.service failed to load: No such file or directory.
Likewise on machine 12 which also has a failed unit, there is an identical error:
ERROR manager.go:136: Failed to trigger systemd unit couchbase_sidekick@1.service start: Unit couchbase_sidekick@1.service failed to load: No such file or directory.
The text was updated successfully, but these errors were encountered:
I'm running CoreOS alpha 612.1.0 and launching / destroying units via the fleet REST api. I'm seeing sporadic issues where units fail to start.
Here's a full walkthrough of what I'm doing to reproduce the issue:
Start units
sudo docker run --net=host tleyden5iwx/couchbase-cluster-go update-wrapper couchbase-fleet launch-cbs --version 3.0.1 --num-nodes 3 --userpass "user:passw0rd"
-- this dynamically generates fleet units based on templates, then submits them via the fleet api.At this point, my
journalctl -b -u fleet.service --no-pager
logs are:Stop + destroy units
This just stops and destroys all units, its essentially the equivalent of
fleetctl stop * && fleetctl destroy *
:Verify everything is clean:
At this point, my
journalctl -b -u fleet.service --no-pager
logs are:Restart units
Run the same command as earlier to kick things off:
sudo docker run --net=host tleyden5iwx/couchbase-cluster-go update-wrapper couchbase-fleet launch-cbs --version 3.0.1 --num-nodes 3 --userpass "user:passw0rd"
Didn't reproduce the bug this time, but I repeated the Stop + destroy units and Restart units steps three times (third time a charm!) and was able to reproduce it.
Fleet units:
Unit files:
Journalctl logs:
Analyzing the logs
On machine 11, which has one of the failed units, there is an error:
Likewise on machine 12 which also has a failed unit, there is an identical error:
The text was updated successfully, but these errors were encountered: