You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
When a node is rebooted and misses writes because of that, these writes should be replicated when the node enters the cluster again.
Actual behavior
Rebooting a node while there are concurrent write accesses on all nodes sometimes causes the node to lose a record and not replicate it afterwards.
Steps to reproduce
We run a test cluster with three nodes behind a load balancer, each with a java-based application server (Apache Karaf). We start a load test that accesses a java-based REST-API which then uses the Orientdb-client to make a write-access at the local node. Our load test runs 10 threads in parallel load balanced across all nodes. We use one technical user, and the writes consist of two new vertexes and an edge connecting the two wrapped in a transaction, without unique indexes.
When we reboot the second node (clean shutdown, no kill -9 or poweroff), the other two nodes continue working. But after the second node rejoins the cluster, sometimes certain records are missing:
Query on node 1 ("ErzeugtAm" is the createDate):
orientdb {db=b-OS}> SELECT @Rid, ErzeugtAm FROM TAAAngebot WHERE ErzeugtAm < '2018-03-19 08:39:35' AND ErzeugtAm > '2018-03-19 08:39:20';
I had the same issue with Orientdb 3.0.4, few documents are missing after a node restart.
I found this :
When a node is down, if I remove the entire database instance folder and restart the node, it works fine (= transfers the entire database).
Maybe the function OSyncDatabaseDeltaTask causes a problem.
OrientDB Version: 2.3.30
Java Version: 1.8.0_45
OS: Ubuntu 14.0.4.3 LTS
Distributed Config: 3 nodes, no sharding
Expected behavior
When a node is rebooted and misses writes because of that, these writes should be replicated when the node enters the cluster again.
Actual behavior
Rebooting a node while there are concurrent write accesses on all nodes sometimes causes the node to lose a record and not replicate it afterwards.
Steps to reproduce
We run a test cluster with three nodes behind a load balancer, each with a java-based application server (Apache Karaf). We start a load test that accesses a java-based REST-API which then uses the Orientdb-client to make a write-access at the local node. Our load test runs 10 threads in parallel load balanced across all nodes. We use one technical user, and the writes consist of two new vertexes and an edge connecting the two wrapped in a transaction, without unique indexes.
When we reboot the second node (clean shutdown, no kill -9 or poweroff), the other two nodes continue working. But after the second node rejoins the cluster, sometimes certain records are missing:
Query on node 1 ("ErzeugtAm" is the createDate):
Query on node 2:
Query on node 3:
There are no visible errors in the logfiles:
server-1.log
server-2.log
server-3.log
Regards,
KluSe
The text was updated successfully, but these errors were encountered: