OrientDB concurrent writes in distributed mode - not working #8663

arpitaagarwal · 2018-11-16T08:05:25Z

OrientDB Version: 3.0.10

Java Version: 8

OS: MacOS 10.13.6 and Ubuntu 16.04.3 LTS (Its a containerised environment)

Expected behavior

We have a write intensive application and need to deploy OrientDB in distributed mode for HA in production. We are using OrientDB blueprint APIs.
OrientDB client should be able to connect to the cluster and create/write edges and vertices concurrently distributing load across different nodes. This is required for -
(1) Faster writes
(2) High Availability

Actual behavior

2 nodes cluster works fine with 1-2 threads. As number of threads increases, it starts giving timeout exceptions and OrientDB client hangs forever waiting for synchronisation to complete. It never recovers.

Steps to reproduce

A default deployment of OrientDB 3.0.10 without any config changes.
Start 2 nodes on orientdb through dserver.sh file on same machine. Both these nodes will start communicating with each other by sharing ip-address through multicast.
Use orientdb client connects to server with ipaddress, no stick ports.
Client writes a vertex-edge-vertex (Person-FriendOf-Person) through a single commit operation.
Add unique index on Person.name to avoid creating multiple vertex of same name.
Run this test multiple time with single thread (works fine).
Increase number of threads to see following in logs - Timeout (1244ms) on waiting for synchronous responses from nodes=[node2, node1] responsesSoFar=[node2] request=(id=1.105226 task=TxPhase1) [ODistributedDatabaseImpl]

Can someone suggest what's wrong in this deployment? We need to write millions of vertex-edge-vertex combinations and build a graph.
cc @lvca @tglman @Andrey Lomakin @luigidellaquila

cschockaert · 2019-05-22T15:57:51Z

+1 got exactly the same problem with odb 3.3.18 here
never got this problem in odb latest 2.X version

Jotschi · 2019-05-22T16:25:18Z

@tglman Do you think this issue is similar to the ones that were handled in private issue (10586 - Cluster communication failure) - Is this solved with 3.0.19?

Jotschi · 2019-05-22T16:26:24Z

Relates to #8742

jonsalvas mentioned this issue Aug 6, 2019

ODistributedRecordLockedException Timeout (100ms) on acquiring lock on record #xx:-1 on server 'xxx'. It is locked by request null #8974

Closed

andrii0lomakin closed this as completed Aug 6, 2021

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

OrientDB concurrent writes in distributed mode - not working #8663

OrientDB concurrent writes in distributed mode - not working #8663

arpitaagarwal commented Nov 16, 2018

cschockaert commented May 22, 2019

Jotschi commented May 22, 2019

Jotschi commented May 22, 2019

OrientDB concurrent writes in distributed mode - not working #8663

OrientDB concurrent writes in distributed mode - not working #8663

Comments

arpitaagarwal commented Nov 16, 2018

OrientDB Version: 3.0.10

Java Version: 8

OS: MacOS 10.13.6 and Ubuntu 16.04.3 LTS (Its a containerised environment)

Expected behavior

Actual behavior

Steps to reproduce

cschockaert commented May 22, 2019

Jotschi commented May 22, 2019

Jotschi commented May 22, 2019