Edge count num doesnot equals the edge_df.count? #644
Replies: 1 comment
-
After debugging the write process, it was found that duplicate vertex primary keys (PK) occurred during the definition of the Vertex_DF, leading to duplicate PKs in the vertex indexing. This caused duplicate edges when the internal edge_df joined the edge table with the vertex mapping, resulting in an increased edge count. |
Beta Was this translation helpful? Give feedback.
0 replies
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
-
I read three columns from orc files, and construct a edge_df use: orcfiles_df.select("src", "dst", "attr"), and use GraphWrite to write a graphdata , use putEdgeData(tag, edge_df) to write . but when write successfully, I read the write data with EdgeReader,the edge_count files read num is more bigger than the edge_df.count(). By the way , the vertex_df.count() equals the numbers in vertex_count file .
Is there any method that requires special attention when using it, which I might have overlooked?
Beta Was this translation helpful? Give feedback.
All reactions