Support 1 to 1 associations at the schema level #2511

camway · 2018-07-31T20:11:11Z

Me and another member of the dgraph forum have been going back and forth on how exactly this should be done. Feel free to read

https://discuss.dgraph.io/t/return-single-object-instead-array-for-exactly-one-relation-between-node/2927/7

The solution proposed by bennyrio (the person I've been talking to in the forum) was elaborated on after creating this. I do not want to misrepresent his view on this so below is a direct link to his explanation:

https://discuss.dgraph.io/t/return-single-object-instead-array-for-exactly-one-relation-between-node/2927/8

The essence of my request is that I think we should change the schema so that uid associations are capable of being defined as 1 to 1. Unfortunately, my proposal will be a breaking change, but I think it will make uid predicates consistent with the other predicate types.

Currently a schema of this:

a: uid .
b: [string] .
c: int .

Exhibits these behaviors:

a is capable of returning 0 or more nodes
b is capable of returning 0 or more strings
c is capable of returning 0 or 1 integers

My proposal would look like this:

a: uid .
b: [string] .
c: int .
d: [uid] .

Exhibiting these behaviors:

a is capable of returning 0 or 1 nodes
b is capable of returning 0 or more strings
c is capable of returning 0 or 1 integers
d is capable of returning 0 or more nodes

This would also require that any mutation which would violate this 1 to 1 association should abort with an error.

This would also mean that existing schemas would have to be updated, but I believe this will make for consistency across all predicates types when defining a list vs a single item.

The text was updated successfully, but these errors were encountered:

ahopkins · 2018-08-06T19:09:16Z

Honestly ... as a newcomer to dgraph ... this is how I initially expected it to work.

If I want to map multiple "things", I wrap it in []. For example: [str]. That makes sense.

EXCEPT if I am mapping to multiple uid. Then I just leave it as uid. That does not make sense.

manishrjain · 2018-08-07T06:04:29Z

Hmm... This looks like a very convincing argument. In essence, if you can have multiple edges from one node to another, the schema must be defined as [uid], and the result would be a list of JSON objects. Otherwise, if defined as uid, then there can only be at most one edge and the result would be a JSON map.

I haven't thought through all the implications of this, but as an initial thought, it makes a lot of sense to be this way.

slawo · 2018-08-09T12:36:31Z

There should be a consistency tag put on this.

insanitybit · 2018-08-11T19:33:00Z

The proposed behavior is absolutely what I had expected when going into dgraph, and I even just today had to remind myself that [uid] wasn't an option.

camway · 2018-08-13T16:33:40Z

@ahopkins and @insanitybit, that seems to be the general consensus. I don't know what it will take to get this implemented, but I think I can safely speak for us (and others) in saying we want this functionality. It seems like this is something many people have found workarounds for, but it seems like it's something the database should do for it's users.

bandirsen · 2018-08-16T04:33:38Z

@camway

Thing to consider, predicate with scalar value and predicate with uid relation is different thing.

p: [string] mean, a node can have zero or one of this predicate, and the predicate hold list of string value, its predicate/edge multiplicity is still zero or one, same as p: string, but the value type is different, now its a list of string.

while p:uid mean, a node can have zero or one of this predicate, and p:[uid] mean a node can have zero, one or more of this predicate, which mean predicate/edge multiplicity is changed.

The difference is clear when we do set mutation, in current implementation, when we do set mutation on a already existed scalar value predicate with a new value, it will automatically update the old value with new value (or if it was a list type, it will add new item to the list). On the other hand, set mutating on already existed uid predicate with new uid will add new uid-to-uid relation, which mean predicate/edge multiplicity is changed

Since this feature request is about predicate/edge multiplicity constraints not value type, I think it would be better if we implement this feature as '@' tag, just like @reverse tag, for example:
p: uid @one @Index @reverse which mean a node can have zero or one of this predicate.
p: uid @many @Index @reverse** which mean a node can have zero, one or more of this predicate

ahopkins · 2018-08-16T04:42:14Z

@bandirsen I cannot speak to the technical implementation reasons for one versus the other. But if it is something that can be accomplished (liminiting cardinality at the dB level), then it seems like a syntactic issue. And, it seems counterintuitive to handle cardinality with brackets in one place and a directive in another.

camway · 2018-08-16T13:57:41Z

@bandirsen I'm going to try to proceed through this as methodically as possible. Please tell me where I've gone wrong because I truly do not understand why you are still fighting this change.

A few notes.

p: [string] mean, a node can have zero or one of this predicate, and the predicate hold list of string value, its predicate/edge multiplicity is still zero or one, same as p: string, but the value type is different, now its a list of string.

p: [string] means a node can have zero or more of this predicate. Writing it in your form makes it sound worse than it is. It would be like saying I have (2mill/1mill) options, instead of saying I have 2 options.

while p:uid mean, a node can have zero or one of this predicate, and p:[uid] mean a node can have zero, one or more of this predicate, which mean predicate/edge multiplicity is changed.

In the proposed change, the simplified form is:
p:uid means zero or one
p:[uid] means zero or more

I know you've been pushing for this to be a directive since before I jumped on this bandwagon, and I'm not opposed to that being an option. I don't see any problem with adding a directive such as @one that causes the relationship to drop the array and retrieve just a single item. This would be useful for problems where I want a key in my response such as "most_recent_post" in my response shape. Of course the posts in this case need to hold zero or more, but in this use case it would be useful to not have it as an array. It would also be useful in certain cases to combine this with first, last, and order statements. This was why I didn't ask you to close your github issue, I thought both changes were equally valid.

What you have not addressed is the major inconsistency in the syntax when defining the schema. That is the primary problem this issue is designed to fix, and a fix for it appears no where in your argument. This issue is also designed to allow the database - at its schema level - to address the problem of one to one associations which your argument also fails to address.

The inconsistency issue is the primary problem here. It seems everyone who starts using Dgraph (including me) instinctively tries to use uid and [uid] because after seeing how all other predicates work, this would just make sense. It's counter-intuitive. Unless your solution includes a way to solve this inconsistency, then I cannot discount the value of this proposed change.

Please tell me if I've gone wrong somewhere. I do not understand your stance against this change, and I'm trying my best.

bandirsen · 2018-08-16T17:37:05Z

@camway @ahopkins

p: [string] means a node can have zero or more of this predicate

I think I was wrong all this time, I always thought in DGraph treat list type is someway like this.

<aNode> <aPredicate> ["string1", "string2", "string3"]

if it's treat like this:

<aNode> <aPredicate> "string1"
<aNode> <aPredicate> "string2"
<aNode> <aPredicate> "string3"

Then it p:[string] will have same behaviour like p:[uid] for all mutation process and my arguments to recommends predicate multiplicity as @ directive are wrong.

camway · 2018-08-16T17:43:43Z

@bandirsen I'd have to defer to the Dgraph team for that. Honestly, the internals of this database are something I find very interesting, but I struggle to understand how much of the system works. Part of this is I just don't have enough available time to invest in order to learn it. The other part - simply put - is it's complicated.

@manishrjain could you or another member of the team give us your thoughts on @bandirsen's post above?

manishrjain · 2018-08-16T18:04:28Z

Dgraph stores values and nodes in posting list format. Multiple values for the same (sub, pred) are stored in a single posting list. Similarly, multiple nodes for the same (sub, pred) are stored in a single posting list as well.

We currently enforce that there's only one value when a user specifies a non-list data type (like string, or int). When a user specifies a list type (like [string]), then the posting list holds all the elements of the list in a single posting list object.

Currently, we don't enforce the singularity of a uid type in a posting list, and so for a node connection, a posting list can hold as many uids as provided. With this proposal, uid type would also start to enforce a single uid per posting list for the predicate. Hope that sheds some light on the internals.

camway · 2018-08-16T18:19:41Z

@manishrjain Thank you for the fast response. I think that clears up everything for me.

bandirsen · 2018-08-16T19:00:35Z

@camway @manishrjain

Thank you for explanation, It's clear now, in dgraph scalar value and node are treated equally, then p:[uid] is better choice.

I come from Neo4J, I always thought, predicate with scalar value is like Node Property and predicate with uid is like Edge, and Neo4J treat Node Property and Edge differently.

camway · 2018-08-16T20:49:28Z

@bandirsen No problem here. I've looked into Neo4J, but I've never used it. It's on my list to learn.

pmualaba · 2018-08-27T09:56:27Z

@manishrjain
quote: ...Currently, we don't enforce the singularity of a uid type in a posting list, and so for a node connection, a posting list can hold as many uids as provided. With this proposal, uid type would also start to enforce a single uid per posting list for the predicate....

The desired outcome of this proposal is that the GraphQl+- queryresult JSON object, should always return an Object { } for uid predicates, and an Array [ ] for [uid] predicates. I hope this change can be pushed higher up in the priority list, because it is part of the foundation on which the upper layer application code is built.

camway · 2018-08-27T13:26:59Z

@pmualaba

The desired outcome of this proposal is that the GraphQl+- queryresult JSON object, should always return an Object { } for uid predicates, and an Array [ ] for [uid] predicates.

I think what @manishrjain was describing is the behavior change "under the hood". What you described is ultimately what the low level behavior change would result in. @manishrjain please correct me if I'm wrong here.

I hope this change can be pushed higher up in the priority list, because it is part of the foundation on which the upper layer application code is built.

I believe the "p1" tag applied stands for Priority 1 (again @manishrjain correct me if I'm wrong). Which would mean it's essentially at the highest priority a feature request can be at.

ahopkins · 2018-08-27T13:38:31Z

For what it is worth.... I am building this into my library pydiggy. Not ideal because it is application logic that should probably be handled at the data layer. But it is a basic enough concept that I think it is important for the end user of my library to have this "feature".

The idea is to use type annotations:

class A(Node):
    pass

class B(Node):
    single: A

class C(Node):
    multi: List[A]

To achieve this:

a1 = A()
a2 = A()

b.single = a1

c.multi = [a1, a2]

Then, after querying the DB, and hydrating them back from JSON to Python objects, we still get:

>>> b.single
<A:123>

>>> c.multi
[<A:123>, <A:456>]

Vliro · 2018-11-29T14:17:14Z

How is this coming along? It is the single most needed feature for our backend, because as of right now objects that are singular are json-parsed as arrays, which makes frontend a nightmare.

manishrjain · 2018-11-29T15:24:24Z

Given this is a breaking change, we will make it part of v1.1 release. Eta Jan 2019.

Vliro · 2018-11-29T15:42:22Z

Thank you for the update @manishrjain . Looking forward to it! Will it be available earlier as part of a nightly release, or merged at the date of arrival?

martinmr · 2019-01-14T18:53:59Z

PR referenced above will address this issue. See PR for more details.

martinmr · 2019-01-18T22:23:40Z

Closing as the fix has been submitted. Feature should be available starting with the 1.1 release.

There is an issue introduced for this change in the qmstr repository: #398 Due to Dgraph behavior: dgraph-io/dgraph#2511, when querying for file node with filedata, the response returns an array of filedata, instead of 0 or 1 filedata. This breaks qmstr when unmarshaling to file node's structure.

manishrjain changed the title ~~Feature - support 1 to 1 associations at the schema level~~ Support 1 to 1 associations at the schema level Aug 7, 2018

manishrjain added breaking_change kind/enhancement Something could be better. priority/P1 Serious issue that requires eventual attention (can wait a bit) labels Aug 7, 2018

bandirsen mentioned this issue Aug 7, 2018

Feature Request: Query filter to return single object instead array on node-to-node relationship #2508

Closed

manishrjain mentioned this issue Oct 23, 2018

Assign single UID to predicate #1519

Closed

martinmr mentioned this issue Jan 14, 2019

Add ability to set schema to a single UID schema. #2895

Merged

martinmr closed this as completed Jan 18, 2019

martinmr self-assigned this Jan 18, 2019

GiasemiSh mentioned this issue Sep 2, 2019

Remove convenience function which decodes query response to structure QMSTR/qmstr#398

Closed

pepoospina mentioned this issue Sep 27, 2019

Support 1 to 1 associations seems to fail in V1.1.0 #4080

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Support 1 to 1 associations at the schema level #2511

Support 1 to 1 associations at the schema level #2511

camway commented Jul 31, 2018 •

edited

Loading

ahopkins commented Aug 6, 2018 •

edited

Loading

manishrjain commented Aug 7, 2018

slawo commented Aug 9, 2018

insanitybit commented Aug 11, 2018

camway commented Aug 13, 2018

bandirsen commented Aug 16, 2018

ahopkins commented Aug 16, 2018

camway commented Aug 16, 2018

bandirsen commented Aug 16, 2018 •

edited

Loading

camway commented Aug 16, 2018

manishrjain commented Aug 16, 2018

camway commented Aug 16, 2018

bandirsen commented Aug 16, 2018

camway commented Aug 16, 2018

pmualaba commented Aug 27, 2018

camway commented Aug 27, 2018 •

edited

Loading

ahopkins commented Aug 27, 2018 •

edited

Loading

Vliro commented Nov 29, 2018

manishrjain commented Nov 29, 2018

Vliro commented Nov 29, 2018

martinmr commented Jan 14, 2019

martinmr commented Jan 18, 2019

Support 1 to 1 associations at the schema level #2511

Support 1 to 1 associations at the schema level #2511

Comments

camway commented Jul 31, 2018 • edited Loading

ahopkins commented Aug 6, 2018 • edited Loading

manishrjain commented Aug 7, 2018

slawo commented Aug 9, 2018

insanitybit commented Aug 11, 2018

camway commented Aug 13, 2018

bandirsen commented Aug 16, 2018

ahopkins commented Aug 16, 2018

camway commented Aug 16, 2018

bandirsen commented Aug 16, 2018 • edited Loading

camway commented Aug 16, 2018

manishrjain commented Aug 16, 2018

camway commented Aug 16, 2018

bandirsen commented Aug 16, 2018

camway commented Aug 16, 2018

pmualaba commented Aug 27, 2018

camway commented Aug 27, 2018 • edited Loading

ahopkins commented Aug 27, 2018 • edited Loading

Vliro commented Nov 29, 2018

manishrjain commented Nov 29, 2018

Vliro commented Nov 29, 2018

martinmr commented Jan 14, 2019

martinmr commented Jan 18, 2019

camway commented Jul 31, 2018 •

edited

Loading

ahopkins commented Aug 6, 2018 •

edited

Loading

bandirsen commented Aug 16, 2018 •

edited

Loading

camway commented Aug 27, 2018 •

edited

Loading

ahopkins commented Aug 27, 2018 •

edited

Loading