Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Raise an error when two classes have the same ambiguous attribute #352

Open
wants to merge 1 commit into
base: main
Choose a base branch
from

Conversation

vincentkelleher
Copy link

@vincentkelleher vincentkelleher force-pushed the raise_error_for_ambiguous_attributes branch 3 times, most recently from fab66c3 to ad8d5ad Compare December 5, 2024 12:18
Copy link
Member

@cmungall cmungall left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Can we extend the test to also make sure that no premature exceptions are raised when using schemaview to cycle through all slots and do a lookup (or to ensure that randomAttribute is not returned on get_slots without attributes=True)

@sneakers-the-rat
Copy link
Contributor

in my opinion the problem is the ambiguity of get_slot returning attributes at all, and there is actually nothing wrong with the schema in this case - both attributes can be defined within the context of independent classes without expectation that they represent the same thing. i think i said this in the other issue but i think there are 3 dispositions for an attribute

  • attribute defined only within current class (no parents): the current case - the attribute is only defined within the context of the class and does not have independent meaning otherwise
  • attribute defined in a subclass and a parent class: the attribute is only defined within the class lineage, and Subclass.attribute has an implicit is_a relationship with ParentClass.attribute
  • attribute explicitly inherits from a slot via an explicit is_a and extends it: attribute defined within a class is independent with an inheritance relationship to the slot, which does have independent meaning outside the class.

So I think the error message is misleading, or at least not how linkml should behave (in my opinion) - imo it should say something more like "Multiple attributes with the same name found in classes: {list of class names where attr is defined}. Access the attribute from the class directly using induced_slot(slot_name, class_name)"

@@ -631,8 +631,8 @@ def get_slot(self, slot_name: SLOT_NAME, imports=True, attributes=True, strict=F
for c in self.all_classes(imports=imports).values():
if slot_name in c.attributes:
if slot is not None:
# slot name is ambiguous: return a stub slot
return SlotDefinition(slot_name)
raise ValueError(f'Attribute "{slot_name}" is already defined in another class, please use a '
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

would be great to start making more specific exception types in these packages - super useful for downstream programming. so idk like

class AmbiguityError(ValueError):
    pass

class AmbiguousNameError(AmbiguityError):
    pass

and then one would both catch ValueError's as well as having a more specific catch for that type of exception if someone wanted to (rather than needing to catch ValueError and then check the message text to know what kind of ValueError was raised)

@vincentkelleher
Copy link
Author

Thank you for your feedback 🙏

If I try to summarize everything before coding a solution:

  1. @cmungall you are asking me to add a test where I would look for an actual slot (which I can add to ImportantSecondClass for example) but with the attributes=False parameter to make sure my change doesn't interfere with usual slot collection ?
  2. @sneakers-the-rat I kind of understand your point with certain cases of ambiguous attributes being possible as they are not supposed to be considered as slots which would be normal for the Python generator for example as attributes of a class are isolated from the outside world. The issue is that we will have two different behaviors in generators such as the Python/Java/... generators and the JSON-LD Context/SHACL/... generators. Maybe we should consider having separate methods for slots and attributes but that has massive impacts in existing code 🤔
  3. @sneakers-the-rat I totally agree with more specific exceptions, I just didn't want to force my coding habits on others 😇

@sneakers-the-rat
Copy link
Contributor

The issue is that we will have two different behaviors in generators such as the Python/Java/... generators and the JSON-LD Context/SHACL/... generators.

Again writing from my opinion here and dont mean to hold this up - imo all the RDF generators should give URIs like {prefix}{class_name}.{attribute_name} to attributes and then use RDFs/OWL terms to indicate relationships, so e.g. if you had two example attrs and a slot, they would be like

http://example.com/ClassA.example
http://example.com/ClassB.example
http://example.com/example

My comment here was just about the error message wording tho - it's a little unclear about what the problem is and what needs to be done: so it could say something like directed at a schema author "these attributes will be ambiguous in RDF generators and you may need to rename them or restructure the schema" or directed at a generator developer/direct user of the schema "to access these attributes use induced_slot"

@vincentkelleher
Copy link
Author

@sneakers-the-rat I think I got your point 👍

In that case we can keep the fact that it throws an exception when ambiguous attributes are found except I will write a clearer error message to push developers/ontologists to either fix the schema or use another method.

If ever your attribute prefixing solution comes to fruition, we can remove this exception altogether.

Is that okay for you ?

Closes linkml/linkml#2403

Signed-off-by: Vincent Kelleher <vincent.kelleher@gaia-x.eu>
@vincentkelleher vincentkelleher force-pushed the raise_error_for_ambiguous_attributes branch from ad8d5ad to 1dd9f6f Compare December 23, 2024 13:40
@vincentkelleher
Copy link
Author

@cmungall @sneakers-the-rat I've just updated the pull-request with your suggestions 😇

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

Ambiguous attributes should not be accepted ?
3 participants