Type guessing should attempt to go via elements first #4697

IAlibay · 2024-08-31T09:17:40Z

Current behaviour

The current behaviour, and something we reinforce in #3753, is that type guessing works by attempting to guess elements from atom names.

Proposed behaviour

The proposal here is to always attempt to read from elements first rather than guessing through names. If elements exist AND they are complete, then you return those rather than guessing them.
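To make the proposed order concrete, here is a minimal sketch in plain Python. It is not MDAnalysis code: `toy_guess_element` and `types_via_elements_first` are hypothetical names, and the name-based guesser is a deliberately crude stand-in (first letter only) for whatever the real guesser does. "Complete" is assumed here to mean one non-empty element per atom.

```python
from typing import List, Optional, Sequence


def toy_guess_element(name: str) -> str:
    """Hypothetical stand-in for name-based guessing (NOT the MDAnalysis guesser).

    Returns the first alphabetic character of the atom name, upper-cased,
    or an empty string if the name contains no letters.
    """
    for ch in name:
        if ch.isalpha():
            return ch.upper()
    return ""


def types_via_elements_first(
    names: Sequence[str],
    elements: Optional[Sequence[str]] = None,
) -> List[str]:
    """Return atom types, preferring existing elements over guessing from names."""
    # Assumption: "complete" means one non-empty element string per atom.
    elements_complete = (
        elements is not None
        and len(elements) == len(names)
        and all(elements)
    )
    if elements_complete:
        # Elements exist and are complete: return them without guessing.
        return list(elements)
    # Otherwise fall back to guessing from the atom names.
    return [toy_guess_element(name) for name in names]
```

With complete elements, `types_via_elements_first(["AR1", "AR2"], ["Ar", "Ar"])` returns the elements unchanged; with missing or partial elements, the function falls back to the name-based guess for every atom.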

Where would this matter?

A good example here is FHIAIMS where:

names == elements
types == guessed elements from names

In this case it would have been "safer" (i.e. fewer code paths exercised) to simply set names == elements == types.

Target release

This needs discussion. From my own limited look at things, there aren't any cases where making this behaviour change would negatively impact behaviour. Indeed, I'm not sure I can see any cases where behaviour would change.

IAlibay · 2024-08-31T09:21:17Z

cc @lilyminium

lilyminium · 2024-09-02T02:00:55Z

Agreed, I think this would make a lot more sense! It also avoids potential weirdness in cases like the RDKitParser, where type guessing can end up guessing elements from atom names that may variously be MonomerInfo names or _TriposAtomNames. However, this probably needs @MDAnalysis/coredevs consensus.

isolated-matrix · 2024-10-05T09:43:37Z

Hi there, I was just wondering, would this proposed change address the issue where, as is stated in the MDAnalysis documentation, 'atoms named “CA” are much more likely to represent an alpha-carbon than a calcium atom'? I ask because I'm trying to analyse a system containing argon (Ar), and MDAnalysis is coming up with an error stating that the atom mass cannot be guessed.

lilyminium self-assigned this Sep 2, 2024

orbeckst added the Component-Topology label Sep 26, 2024

IAlibay modified the milestones: Release 2.8.0, Release 3.0 Oct 13, 2024
