ACSF descriptors for ML #87
-
Beta Was this translation helpful? Give feedback.
Replies: 1 comment
-
Hi @WSukmas, Q1: So, my question is how do I exploit these lines of data as descriptors for ML? Can I just average all the rows of each sample and use it to train the model to predict the formation energy? Q2: if I'm working on multi-element systems, let's say, "A-O" systems where A can be any elements. Now that species = ["Ac, Al, Ag, ....] becomes a huge list, which consequently leads to a ridiculously large set of descriptors. So, is there an alternative method to work around this problem, or I've missed something? |
Beta Was this translation helpful? Give feedback.
Hi @WSukmas,
Q1: So, my question is how do I exploit these lines of data as descriptors for ML? Can I just average all the rows of each sample and use it to train the model to predict the formation energy?
A1: ACSF is a local descriptor, so it only describes a localized region in your structure. In order to use such a localized feature to predict a global property (formation energy per atom = formation energy / number of atoms), you need to jump through some additional hoops. Either you somehow squeeze out a single vector from the ACSF of all atoms (average, REMatch, etc.) or then you essentially train a more complex model that can e.g. predict the formation energy by looking at each ACSF…