Implement energy landscapes and pathways from simulation #178

SCiarella · 2023-10-24T14:47:22Z

This branch implements the analysis of optimal pathways between sites.
Only site-to-site pathways are implemented for now, but later it will be extended to identify percolating pathways and reproduce literature results.
The notebook 'paths.ipynb' shows the addition of this branch.

stefsmeets · 2023-10-30T09:47:45Z

Nice work! Let me know when this is finished, and I'll happily review this code.

Couple of things so far:

Please put any documenting notebooks here to avoid ballooning this repo.
Do we need a custom Dijksta implementation? Why not use a well-tested one from scipy or networkx?
Can you add some tests for these new functions?
What is this direct path for?

SCiarella · 2023-11-06T16:15:39Z

The branch has been updated and it is ready for review. In particular, the specific comments by @stefsmeets have been addressed in the following way:

Please put any documenting notebooks here to avoid ballooning this repo.

Done

Do we need a custom Dijksta implementation? Why not use a well-tested one from scipy or networkx?

Switched to networkx

Can you add some tests for these new functions?

I am not an expert with testing, so I would like to discuss next time about the 'best practices' in this direction

What is this direct path for?

It is used to show that it does not work, as mentioned by @tfamprikis in #144. I have now moved direct_path from the main gemdat package to path.ipynb, since its use is only pedagogical.

src/gemdat/transitions.py

v1kko

Looks good to me if you add a test for the perculating pathway

stefsmeets

Hi @SCiarella , Nice work! Just made a pass through the code, some small comments from my side for now. I will have a look at the notebook tomorrow and the PR as a whole! 😃

src/gemdat/plots/plotly/_density.py

src/gemdat/plots/plotly/_paths.py

src/gemdat/transitions.py

src/gemdat/volume.py

stefsmeets · 2023-11-08T16:24:08Z

src/gemdat/volume.py

+        """
+        prob = self.data / self.data.sum()
+        free_energy = -kBT * np.log(prob)
+        return np.nan_to_num(free_energy)


Maybe not for this PR, but for the long term we could consider that all methods like this return an instance of Volume with some flag that tells that it is a probability or free energy, and have some .units attribute to keep track of the units.

src/gemdat/transitions.py

stefsmeets

I just checked the notebook, and I'm wondering if you can think a bit of how you want people to use this code. I find some parts not very obvious and I think it can be streamlined quite a bit.

Feel free to tackle this in a follow-up PR if that makes more sense.

Let me know what you think!

Free energy

This is the current api in this PR:

from gemdat.transitions import optimal_path, free_energy_graph

diff_volume = trajectory_to_volume(diff_trajectory, resolution=0.3)

F = diff_volume.get_free_energy(kBT=trajectory.metadata['temperature'])

F_graph = free_energy_graph(F, max_energy_threshold=1e7, diagonal=True)

start_point = (49, 26, 10)
end_point = (10, 0, 8)

path, path_energy = optimal_path(
    F_graph,
    start_point,
    end_point,
)

fig1 = plots.path_on_landscape(diff_volume, path, structure)
fig1.show()

fig2 = plots.energy_along_path(energy_path=path_energy)
fig2.show()

Playing around a little bit, this is what I would suggest we aim for:

we can grab the temperature from the trajectory metadata
F is a an instance of Volume or a subclass of Volume with energy related methods
no need for any extra gemdat imports
does not hide the networkx graph, so users can do whatever they want with it

import networkx as nx

diff_volume = trajectory.to_volume(resolution=0.3)

F = diff_volume.to_free_energy()
G = F.to_graph(max_energy_threshold=1e7, diagonal=True)

source = (49, 26, 10)
target = (10, 0, 8)

optimal_path = nx.shortest_path(G,
                                source=source,
                                target=target,
                                weight='weight')

fig1 = plots.path_on_landscape(diff_volume, optimal_path, structure)
fig1.show()

energy = [G.nodes[node]['energy'] for node in optimal_path]

fig2 = plots.energy_along_path(energy_path=energy)
fig2.show()

Percolation

Same goes for the percolation, my suggestion would be for an api that looks something like this:

no need to import anything = more discoverable
returning a dataclass of some sort makes it easier to extend the code
hides the wrapping in a method

peaks = F.find_peaks()
best_path = F.find_best_perc_path(peaks, perc_xyz=(True, False, False))

print(f"Total Energy required: {best_path.total_energy_cost}")
print(f"Starting Point: {best_path.starting_point}")
print(f"Best Path: {best_path.best_path}")
print(f"Best Path Energy: {best_path.best_path_energy}")

fig1 = plots.path_on_landscape(diff_volume, structure, path=best_path)
fig1.show()

fig2 = plots.energy_along_path(path=best_path)
fig2.show()

Fix plots location Update percolation function Fix missing type hints Created Pathway class

v1kko

Looks good to me, I suggested a few small changes, but otherwise I think its good to go, any remaining comments can be converted into issues 🚀

v1kko · 2023-11-14T08:54:57Z

src/gemdat/path.py

+    wrapped_coord = coord % size
+    return wrapped_coord


I think that a function for this single statement is not necessary, and the function can be ommited

Ok, I have removed the function

src/gemdat/path.py

SCiarella · 2023-11-17T08:46:53Z

I have updated the branch as suggested. The remaining comments about API and usage can be discussed in another issue

v1kko · 2023-11-17T09:08:18Z

I have updated the branch as suggested. The remaining comments about API and usage can be discussed in another issue

Perfect, after you create the issue, let's merge this 🚀

SCiarella added 2 commits October 24, 2023 16:45

add site-site path analysis

6c8b2f2

Add identification of best percolating path

18357db

SCiarella mentioned this pull request Oct 26, 2023

Implement energy landscapes and pathways from simulation #144

Closed

4 tasks

stefsmeets changed the title ~~pathways branch~~ Implement energy landscapes and pathways from simulation Oct 30, 2023

stefsmeets marked this pull request as draft October 30, 2023 09:07

stefsmeets self-requested a review October 30, 2023 09:07

SCiarella added 2 commits November 6, 2023 12:05

use Networkx pathfinding

5219840

add path-landscape plots

cddec9e

SCiarella marked this pull request as ready for review November 6, 2023 16:16

v1kko linked an issue Nov 7, 2023 that may be closed by this pull request

Calculate transition energy between sites #92

Closed

v1kko reviewed Nov 7, 2023

View reviewed changes

src/gemdat/transitions.py Outdated Show resolved Hide resolved

Remove percolation mask

d1b6ee6

v1kko marked this pull request as draft November 8, 2023 09:35

v1kko marked this pull request as ready for review November 8, 2023 09:35

SCiarella and others added 5 commits November 8, 2023 14:58

Add paths notebook

3edac8f

Merge remote-tracking branch 'origin/main' into pathways

0cbd144

Fixed plot.density args

a8c8a05

moved plot._paths to plot.plotly._paths

4c9ea04

Add free energy test

5b5840f

v1kko self-requested a review November 8, 2023 15:06

v1kko approved these changes Nov 8, 2023

View reviewed changes

stefsmeets reviewed Nov 8, 2023

View reviewed changes

Add percolation test

e215a2f

stefsmeets requested changes Nov 9, 2023

View reviewed changes

Add to_probabiity method for Volume

8644fc5

Fix plots location Update percolation function Fix missing type hints Created Pathway class

v1kko self-requested a review November 14, 2023 08:51

v1kko reviewed Nov 14, 2023

View reviewed changes

Remove _wrap_pbc function

f7efc88

Fix movements np.array

d910e57

SCiarella mentioned this pull request Nov 17, 2023

Improve API for pathways #186

Closed

SCiarella merged commit c78696d into main Nov 17, 2023
3 checks passed

stefsmeets deleted the pathways branch November 20, 2023 08:47

stefsmeets mentioned this pull request Nov 20, 2023

Percolation tests are slow #187

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Implement energy landscapes and pathways from simulation #178

Implement energy landscapes and pathways from simulation #178

SCiarella commented Oct 24, 2023 •

edited

Loading

stefsmeets commented Oct 30, 2023 •

edited

Loading

SCiarella commented Nov 6, 2023

v1kko left a comment

stefsmeets left a comment

stefsmeets Nov 8, 2023

stefsmeets left a comment

v1kko left a comment

v1kko Nov 14, 2023

SCiarella Nov 17, 2023

SCiarella commented Nov 17, 2023

v1kko commented Nov 17, 2023

Implement energy landscapes and pathways from simulation #178

Implement energy landscapes and pathways from simulation #178

Conversation

SCiarella commented Oct 24, 2023 • edited Loading

stefsmeets commented Oct 30, 2023 • edited Loading

SCiarella commented Nov 6, 2023

v1kko left a comment

Choose a reason for hiding this comment

stefsmeets left a comment

Choose a reason for hiding this comment

stefsmeets Nov 8, 2023

Choose a reason for hiding this comment

stefsmeets left a comment

Choose a reason for hiding this comment

Free energy

Percolation

v1kko left a comment

Choose a reason for hiding this comment

v1kko Nov 14, 2023

Choose a reason for hiding this comment

SCiarella Nov 17, 2023

Choose a reason for hiding this comment

SCiarella commented Nov 17, 2023

v1kko commented Nov 17, 2023

SCiarella commented Oct 24, 2023 •

edited

Loading

stefsmeets commented Oct 30, 2023 •

edited

Loading