Select link analysis #260

janzill · 2021-06-12T01:40:07Z

Select link analysis via on-disk path files. This is slower than an implementation during assignment, however it has the advantage that it works for any link(s) as long as path files have been stored during assignment.

The main reason to go down this path is that path file saving is now available and therefore this implementation is a lot less work.

janzill · 2021-06-12T07:39:27Z

@pedrocamargo basic functionality for MSA and FW is here. I need to

turn my dev notebook stuff into proper tests
handle matrices with several cores, atm it's all 2d
implement the calculation CFW and BFW contributions per iteration
implement turning network ids into compressed graph ids, atm you need to provide the latter

Pipfile

aequilibrae/paths/assignment_paths.py

pedrocamargo · 2021-06-13T03:00:56Z

aequilibrae/paths/select_link.py

+#  - some assertions as per constructor comments
+#  - demand matrix operations do not take cores into account (i.e. we assume we have a 2d demand matrix per class)
+#  - tests
+


We could add a TODO here for later that is to have the code to find the matrices that were used from the assignment report.

yes I'd like that. at the moment we're passing in the demand, but doing this automatically would be much better

pedrocamargo · 2021-06-13T03:02:47Z

aequilibrae/paths/select_link.py

+        }
+        return select_link_matrices
+
+    def run_select_link_analysis(self, link_ids: List[int]) -> None:


Should the annotation be " --> AequilibraeMatrix"?

it's a dict of dict (mode and link id) of np.array, but should probably make it an Aequlibraematrix instead of np.array

aequilibrae/paths/select_link.py

janzill · 2021-06-17T02:01:12Z

Wouldn't it make much more sense to parallelize it over origins? I guess it would allow us to use a higher number of cores. Wouldn't it?

Sure, it's just not as straight forward with having to read in files, and the resulting path-link arrays and indexes not being fixed length. I didn't have much time at the end of the day (middle of the night really :) ) so I decided to go for the quickest win

pedrocamargo · 2021-06-17T06:27:52Z

Wouldn't it make much more sense to parallelize it over origins? I guess it would allow us to use a higher number of cores. Wouldn't it?

Sure, it's just not as straight forward with having to read in files, and the resulting path-link arrays and indexes not being fixed length. I didn't have much time at the end of the day (middle of the night really :) ) so I decided to go for the quickest win

Do you mean that the reading would have to happen in each thread separately?

janzill · 2022-12-02T03:20:47Z

I think this could be handled much more efficiently with duckdb - just need to optimise the on-disk path file storage (currently many small parquet files because it is set up to save one per origin and traffic class and iteration - could be as simple as combining these) and then merge demand and query. I won't have time to do this anytime soon but will come back to it at some point.

pedrocamargo · 2023-02-13T11:20:49Z

@jamiecook , I had forgotten about this PR Jan had prepared a long time ago. Let's discuss this. We either incorporate it scrap it

janzill · 2023-02-13T23:09:25Z

I'm happy for this to be deleted given that there is a proper implementation during assignment now. This was meant as a quick fix to enable the functionality without spending too much time on it. It's also old - if I'd do this today I'd just process the parquet files with duckdb and most of the code above would go away.

Art-Ev · 2023-11-07T13:16:37Z

@pedrocamargo, don't know what has been decided for this PR but when calibrating a model, a solution to do link analysis without re-running a full assignment can save a lot of time (with a menu like flow bundle in VISUM) even with the needed storage. Being able to do both (save for later or direct link analysis during assignment like Jake commits) would be awesome!

If we keep both solutions, I would be happy to try to work on the use of path_file. But I don't think I'll be really relevant about the creation of path_file itslef (structure, duckdb or feather, ...). Maybe somebody could have a brillant idea for a quick-win based on this PR for path file savinf ? (@janzill, @Jake-Moss or maybe @djfrancesco ?)

janzill · 2023-11-07T19:37:40Z

@pedrocamargo, don't know what has been decided for this PR but when calibrating a model, a solution to do link analysis without re-running a full assignment can save a lot of time (with a menu like flow bundle in VISUM) even with the needed storage. Being able to do both (save for later or direct link analysis during assignment like Jake commits) would be awesome!

If we keep both solutions, I would be happy to try to work on the use of path_file. But I don't think I'll be really relevant about the creation of path_file itslef (structure, duckdb or feather, ...). Maybe somebody could have a brillant idea for a quick-win based on this PR for path file savinf ? (@janzill, @Jake-Moss or maybe @djfrancesco ?)

I think we already have all the pieces for this, but there are some improvements that would help make this more useable.

Changes to path files. Currently, these are in feather format by default and we partition over origins by including the origin id in the file name. I suggest we switch to using parquet and hive-partitioning files over origins (i.e., origins are not part of the files or file names themselves but stored as the folder name where all data in a folder is understood to refer to one origin). Duckdb supports this if we want to go down that path.
I never implemented path weights for CFW and BFW, but I think the logic to generate these might exist already in extract turning movements #358

quick-refactoring

3f71a92

janzill added the WIP Work in Progress label Jun 12, 2021

Jan Zill added 10 commits June 12, 2021 13:29

select link refactoring

b6db762

sl matrix per requested link

4865136

specify key and val type for typing dict

6187ff5

removes unused variable

9d04522

fixes name and id confusion in class identifiers

7a29966

fixes name and id confusion in class identifiers

4471b54

only works with simplified graph ids for now

1cdc8bd

fixes

a28f3d6

select correct destinations for indexes

28dfa35

logging

75a9c2f

Jan Zill added 4 commits June 13, 2021 12:13

path file parsing tests

6641b2b

moves path file index lookup manipulation out of inner link loop

fec41b3

path file parsing and od path generation tests

e4e60d7

tests for path extraction from path files

0d1e4c3

pedrocamargo reviewed Jun 13, 2021

View reviewed changes

Jan Zill added 12 commits June 13, 2021 13:13

updates some return types

af0a174

adds select link tests

4d112ae

add Pedro's suggestion for doc

de1cabc

account for matrix cores in select link

6a6cad3

skip zero demand origins in sl

bb9051a

skip zero demand origins in sl

4e972de

cater for 2d demand matrices, which shouldn't exist but somehow do

c0678de

cater for 2d demand matrices, which shouldn't exist but somehow do

46baabf

basic logic bug, duh

8b08900

adds cythonised select link, parallelisation not done yet

f8e53fe

clean up

1042a44

prange for select link array

2d32beb

pedrocamargo deleted the branch master February 24, 2024 06:31

pedrocamargo closed this Feb 24, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Select link analysis #260

Select link analysis #260

janzill commented Jun 12, 2021

janzill commented Jun 12, 2021

pedrocamargo Jun 13, 2021

janzill Jun 13, 2021

pedrocamargo Jun 13, 2021

janzill Jun 13, 2021

janzill commented Jun 17, 2021 •

edited

Loading

pedrocamargo commented Jun 17, 2021

janzill commented Dec 2, 2022

pedrocamargo commented Feb 13, 2023

janzill commented Feb 13, 2023

Art-Ev commented Nov 7, 2023

janzill commented Nov 7, 2023

Select link analysis #260

Select link analysis #260

Conversation

janzill commented Jun 12, 2021

janzill commented Jun 12, 2021

pedrocamargo Jun 13, 2021

Choose a reason for hiding this comment

janzill Jun 13, 2021

Choose a reason for hiding this comment

pedrocamargo Jun 13, 2021

Choose a reason for hiding this comment

janzill Jun 13, 2021

Choose a reason for hiding this comment

janzill commented Jun 17, 2021 • edited Loading

pedrocamargo commented Jun 17, 2021

janzill commented Dec 2, 2022

pedrocamargo commented Feb 13, 2023

janzill commented Feb 13, 2023

Art-Ev commented Nov 7, 2023

janzill commented Nov 7, 2023

janzill commented Jun 17, 2021 •

edited

Loading