small performance improvement for streets API #2988

misaugstad · 2022-08-09T17:07:37Z

Brief description of problem/feature

I was creating some documentation for our streets API, and I noticed a pretty obvious and easy to correct inefficiency in how we are doing it. The relevant parts of the algorithm basically look like...

for every street s:
    for every clustered attribute c:
        create a jts Point object from the c's latitude and longitude, Point(c)
        if the Point(c) is within 10 meters of s:
            increase cluster count for s

But we could easily just create the Point objects for all of the attributes just once, instead of for every street. I imagine that this could have a pretty big overhead if we are using this on a large area. It should look like this...

for every clustered attribute c:
    create a jts Point object from the c's latitude and longitude, Point(c)
for every street s:
    for every Point(c):
        if Point(c) is within 10 meters of s:
            increase cluster count for s

I can only assume that we do this for the neighborhoods API as well, but it shouldn't be as big of a deal there since a city generally has 10's of neighborhoods but 1000's of streets.

FYI I'm looking at the computeAccessScoresForStreets() function in ProjectSidewalkAPIController.scala.

The text was updated successfully, but these errors were encountered:

jonfroehlich · 2022-08-10T15:59:48Z

I'm all for performance improvements, and I love how your brain is constantly looking for ways to do things better! 🧠

davphan · 2023-10-25T01:38:16Z

Can every label only be assigned to one street? If so, can we do something like:

for every clustered attribute c:
    create a jts Point object from the c's latitude and longitude, Point(c)
for every Point(c):
    for every street s:
        if Point(c) is within 10 meters of s:
            increase cluster count for s
            break

to further increase performance? Or do we need to loop through every single point for every street?

misaugstad · 2023-10-25T19:00:50Z

Although we could assign each label to a single street (in fact, there is a street_edge_id column in the label table), I think that for this situation, it makes more sense to include labels on multiple streets. This primarily is about intersections, where curb ramps (or lack thereof) impacts all streets at the intersection, so it makes sense to count them for multiple streets.

misaugstad added Easy Fix Potential Intern Assignment API labels Aug 9, 2022

misaugstad assigned davphan Oct 19, 2023

davphan mentioned this issue Oct 25, 2023

Implemented performance improvement to access scores API call. #3415

Merged

2 tasks

misaugstad closed this as completed in #3415 Nov 1, 2023

misaugstad mentioned this issue Nov 1, 2023

v7.16.1 #3418

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

small performance improvement for streets API #2988

small performance improvement for streets API #2988

misaugstad commented Aug 9, 2022

jonfroehlich commented Aug 10, 2022

davphan commented Oct 25, 2023

misaugstad commented Oct 25, 2023

small performance improvement for streets API #2988

small performance improvement for streets API #2988

Comments

misaugstad commented Aug 9, 2022

Brief description of problem/feature

jonfroehlich commented Aug 10, 2022

davphan commented Oct 25, 2023

misaugstad commented Oct 25, 2023