Skip to content

Commit

Permalink
Deep dive into the similarity code
Browse files Browse the repository at this point in the history
Primarily to understand why it is worse than the DBSCAN trip clustering code.

As you may recall, the results for DBSCAN were pretty respectable.
Most trips were in a cluster, and the median cluster : trip-in-cluster ratio
was below 40% for all datasets.
#28 (comment)

Why is similarity so much worse than the DBSCAN-based clustering?
Should we switch to DBSCAN instead (horrifying proposition given the tight deadline)?

See the notebook conclusion to find out!

@corinne-hcr, this is more on the lines of what I expected you to do back when
you were evaluating the first round/common trips, and definitely when we were
getting poor results with the clustering.

+ change the original DBSCAN notebook to have the modified viz code
  • Loading branch information
shankari committed Jul 23, 2021
1 parent 952c476 commit 18144ef
Show file tree
Hide file tree
Showing 2 changed files with 979 additions and 43 deletions.
Loading

0 comments on commit 18144ef

Please sign in to comment.