Comparator to flag invalid primary tag combinations #102

bkowshik · 2017-03-14T09:41:56Z

Objective

Fixes: Features on OpenStreetMap with more than one primary tag #94

Amazing stats @bkowshik, this directly gives the probability of any tag combination on OSM. With these 286 invalid and 330 possibly erroneous tag combinations, can we already start flagging them?

Happy that we parse a csv file instead of a json file, which makes the data easy on the 👀

Next actions

Would love to get some 👀 @amishas157
Merge to master

cc: @planemad @geohacker

geohacker · 2017-03-14T09:55:55Z

@bkowshik this uses a csv file for the tag combination scores. How do you plan to keep this maintainable? Should we use the taginfo API directly?

bkowshik · 2017-03-14T10:30:17Z

At the moment, we have a script to generate the csv file here:

https://github.com/mapbox/osm-compare/blob/master/scripts/primary-map-features.js

Options

@geohacker, I could think of the following 3 options:

1. csv file in the repository
   - TagInfo is not queried for each feature
   - Simple to build and easy to understand
   - Needs to be updated manually using the script above
2. csv file created during deploy
   - TagInfo is not queried for each feature
   - Latest data is queried from TagInfo during deploy
   - Data is only as fresh as the most recent deploy
   - Makes the deploy a little slower
3. Query TagInfo
   - TagInfo is queried for each feature, which is a lot of features!!!
   - We have the latest data and no manual maintenance

Unless you see a problem here, I am happy with the current setup (option 1). We will get a better sense of what is needed once we deploy and 👀 the results for a few days.
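
For readers following along, here is a minimal sketch of what option 1 amounts to: the csv is parsed once into a lookup table, and each feature is then checked against it without touching TagInfo. The column names (`tag_1`, `tag_2`, `count`), the sample rows, and the function names are all hypothetical and not taken from the repository; the sketch also splits lines by hand rather than using the csv module the pull request adds. The "count less than 1" threshold follows the commit message in this pull request.

```javascript
// Hypothetical csv layout: one row per pair of primary tag keys with a usage count.
// Column names and rows are made up for illustration.
const csvText = [
  'tag_1,tag_2,count',
  'building,amenity,1500000',
  'highway,building,0',
  'natural,highway,0'
].join('\n');

// Build an order-independent key so (a, b) and (b, a) map to the same entry.
function pairKey(a, b) {
  return [a, b].sort().join('|');
}

// Parse the csv once, at startup, into a Map of pair key -> count.
function loadCounts(text) {
  const counts = new Map();
  const rows = text.trim().split('\n').slice(1); // skip the header row
  for (const row of rows) {
    const [tag1, tag2, count] = row.split(',');
    counts.set(pairKey(tag1, tag2), Number(count));
  }
  return counts;
}

// Return every pair of primary keys present on the feature whose count is less than 1.
// Pairs missing from the table are left unflagged (an assumption of this sketch).
function invalidCombinations(properties, primaryKeys, counts) {
  const present = primaryKeys.filter((key) =>
    Object.prototype.hasOwnProperty.call(properties, key)
  );
  const flagged = [];
  for (let i = 0; i < present.length; i++) {
    for (let j = i + 1; j < present.length; j++) {
      const count = counts.get(pairKey(present[i], present[j]));
      if (count !== undefined && count < 1) {
        flagged.push([present[i], present[j]]);
      }
    }
  }
  return flagged;
}

const counts = loadCounts(csvText);
const primaryKeys = ['building', 'amenity', 'highway', 'natural'];
const flagged = invalidCombinations(
  { highway: 'residential', building: 'yes' },
  primaryKeys,
  counts
);
```

Because the table lives in memory, the per-feature cost is a handful of Map lookups, which is the main advantage options 1 and 2 have over querying TagInfo for every feature.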

planemad · 2017-03-14T10:42:53Z

👍 the current CSV is good till we see some results from this. We can think about scaling with taginfo if this works as expected.

amishas157 · 2017-03-14T12:22:25Z

@bkowshik This looks good to go. Regarding keeping the taginfo data updated, I think running the scripts manually is fine, since the taginfo data should not change much from day to day in percentage terms. But as we scale our comparators based on taginfo, we can set up cron jobs to run these scripts on a monthly basis or so.
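
One way to support a monthly refresh is a small staleness check that a scheduled job could call before re-running the generation script. This is only a sketch: the 30-day default and the function names are assumptions, and file modification time is a rough proxy, since a fresh git checkout resets it.

```javascript
const fs = require('fs');

const MS_PER_DAY = 24 * 60 * 60 * 1000;

// True when the file was last modified more than maxAgeDays before `now`.
// Both timestamps are in milliseconds since the epoch.
function isStale(mtimeMs, now, maxAgeDays) {
  return now - mtimeMs > maxAgeDays * MS_PER_DAY;
}

// Hypothetical helper: csvPath is supplied by the caller, not a path from the repository.
function csvNeedsRefresh(csvPath, maxAgeDays = 30) {
  return isStale(fs.statSync(csvPath).mtimeMs, Date.now(), maxAgeDays);
}
```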

bkowshik · 2017-03-16T04:22:41Z

Published to npm as version: 4.14.0

geohacker · 2017-03-16T04:45:33Z

🎉

Bhargav Kowshik added 3 commits March 14, 2017 15:01

Flag tag combinations with count less than 1

9b7330b

Add invalid_tag_combination comparator to index.js

f18738e

Add description about comparator on comparator README

71121c3

bkowshik requested a review from amishas157 March 14, 2017 09:41

bkowshik mentioned this pull request Mar 14, 2017

Features on OpenStreetMap with more than one primary tag #94

Closed

Add csv module to package.json

1f8980f

bkowshik merged commit 63e93aa into master Mar 16, 2017

bkowshik deleted the invalid-tag-combinations branch March 16, 2017 04:20
