-
-
Notifications
You must be signed in to change notification settings - Fork 28
Commit
This commit does not belong to any branch on this repository, and may belong to a fork outside of the repository.
feat(nl): improve support of NL addresses (#126)
* Examples of correct addresses in NL. * Recognise 6 position postcodes in NL, addressing #127. * Check for -plein as street type. * Adjustments for typo 'plain' instead of 'plein' in libpostal resources. Addressing #128. * Put 'test' back in as per #126 (comment). * Prevent 'St' as a suffix in NL streetnames. Add Jr and Sr. Addressing #126. * First of 4 digits cannot be 0. No 'SA','SD' or 'SS'. Addressing #127. * Adjective 'korte' no longer a personal title. Addressing #129. * Tests and fix for typo 'BurgeRmeester'. Addressing #130. * Tests, code, and configs for '-daal', '-burg', '-baan'. Addressing #131 and #133. * Config for '-burg' not separable. Addressing #131. * Remove test for NL postal codes WITH spaces. Regex ok. #134 * Fix formatting spaces at the end of the line. * Also use directionals to parse Dutch street addresses. #137 * Add Dutch titulature for street name recognition. (#130) * Add 'plantsoen' as a street type. (#128). * Remove test for NL locality with stopword.
- Loading branch information
1 parent
e92f96c
commit a1af69b
Showing
13 changed files
with
125 additions
and
5 deletions.
There are no files selected for viewing
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -1 +1 @@ | ||
{"zipex":"1234 AB,2490 AA","key":"NL","name":"NETHERLANDS","fmt":"%O%n%N%n%A%n%Z %C","require":"ACZ","zip":"\\d{4} ?[A-Z]{2}","posturl":"http://www.postnl.nl/voorthuis/","id":"data/NL"} | ||
{"zipex":"1234 AB,2490 AA","key":"NL","name":"NETHERLANDS","fmt":"%O%n%N%n%A%n%Z %C","require":"ACZ","zip":"[1-9][0-9]{3} ?(?!SA|SD|SS)[A-Z]{2}","posturl":"http://www.postnl.nl/voorthuis/","id":"data/NL"} |
2 changes: 2 additions & 0 deletions
2
resources/pelias/dictionaries/libpostal/af/personal_titles.txt
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -0,0 +1,2 @@ | ||
!kort|k | ||
!korte|kte |
1 change: 1 addition & 0 deletions
1
resources/pelias/dictionaries/libpostal/nl/concatenated_suffixes_inseparable.txt
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -0,0 +1 @@ | ||
burg|brg|bg |
7 changes: 6 additions & 1 deletion
7
resources/pelias/dictionaries/libpostal/nl/concatenated_suffixes_separable.txt
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -1 +1,6 @@ | ||
dijk | ||
baan | ||
daal | ||
dijk | ||
!plain|pln. | ||
plein|pln | ||
plantsoen|plnts |
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -0,0 +1,4 @@ | ||
noordzijde|nz|n.z.|n.z|n z | ||
oostzijde|oz|o.z.|o.z|o z | ||
westzijde|wz|w.z.|w.z|w z | ||
zuidzijde|zz|z.z.|z.z|z z |
4 changes: 4 additions & 0 deletions
4
resources/pelias/dictionaries/libpostal/nl/personal_suffixes.txt
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -0,0 +1,4 @@ | ||
junior|jr|jnr | ||
senior|sr|snr | ||
# st is not a personal suffix in Dutch | ||
!st |
43 changes: 43 additions & 0 deletions
43
resources/pelias/dictionaries/libpostal/nl/personal_titles.txt
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -0,0 +1,43 @@ | ||
!kort|k | ||
!korte|kte | ||
aalmoezenier | ||
admiraal|adm | ||
bisschop|biss | ||
#typo in LibPostal resource | ||
!burgermeester|burg|bgm | ||
burgemeester|burg|bgm | ||
commissaris|comm | ||
deken|dkn | ||
directeur|dir | ||
frater|fr | ||
graaf | ||
gravin | ||
goeverneur|goev | ||
gouverneur|gouv | ||
heer|hr | ||
jonker|jkr | ||
juffrouw|juffr | ||
hertog|htg | ||
kanunnik|kan | ||
kapelaan|kap | ||
kapitein|kapt | ||
keizer | ||
luitenant generaal|lt gen | ||
!mevrouw|mevr | ||
mevrouw|mevr|mw | ||
madame|mad | ||
majoor|maj | ||
notaris|not | ||
overste|ov | ||
pater|ptr | ||
prelaat|prlt | ||
rector|rect | ||
schepen|sch | ||
schout|sch | ||
schout bij nacht|sbn | ||
secretaris|secr | ||
sekretaris|sekr | ||
veldmaarschalk|veldm | ||
vicaris|vic | ||
wethouder|weth | ||
zusters|zr |
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters