You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Some Canadian (especially french) postal addresses are not correctly parsed. One of the reason is the lack of support for some of the french vocabulary (like 'rue' for street or 'boul' for boulevard). These are quite easy to add to the vocabulary but even then, some addresses are just not parsed correctly. Here are a few examples after I've added support for some of the french keywords.
609 boulevard des Bois Francs Sud, Victoriaville, QC
{'word': '609', 'tag': 27}
{'word': 'boulevard', 'tag': 30}
{'word': 'qc', 'tag': 14}
In this example, "des Bois Francs" is the full street name, "Sud" is the direction, "Victoriaville" is the city and finally "QC" is the state (tagges as abbreviation above)
181 rue Principale, Gatineau, QC
{'word': '181', 'tag': 27}
{'word': 'street', 'tag': 30}
{'word': 'qc', 'tag': 14}
"Principale" is the street name, "Gatineau" the city and "qc" the state/province (tagged as abbreviation above)
1016 Bouvier, Québec, QC
{'word': 'qc', 'tag': 14}
Here the street name "Bouvier" is not prefixed by anything and 'qc' is tagged as an abbreviation not a state
These are only a few examples but it seems like street names that are not prefixed are a problem as well as street names composed of multiple words. I can help adding support for french keywords in some areas if required. Also, the fact that the order of some words is different in french vs english is likely to cause issues as "Main street" would become "rue Main" in french and the code seem to look for the different addresses components in a given order.
Is there anything that can be done to better support french postal addresses like these above?
The text was updated successfully, but these errors were encountered:
Some Canadian (especially french) postal addresses are not correctly parsed. One of the reason is the lack of support for some of the french vocabulary (like 'rue' for street or 'boul' for boulevard). These are quite easy to add to the vocabulary but even then, some addresses are just not parsed correctly. Here are a few examples after I've added support for some of the french keywords.
609 boulevard des Bois Francs Sud, Victoriaville, QC
{'word': '609', 'tag': 27}
{'word': 'boulevard', 'tag': 30}
{'word': 'qc', 'tag': 14}
In this example, "des Bois Francs" is the full street name, "Sud" is the direction, "Victoriaville" is the city and finally "QC" is the state (tagges as abbreviation above)
181 rue Principale, Gatineau, QC
{'word': '181', 'tag': 27}
{'word': 'street', 'tag': 30}
{'word': 'qc', 'tag': 14}
"Principale" is the street name, "Gatineau" the city and "qc" the state/province (tagged as abbreviation above)
1016 Bouvier, Québec, QC
{'word': 'qc', 'tag': 14}
Here the street name "Bouvier" is not prefixed by anything and 'qc' is tagged as an abbreviation not a state
These are only a few examples but it seems like street names that are not prefixed are a problem as well as street names composed of multiple words. I can help adding support for french keywords in some areas if required. Also, the fact that the order of some words is different in french vs english is likely to cause issues as "Main street" would become "rue Main" in french and the code seem to look for the different addresses components in a given order.
Is there anything that can be done to better support french postal addresses like these above?
The text was updated successfully, but these errors were encountered: