Hazm 0.9.4
- Added
join_abbreviations
to skip abbrs tokenizing using ParsiNorm's abbreviation lists. #216 @optimopium @sir-kokabi. - Added
MizanReader
to read Mizan corpus. @sir-kokabi. - Added
NaabReader
to read Naab corpus. @sir-kokabi. - Added
NerReader
to read NER corpus. @sir-kokabi. - Improved
Normalizer
by adding support for normalizing words with the suffix 'هایی'. @sir-kokabi. - Fixed #298: Incompatibility issues with numpy. @mhdi707 @sir-kokabi
Full Changelog: v0.9.3...v0.9.4