Hazm 0.8
Release notes:
- Add WordEmbedding (download the pre-trained fastText model from here)
- Add SentenceEmbedding (download the pre-trained model from here)
- Add Documentation webpage (link)
- Improve normalizer, informal normalizer, and tokenizer
- Add Degarbayan and MirasText corpus readers
What's Changed
- fixed MAGHSURAH Y bug in normalizer by @mavahedinia in #116
- change list to set in stopwords_list method to remove duplicate stop … by @Azdy-dev in #175
- Add Degarbayan interface by @maanijou in #176
- fix endless loop in python3 by @mohamad-qodosi in #186
- Update README.md by @edalatfard in #187
- Fix self.words in WordTokenizer by @SinRas in #190
- Fix some tokenization issues by @behnam-sa in #199
- Modifying the extra space and newline removal patterns by @asdoost in #200
- improvement of InformalNormalizer by @riasati in #214
- add some rules to InformalNormalizer by @riasati in #215
- Embedding by @E-Ghafour in #229
- Embedding by @imani in #230
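As a rough illustration of the list-to-set change in #175, the sketch below shows how building a set drops duplicate stop words. This is not Hazm's actual code: the function name `unique_stopwords` and the sample words are hypothetical and exist only for this example.

```python
def unique_stopwords(lines):
    """Return the distinct stop words found in an iterable of text lines.

    Hypothetical helper for illustration; not part of Hazm.
    """
    # Strip surrounding whitespace, skip blank lines, and let the set
    # keep a single copy of each word.
    return {line.strip() for line in lines if line.strip()}


# Sample input with a repeated word and a trailing-whitespace variant.
raw_lines = ["و", "در", "و", "در \n", ""]
stopwords = unique_stopwords(raw_lines)

# Sets are unordered, so sort before printing for a stable display.
print(sorted(stopwords))
```

A set also makes membership checks such as `word in stopwords` fast, at the cost of losing the original ordering of the words.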
New Contributors
- @mavahedinia made their first contribution in #116
- @Azdy-dev made their first contribution in #175
- @maanijou made their first contribution in #176
- @mohamad-qodosi made their first contribution in #186
- @edalatfard made their first contribution in #187
- @SinRas made their first contribution in #190
- @behnam-sa made their first contribution in #199
- @asdoost made their first contribution in #200
- @riasati made their first contribution in #214
- @E-Ghafour made their first contribution in #229
- @imani made their first contribution in #230
Full Changelog: v0.7...v0.8.2