-
Notifications
You must be signed in to change notification settings - Fork 24
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Training on Arabic language #115
Comments
Hello @lecidhugo ! You can create the resources for a new language with https://github.com/kermitt2/grisp Once done, you can start an environment for Arabic with entity-fishing, the knowledge base will be automatically build. Then you need to train a ranker and a selector model as described here -> https://nerd.readthedocs.io/en/latest/train.html#training-with-wikipedia Loading the You don't need to create embeddings if I remember well, it should work without them. However it improves a bit the disambiguation. This is also quite time consuming (it should be half day for Arabic given the number of articles). There are 1,080,907 articles in Arabic, so it's a pretty big number, it should be doable and provide decent results. |
Thank you very much for your kind reply! |
Note that Arabic is now supported by default, with already trained models and KB resources available, see the documentation. |
Hello,
Is there any document or guide on how to train on Arabic ?
Is this possible ? if yes what are the requirements ?
Thanks in advance,
The text was updated successfully, but these errors were encountered: