Skip to content

Word frequencies for all the texts (~10,600) in `as is` and `normalized versions`

Notifications You must be signed in to change notification settings

OpenArabic/Collection_WordFrequencies_Reports

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

5 Commits
 
 
 
 
 
 
 
 

Repository files navigation

Collection_WordFrequencies_Reports

Word frequencies for all the texts (~10,600) in as is and normalized versions

  • 1gram : forlder with word frequencies for each text (as is—no normalization of orthography);

  • 1gram_NRM : folder with word frequencies for each text (normalizedalifs simplified into one form; carriers of medial and finals hamzaŧs removed);

  • 1gram_NRM_Lengths : file with lengths of each text in words

About

Word frequencies for all the texts (~10,600) in `as is` and `normalized versions`

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published