Skip to content

A utility for crawling websites and building frequency lists of words

License

Notifications You must be signed in to change notification settings

calebwin/frequent

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

11 Commits
 
 
 
 
 
 

Repository files navigation

Frequent

frequent is a utility for crawling websites and building word frequency list. Mainly made because I wanted to be able to find top n most common words on different websites, but I imagine there might be more useful applications. Or not.

import frequent

# get most frequent words from the w3schools website
# limit crawl depth to 25
word_frequencies = frequent.word_frequencies("https://www.w3schools.com", 25)

# get the top 50 words
top_words = website_word_frequencies.most_common(50)

# print the top 50 most frequent words
print(top_words)

About

A utility for crawling websites and building frequency lists of words

Topics

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages