duden words.rb uses the list in words.txt to download every duden html page from https://www.duden.de/rechtschreibung/* to data/. Warning: Don't do it all at once :)