A small collection of replies that were generated by ChatGPT and downloaded from the web. Used to analyse word frequency distribution in AI-generated texts.
ChatGPT detector demo: https://textvisualization.app/chatgpt-detector/
Details, findings, explanations in this post: The Intricate Tapestry of ChatGPT Posts: Why LLM overuses some words at the expense of others?
Word frequency data for Project Gutenberg books is collected from Wikipedia, project-gutenberg list