A lightweight and efficient PHP library tailored for developers working on Natural Language Processing (NLP) tasks in Brazilian Portuguese.

yeremi/stopwords

Stopwords

A PHP Library for Managing Stopwords

Stopwords is a lightweight PHP library that simplifies stopword handling in Natural Language Processing (NLP) tasks. By default, it ships with a list of 508 Portuguese stopwords, grouped into the following categories:

  • Pronouns
  • Numerals
  • Temporal terms
  • Locative terms
  • Prepositions
  • Conjunctions
  • Articles
  • Adverbs
  • Interjections
  • Contractions
  • Miscellaneous terms

The library is highly flexible and can be extended to include stopword dictionaries for other languages, making it a versatile tool for a wide range of text processing needs.

Use Stopwords to remove low-information words before text analysis, which can streamline text processing, support SEO work, and improve accuracy in text-based applications.

Table of Contents

  1. Key Features
  2. Installation
  3. How to use
    1. Basic Examples
    2. Creating Custom Dictionary
    3. Migration v1 to v2
  4. Contributing
  5. License

Key Features

  1. NLP Optimization: Efficiently filter out common stopwords, saving processing time and enhancing text analysis.

  2. SEO Utility: Strip stopwords from text before it is indexed for search, reducing irrelevant data in the index.

  3. Customizable: Easily extend or modify stopword lists to suit specific project requirements.

  4. Composer-Ready: Quickly integrate the library into your project using Composer.
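To make the filtering idea from the first feature concrete, here is a minimal sketch in plain PHP. It does not use this library's API: the function name `removeStopwords` and the short word list are hypothetical, chosen only to illustrate the technique. It assumes the `mbstring` extension is available.

```php
<?php
declare(strict_types=1);

/**
 * Illustrative only: removes stopwords from a text using plain PHP.
 * This is NOT the yeremi/stopwords API; the function name and the
 * word list used below are hypothetical.
 */
function removeStopwords(string $text, array $stopwords): string
{
    // Lowercase the list and flip it into a hash map for fast lookups.
    $lookup = array_flip(array_map(
        fn (string $word): string => mb_strtolower($word, 'UTF-8'),
        $stopwords
    ));

    // Split on any run of characters that are not letters or digits (Unicode-aware).
    $tokens = preg_split('/[^\p{L}\p{N}]+/u', $text, -1, PREG_SPLIT_NO_EMPTY) ?: [];

    // Keep only tokens whose lowercase form is not in the stopword map.
    $kept = array_filter(
        $tokens,
        fn (string $token): bool => !isset($lookup[mb_strtolower($token, 'UTF-8')])
    );

    return implode(' ', $kept);
}

// A handful of Portuguese stopwords, chosen for this example only.
$stopwords = ['o', 'a', 'de', 'em', 'para', 'e', 'que'];

echo removeStopwords('O gato de Maria dorme em casa', $stopwords), PHP_EOL;
// prints "gato Maria dorme casa"
```

The comparison is case-insensitive, while the surviving words keep their original casing. The library's bundled list and class interface may differ from this sketch.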

Installation

Add the library to your project using Composer:

composer require yeremi/stopwords

Contributing

Contributions are welcome! If you encounter any issues or have suggestions for improvement, feel free to open an issue or submit a pull request.

License

Stopwords is licensed under the MIT License. See the LICENSE file for details.