Releases: dlite-tools/NLPiper

0.3.1

14 Apr 14:36
b602d46

📝 Changelog:

  • 🛠️ Improvements:

    • Support Python 3.7
  • 📚 Documentation:

    • Improve general documentation, add new maintainer

0.3.0

06 Apr 13:55
29636ae

📝 Changelog:

  • ✨ Features:

    • Add spaCy and TorchText embeddings transformers
  • 📚 Documentation:

    • Improve documentation and migrate links to new repository host

0.2.1

04 Mar 12:15
e0d6aeb

📝 Changelog:

  • 🛠️ Improvements:

    • Allow extra attributes to be set on Document and Token
  • ♻️ Refactoring:

    • Move BaseTransform out of init for better integration with IDEs
  • 📚 Documentation:

    • Improve the cleaners' documentation for better Sphinx compatibility and expand the explanation of transforms

0.2.0

01 Mar 16:30
c80517a

📝 Changelog:

  • ✨ Features:

    • Add transformer for embeddings
    • Add token-wise NER and embeddings
    • Add document-wise embeddings
    • Improve Stanza integration, allowing lemma and NER to be extracted directly to the token object
  • 🐛 Bug:

    • Fix bug when importing nlpiper
  • 📚 Documentation:

    • Update and improve documentation

0.1.0

17 Feb 16:02
b1b9fff

📝 Changelog:

  • ✨ Features:
    • Create a transformation pipeline for a document
    • Rollback transformation steps applied to a document
    • Add document Cleaners: CleanURL, CleanEmail, CleanNumber, CleanPunctuation, CleanEOF, CleanMarkup, CleanAccents
    • Add document Tokenizers: BasicTokenizer, MosesTokenizer, StanzaTokenizer
    • Add document Normalizers: CaseTokens, RemovePunctuation, RemoveStopWords, VocabularyFilter, SpellCheck, Stemmer
    • Add Embeddings: GensimEmbeddings
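
To illustrate the two core ideas of the 0.1.0 release (a pipeline of transformations applied to a document, and rolling back applied steps), here is a minimal standalone sketch. It does not use NLPiper's API: the `Doc` class, its `apply` and `rollback` methods, and the `clean_url`, `clean_number`, and `run_pipeline` functions are all hypothetical names made up for this example, and the regular expressions are deliberately simplistic.

```python
import re
from dataclasses import dataclass, field
from typing import Callable, List

# Hypothetical types for illustration only; not NLPiper's classes.
Step = Callable[[str], str]


@dataclass
class Doc:
    original: str
    cleaned: str = ""
    # Snapshots of `cleaned` taken before each applied step.
    history: List[str] = field(default_factory=list)

    def __post_init__(self) -> None:
        self.cleaned = self.original

    def apply(self, step: Step) -> None:
        """Save the current text, then apply one transformation."""
        self.history.append(self.cleaned)
        self.cleaned = step(self.cleaned)

    def rollback(self, steps: int = 1) -> None:
        """Undo up to `steps` of the most recently applied steps."""
        for _ in range(min(steps, len(self.history))):
            self.cleaned = self.history.pop()


def clean_url(text: str) -> str:
    """Remove http(s) URLs (simplified pattern)."""
    return re.sub(r"https?://\S+", "", text)


def clean_number(text: str) -> str:
    """Remove runs of digits."""
    return re.sub(r"\d+", "", text)


def run_pipeline(doc: Doc, steps: List[Step]) -> Doc:
    """Apply each step in order, recording a snapshot before each one."""
    for step in steps:
        doc.apply(step)
    return doc


doc = run_pipeline(Doc("Visit https://example.com in 2021"), [clean_url, clean_number])
doc.rollback()  # undoes clean_number only; the URL stays removed
```

Keeping a snapshot per step makes rollback trivial at the cost of memory; this is one possible design for the sketch, not a description of how NLPiper implements it.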