Releases · londogard/londogard-nlp-toolkit

31 Aug 04:38

Lundez

v1.2.0-BETA2

955c97a

1.2.0-BETA2 Pre-release

Pre-release

🚀 MultiClass Logistic Regression
🚀 OneHotEncoder

Assets 2

29 Aug 07:22

Lundez

v1.2.0-BETA

df133d6

1.2.0-BETA Pre-release

Pre-release

🚀 TransfomersPipeline

✅ PyTorch through JIT-saved TorchScript models
✅ ONNX Models directly through the hub, e.g. TokenClassificationPipeline.create("optimum/bert-base-NER")
Where optimum/bert-base-NER is a model on the HuggingFace Hub
✅ Load both PyTorch (TorchScript) & ONNX model through local path
ClassificationPipeline and TokenClassificationPipeline exists

See the following test for some examples on how to use it

Assets 2

06 Feb 15:26

Lundez

v1.1.1

b15b8db

1.1.1 Latest

Latest

Bumping DJL to 1.5.0 as it's now released for real.

Full Changelog: v1.1.0...v1.1.1

Assets 2

03 Jan 18:54

Lundez

v1.1.0

e988168

1.1.0

What's Changed

feat: BagOfWords, TfIdf & BM-25 by @Lundez in #42
feat: Adding new CLF by @Lundez in #46
feat: Cooccurence keywords by @Lundez in #77
perf: LightWordEmbeddings with more efficient caching by @Lundez in #81
docs: Adding doc-generation and deployment by @Lundez in #82
chore: Bump multiple dependencies by @Lundez

Full Changelog: v1.0.0...v1.1.0

Contributors

Lundez

Assets 2

25 Aug 18:33

Lundez

v1.1.0-BETA

919b1e6

1.1.0-BETA Pre-release

Pre-release

This is the initial BETA for 1.1.0

🚀 Vectorizers (BagOfWord through CountVectorizer)
🚀 Transformers (TF-IDF, BM25 which also exists as Vectorizers using BagOfWord as input)
🚀 Regression (SimpleLinearRegression)
🚀 Classifier (LogisticRegression without intercept & Naïve Bayes)
🚀 Sequence Classifier (Hidden Markov Model)

✅ Moved majority of code to multik
✅ Started adding DJL PyTorch Tensor support, ramping up for neural networks
✅ Added some Metrics

🙄 ...And some extra!

Assets 2

07 Apr 15:49

Lundez

v1.0.0

a74ab3f

1.0.0

🎉 1.0.0! 🎉

It's finally here.
For this update some dependencies has been pruned & tests has been added. Following we'll add ourself to kotlin-jupyter.

Assets 2

30 Mar 15:53

Lundez

v1.0-beta

1f96911

1.0-beta Pre-release

Pre-release

First beta 🎉

API is stabilizing.

🚀 BytePieceEmbeddings (https://nlp.h-its.org/bpemb/) -- Supporting 275 (!) languages out of the box with a lot of customability of sizes.
🚀 SentencePiece Tokenizer -- Supporting 275 (!) languages out of the box with a lot of customability of sizes. (OBS: JNI-based)
🚀 FastText (non-ngram) support -- Supporting 175 languages out of the box.
🎉 Documentation now in a Kotlin Notebook (README.ipynb). This means you can run the code yourself simply locally
... And some minor bugg-fixes in DownloadHelper where it'd redownload some WordFrequencies etc.

Assets 2

11 Mar 18:26

Lundez

v1.0-SNAPSHOT

a521468

1.0-SNAPSHOT Pre-release

Pre-release

This is a SNAPSHOT of the 1.0 release.
Stability of API should be ok for "completed" segments.

✅ Stopwords
✅ WordFrequencies
✅ Tokenizer + CharTokenizer & SimpleTokenizer
✅ Basic Trie Structure (no 'merge node' function yet)
✅ Stemmer
✅ Embeddings (Word Embeddings & Light Word Embeddings)
❓ Sentence Embeddings (AvgSentence & USif should be good to go!)

Assets 2

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

What's Changed

Contributors

Releases: londogard/londogard-nlp-toolkit

1.2.0-BETA2

1.2.0-BETA

1.1.1

1.1.0

What's Changed

Contributors

1.1.0-BETA

1.0.0

1.0-beta

1.0-SNAPSHOT