
updated universe for number spacy (#13424) [ci skip] #13424

Merged
merged 2 commits into from
Sep 10, 2024
40 changes: 40 additions & 0 deletions website/meta/universe.json
@@ -5404,6 +5404,46 @@
"NLP",
"custom components"
]
},
{
"id": "number-spacy",
"title": "Number spaCy",
"slogan": "Enhancing Numeric Entity Recognition in Text with spaCy",
"description": "Number spaCy is a custom spaCy pipeline component that improves the identification of number entities in text and exposes their parsed numeric values through spaCy's custom extension attributes. It uses RegEx to identify number entities written in words and then uses the [word2number](https://github.com/akshaynagpal/w2n) library to convert those words into structured numeric data. The parsed value is stored in a custom entity extension: `._.number`. This lightweight component can be added to an existing spaCy pipeline or to a blank model. When using it within an existing spaCy pipeline, insert it before the NER component.",
"github": "wjbmattingly/number-spacy",
"pip": "number-spacy",
"code_example": [
"import spacy",
"from number_spacy import find_numbers",
"",
"nlp = spacy.blank('en')",
"nlp.add_pipe('find_numbers')",
"",
"doc = nlp('I have three apples. She gave me twenty-two more, and now I have twenty-five apples in total.')",
"",
"for ent in doc.ents:",
"    if ent.label_ == 'NUMBER':",
"        print(f'Text: {ent.text} -> Parsed Number: {ent._.number}')"
],
"code_language": "python",
"url": "https://github.com/wjbmattingly/number-spacy",
"thumb": "https://github.com/wjbmattingly/number-spacy/raw/main/images/number-spacy-logo.png?raw=true",
"image": "https://github.com/wjbmattingly/number-spacy/raw/main/images/number-spacy-logo.png?raw=true",
"author": "W.J.B. Mattingly",
"author_links": {
"twitter": "wjb_mattingly",
"github": "wjbmattingly",
"website": "https://www.wjbmattingly.com"
},
"category": [
"pipeline"
],
"tags": [
"spacy",
"number",
"NLP",
"entity recognition"
]
}
],
"categories": [
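The description in this entry outlines a two-step approach: a regular expression finds numbers written as words, and a converter turns each match into a numeric value. The sketch below illustrates that idea using only the Python standard library. It is not number-spacy's implementation: the vocabulary tables, the `NUMBER_RE` pattern, and the `words_to_number` and `find_numbers` helpers are all hypothetical names written for this illustration, and the hand-rolled converter stands in for the word2number library that the component actually uses.

```python
import re

# Hypothetical vocabulary for this sketch only. number-spacy delegates the
# conversion step to the word2number library instead of tables like these.
UNITS = {
    "zero": 0, "one": 1, "two": 2, "three": 3, "four": 4, "five": 5,
    "six": 6, "seven": 7, "eight": 8, "nine": 9, "ten": 10,
    "eleven": 11, "twelve": 12, "thirteen": 13, "fourteen": 14,
    "fifteen": 15, "sixteen": 16, "seventeen": 17, "eighteen": 18,
    "nineteen": 19,
}
TENS = {
    "twenty": 20, "thirty": 30, "forty": 40, "fifty": 50,
    "sixty": 60, "seventy": 70, "eighty": 80, "ninety": 90,
}
SCALES = {"hundred": 100, "thousand": 1_000, "million": 1_000_000}

# Longest words first so "sixteen" is tried before "six".
_WORD = "|".join(sorted([*UNITS, *TENS, *SCALES], key=len, reverse=True))

# One number word, optionally followed by more, joined by spaces or hyphens.
NUMBER_RE = re.compile(
    rf"\b(?:{_WORD})(?:[\s-]+(?:{_WORD}))*\b", re.IGNORECASE
)


def words_to_number(text):
    """Convert a run of number words such as 'twenty-two' to an int."""
    total, current = 0, 0
    for word in re.split(r"[\s-]+", text.lower()):
        if word in UNITS:
            current += UNITS[word]
        elif word in TENS:
            current += TENS[word]
        elif word == "hundred":
            current = max(current, 1) * 100
        else:  # "thousand" or "million" closes the current group
            total += max(current, 1) * SCALES[word]
            current = 0
    return total + current


def find_numbers(text):
    """Return (matched text, parsed value) pairs for each number phrase."""
    return [
        (m.group(0), words_to_number(m.group(0)))
        for m in NUMBER_RE.finditer(text)
    ]


if __name__ == "__main__":
    sentence = (
        "I have three apples. She gave me twenty-two more, "
        "and now I have twenty-five apples in total."
    )
    for phrase, value in find_numbers(sentence):
        print(f"Text: {phrase} -> Parsed Number: {value}")
    # Text: three -> Parsed Number: 3
    # Text: twenty-two -> Parsed Number: 22
    # Text: twenty-five -> Parsed Number: 25
```

In the real component the parsed value is attached to each entity as `._.number` rather than returned in a list. For the ordering advice in the description, spaCy's `nlp.add_pipe` accepts a `before` argument, so in a pipeline whose NER component is named `ner`, `nlp.add_pipe('find_numbers', before='ner')` would be one way to place it ahead of NER.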