Skip to content

Conversation

@LoicGrobol
Copy link
Collaborator

  • allow substracting the average embedding of the POS of the word (pre-clustering)
    • Inferring POS from a lexicon (typically SUBTLEX)
    • Using POS-tagging from SpaCy (make this optional to avoid a forced spacy dep)
  • doc
  • tests

This is an optional extra step/subcommand to avoid having it take over the otherwise straigtforward embedding and clustering code.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants