Skip to content

Releases: idio/json-wikipedia

Scala 2.12

11 Feb 03:00
Compare
Choose a tag to compare
1.5.0

1.5.0

Spark 2.4.4

05 Nov 01:25
Compare
Choose a tag to compare
  • upgrading spark version

Parsing inline references

24 Nov 15:17
Compare
Choose a tag to compare

The main part of this release is the parsing of Inline wikipedia references into a list for each paragraph. The point of this is to be able to remove those references from the main text, because so far they were appearing as part of it. That causes problems when calculating, for example, the context vector of a word.

Issues

#45 Replace defunct idio repos for the open source and maintained alternatives
#44 Add references to Article Parser
#6   symbol in text/annotations
#23 Garbage namespaces