creation of OPENNLP-855 contributed by amensiko #3

Closed
wants to merge 1,325 commits into from
1325 commits
19c5601
OPENNLP-605 Added a method to extract serializers from a custom featu…
kottmann Mar 12, 2014
fb8eef6
OPENNLP-605 Now the Custom Feature Generators gets configured proper…
kottmann Mar 12, 2014
0609c82
OPENNLP-665 Added Spanish head rules file and implementation. Thanks …
kottmann Mar 12, 2014
f0409c3
OPENNLP-663 Added a method to retrieve all possible pos tags.
kottmann Mar 13, 2014
4ef9914
OPENNLP-665 Updated the ParserModel to work with all Head Rule implem…
kottmann Mar 21, 2014
f038721
OPENNLP-669 Remove duplicate throw for IOException.
Mar 22, 2014
9edb41b
OPENNLP-669 Repaired broken javadoc link to new function replacing th…
Mar 22, 2014
397ab9b
OPENNLP-669 Repaired broken javadoc link to new functions replacing t…
Mar 22, 2014
acafac7
OPENNLP-669 Repaired <href> tag for the website reference.
Mar 22, 2014
3522c1f
OPENNLP-669 Repaired <code> start tag.
Mar 22, 2014
15ef00c
OPENNLP-669 Replaced <param> tags with <code> tags.
Mar 22, 2014
bb62cab
OPENNLP-669 Replaced < and > with the HTML equivalent &lt; and &gt;.
Mar 22, 2014
a1dbd2d
OPENNLP-669 Removed two @return tags for functions that did not. Rep…
Mar 22, 2014
1823489
OPENNLP-669 Removed @throws for function that did not throw.
Mar 22, 2014
12fdad6
OPENNLP-669 Removed @throws for function that did not throw.
Mar 22, 2014
6efdafb
OPENNLP-669 Replaced < and > with &lt; and &gt;
Mar 22, 2014
bcae293
OPENNLP-669 Replaced < and > with &lt; and &gt;
Mar 22, 2014
d8305e6
OPENNLP-669 Replaced < and > with &lt; and &gt;
Mar 22, 2014
40e36a4
OPENNLP-669 Fixed full parameter name to postag.
Mar 22, 2014
16c4431
OPENNLP-669 Removed @params tag for cg context generator which no lon…
Mar 22, 2014
82f392c
OPENNLP-669 Removed two @params, and added @throws and added to the f…
Mar 22, 2014
baaa5f1
OPENNLP-669 Fixed double </code> tags.
Mar 22, 2014
6ee207f
OPENNLP-669 Replaced < with &lt;.
Mar 22, 2014
41c8d42
OPENNLP-669 Lists tags <li> need to be enclosed with <ul> </ul> tags …
Mar 22, 2014
bedf27e
OPENNLP-669 Fixed typo in last fix. &gt and not &ft
Mar 22, 2014
4d4776b
OPENNLP-669 Fixed typo in last fix. &gt and not &ft
Mar 22, 2014
443bd1c
OPENNLP-669 Fixed lists <li> and replaced < and > with &lt; and &gt;.
Mar 22, 2014
7b67839
OPENNLP-669 Fixed lists <li> and replaced < and > with &lt; and &gt;.
Mar 22, 2014
0938a5b
OPENNLP-669 Replaced > with &gt;
Mar 22, 2014
5b6ee2c
OPENNLP-669 Fixed @links... to point to non-deprecated function.
Mar 22, 2014
0bc5d66
OPENNLP-669 Fixed @links... and replaced > with &gt;
Mar 22, 2014
40852c2
OPENNLP-669 Fixed @links... and http reference.
Mar 22, 2014
a15f19f
OPENNLP-669 Fixed @links... for deprecated.
Mar 22, 2014
3820481
OPENNLP-669 Fixed < and > usage with &lt; and &gt;
Mar 22, 2014
463cb60
OPENNLP-669 Fixed proper @param and @return usage.
Mar 22, 2014
0887047
OPENNLP-669 Fixed < and > usage with &lt; and &gt;
Mar 22, 2014
19ba3f2
OPENNLP-669 Fixed @links... for deprecated.
Mar 22, 2014
1d88986
OPENNLP-669 Fixed @links... for deprecated.
Mar 22, 2014
3980b5b
OPENNLP-669 Finally replaced > with &gt;
Mar 22, 2014
69f1bb2
OPENNLP-669 Fixed list <li> and changed a @link to @code because I co…
Mar 23, 2014
14ddc7a
OPENNLP-669 Changed a @link to @code because I could not find the link.
Mar 23, 2014
a103654
OPENNLP-669 Replaced < and > with &lt; and &gt; to generate the examp…
Mar 23, 2014
e84af58
OPENNLP-669 Fixed <table> to have <caption> since every table needs a…
Mar 25, 2014
a337cfb
OPENNLP-669 Fixed <table> to have <caption> since every table needs a…
Mar 25, 2014
fcfe2c3
OPENNLP-669 Fixed <table> to have <caption> since every table needs a…
Mar 25, 2014
af17e6a
OPENNLP-669 Fixed <table> to have <caption> since every table needs a…
Mar 25, 2014
f2b6fb4
OPENNLP-669 Fixed <table> to have <caption> since every table needs a…
Mar 25, 2014
5db1d9d
OPENNLP-669 Fixed <table> to have <caption> since every table needs a…
Mar 25, 2014
67dcead
OPENNLP-669 Fixed < and > replaced with &lt; and &gt;
Mar 25, 2014
845f217
OPENNLP-669 Fixed <table> to have <caption> since every table needs a…
Mar 25, 2014
a04be90
OPENNLP-669 Fixed @param wrong variable name.
Mar 26, 2014
a996e06
OPENNLP-669 Fixed @throws for the correct exception thrown.
Mar 26, 2014
3c205c3
OPENNLP-669 Fixed @throws for the correct exception thrown. This fil…
Mar 26, 2014
a40c3fa
OPENNLP-669 Added <caption> to the <table> sections.
Mar 26, 2014
b39b005
OPENNLP-669 and OPENNLP-580 Fixed usage of loadResources() in UMIA as…
Mar 26, 2014
ef69859
OPENNLP-665 Added option to cmd line tool to specify head rules seri…
kottmann Mar 27, 2014
05bf5c9
OPENNLP-670 seqCodec is not initialized in the default constructor wh…
kottmann Apr 3, 2014
52a2a68
Attached is the patch for the current L-BFGS implementation. It inclu…
kottmann Apr 3, 2014
8ba662d
OPENNLP-569 Disabled tests for now.
kottmann Apr 3, 2014
4a4e37e
OPENNLP-569 Fixing remaining issues with the unit tests. Thanks to Vi…
kottmann Apr 4, 2014
c48a435
OPENNLP-665 Now language parameter defines the default head rules impl
kottmann Apr 4, 2014
1ece547
OPENNLP-81 Added doccat evaluator, with misclassified and fine graine…
wcolen Apr 10, 2014
fcb082f
OPENNLP-81 Removed detailed F1 CL argument. It is included in the fin…
wcolen Apr 11, 2014
b7aa41c
OPENNLP-177 Added DoccatCrossValidator to the CLI
Apr 11, 2014
3ab4b01
OPENNLP-672 Added feature generators parameters to CLI
Apr 11, 2014
087dabf
OPENNLP-674 Added factory to Doccat
wcolen Apr 16, 2014
1f46315
OPENNLP-674 Use tokenizer from the factory
wcolen Apr 16, 2014
81d03cc
OPENNLP-673 Added prefix to the NGram feature generator
wcolen Apr 16, 2014
9fca7a4
No jira, updates svn ignore
kottmann Apr 22, 2014
63e2ebc
OPENNLP-674 Added javadoc comment regarding the capability of getting…
wcolen Apr 22, 2014
c0fc716
OPENNLP-671 Add L1-regularization into L-BFGS. Thanks to Vinh Khuc f…
kottmann Apr 26, 2014
c712627
OPENNLP-677 Now headrules serializer for 1.5.x models is mapped corre…
kottmann Apr 28, 2014
dfcc289
OPENNLP-679
Apr 29, 2014
344603f
OPENNLP-680 Added instructions for OntoNotes
kottmann Apr 29, 2014
dd35984
OPENNLP-681 Added year to copyright, removed non-printed date
kottmann Apr 29, 2014
8c3ee4d
OPENNLP-678 Removed trailing whitespaces
kottmann May 2, 2014
f01fbe8
removing language specific casting for headrules to allow training of…
ragerri May 6, 2014
35addd9
removing language specific casting for headrules from treeinsert parser
ragerri May 6, 2014
f7694d1
OPENNLP-31 add evaluation support to parser working
ragerri May 9, 2014
d58b2c2
OPENNLP-684
May 12, 2014
867c207
OPENNLP-695 Added support to extra info field to Doccat
wcolen May 13, 2014
d347a27
OPENNLP-682 Added descriptions for the training parameters of L-BFGS
May 14, 2014
f5ee3fc
OPENNLP-699
May 19, 2014
d7b406f
OPENNLP-699
May 20, 2014
5839052
OPENNLP-699
May 20, 2014
8bd786d
OPENNLP-687 fmeasure update to avoid duplicate true positives
ragerri Jun 4, 2014
b4aca1d
Updated default values for L1Cost, L2Cost, and changed the according …
Jun 16, 2014
82846a7
OPENNLP-690 added short description help for ParserEvaluator cmdline
ragerri Jul 2, 2014
58a144c
OPENNLP-703 Improved NegLogLikelihood so that it runs faster and uses…
Jul 21, 2014
da9c165
OPENNLP-703 Added a parallel version of NegLogLikelihood. Added a tes…
Jul 21, 2014
505d9ed
OPENNLP-704 Fixed a bug in SentenceDetectorTool where performance mon…
Aug 5, 2014
aed6121
OPENNLP-690 adding documentation for parser evaluator tool
ragerri Sep 11, 2014
8fb72f4
OPENNLP-690 fixing small bug introduced while writing doc for parser
ragerri Oct 1, 2014
4df685a
OPENNLP-717 bug fixed by giving access to feature generator in TokenN…
ragerri Oct 8, 2014
a0fd3ca
OPENNLP-718 TokenNameFinder factory not initialized by void init() me…
ragerri Oct 10, 2014
e9d88a2
OPENNLP-718 TokenNameFinder factory not initialized by void init(); n…
ragerri Oct 10, 2014
c51be59
OPENNLP-720 Fixed javadoc error
kottmann Oct 14, 2014
89b28af
OPENNLP-720 Fixed javadoc error
kottmann Oct 14, 2014
4406881
OPENNLP-720 Fixed javadoc error
kottmann Oct 14, 2014
eb27294
OPENNLP-720 Fixed javadoc error
kottmann Oct 14, 2014
bc2da1f
OPENNLP-720 Fixed javadoc error
kottmann Oct 14, 2014
7828f75
OPENNLP-720 Fixed javadoc error
kottmann Oct 14, 2014
be86e17
OPENNLP-720 Fixed javadoc error
kottmann Oct 14, 2014
7b3f4b6
OPENNLP-720 Fixed javadoc error
kottmann Oct 14, 2014
5dd350e
OPENNLP-721 fixing xpath expression in GeneratorFactory
ragerri Oct 16, 2014
c373ab4
OPENNLP-699 Markable File Input Stream should not be public, the fact…
kottmann Oct 20, 2014
27e6603
OPENNLP-711 useTokenEnd=false case is now calculating the correct sen…
kottmann Oct 20, 2014
b14280a
OPENNLP-722 Ordering of features is removed from the Comparable Event…
kottmann Oct 21, 2014
e11b140
OPENNLP-724 bug fixed opening the input stream from the file
ragerri Oct 23, 2014
81520e6
OPENNLP-725 now the serializer is chosen from dict attribute and elem…
ragerri Oct 27, 2014
e2bdcca
OPENNLP-676 Fixed bug in the AnnotationComboIterator. The iterators w…
kottmann Oct 27, 2014
d87342e
OPENNLP-579 Updated the Entity Linker interface
kottmann Oct 29, 2014
906b331
OPENNLP-725 adding javadoc
ragerri Nov 3, 2014
8986604
OPENNLP-729 TokenNameFinderCrossValidator now trains with the fold at…
ragerri Nov 11, 2014
84211ca
OPENNLP-733 Moved pom.xml to root
kottmann Nov 20, 2014
99dd267
OPENNLP-733 Moved pom.xml to root
kottmann Nov 20, 2014
340c6b2
OPENNLP-735 Updated parent pom to version 14
kottmann Nov 20, 2014
82447cb
OPENNLP-734 Added AL 2.0 header
kottmann Nov 20, 2014
e38c6d4
OPENNLP-734 Added RAT exclude for snowball stemmer source code
kottmann Nov 20, 2014
9691949
OPENNLP-736 - Update Apache parent pom to version 16
wcolen Dec 2, 2014
12fad55
OPENNLP-734 - Added Apache header to DoccatFactoryTest
wcolen Dec 2, 2014
cd8d272
OPENNLP-734 - Added test data to the RAT exclude list
wcolen Dec 2, 2014
8bc1979
OPENNLP-651 First draft of 1.6.0 README
wcolen Dec 9, 2014
ec1dc51
OPENNLP-651 Updated the copyright in the NOTICE header to 2014
wcolen Dec 10, 2014
f93cf78
[maven-release-plugin] prepare release opennlp-1.6.0
wcolen Dec 10, 2014
b719cdd
[maven-release-plugin] prepare for next development iteration
wcolen Dec 10, 2014
e1e3880
[maven-release-plugin] rollback the release of opennlp-1.6.0
wcolen Dec 10, 2014
d6c46bf
OPENNLP-651 Removing empty '/opennlp' folder
wcolen Dec 10, 2014
adc4a27
OPENNLP-651 Fixed SCM paths
wcolen Dec 10, 2014
8aec110
[maven-release-plugin] prepare release opennlp-1.6.0-RC1
wcolen Dec 10, 2014
dc574c4
[maven-release-plugin] prepare for next development iteration
wcolen Dec 10, 2014
efc8e23
[maven-release-plugin] rollback the release of opennlp-1.6.0-RC1
wcolen Dec 10, 2014
21b936d
[maven-release-plugin] prepare release opennlp-1.6.0-rc1
wcolen Dec 10, 2014
8ed1563
[maven-release-plugin] prepare for next development iteration
wcolen Dec 10, 2014
c48af1c
OPENNLP-739 Starting PerformanceMonitor in Chunker
wcolen Dec 18, 2014
9bd958c
OPENNLP-738 Removed code verifying number of events in AbstractDataIn…
wcolen Dec 19, 2014
00a9325
OPENNLP-740 - added svn:ignore for IntelliJ IDEA configuration files
tteofili Jan 17, 2015
a484ef5
OPENNLP-730 Added a missing return statement. Without the return all …
kottmann Jan 22, 2015
fbf713e
[maven-release-plugin] prepare release opennlp-1.6.0-rc2
wcolen Jan 22, 2015
8552dfb
[maven-release-plugin] prepare for next development iteration
wcolen Jan 22, 2015
51209f9
Added eclipse files to svn:ignore.
kottmann Jan 28, 2015
8aea525
OPENNLP-744 Added support for attribute annotation in the brat .ann f…
kottmann Jan 28, 2015
8c1513a
OPENNLP-745 ObjectStream now implements AutoCloseable so that it can …
kottmann Jan 28, 2015
c8e9afa
OPENNLP-746 - added unit test for NGramModel
tteofili Jan 29, 2015
d04c4ac
Added target folder to svn ignore.
kottmann Jan 29, 2015
b666226
OPENNLP-746 - added missing AL header to test ngram model, using utf-…
tteofili Jan 29, 2015
c6d1caf
OPENNLP-746 - ignored testSerialize until it's fixed
tteofili Jan 29, 2015
14bba96
OPENNLP-749 Fixed array index.
kottmann Jan 30, 2015
883c9a8
OPENNLP-761. The beam size is no longer set when the Chunker is creat…
kottmann Mar 6, 2015
62d3723
OPENNLP-762 The beam size specified in the params is now written in t…
kottmann Mar 6, 2015
1ef777f
OPENNLP-714 added brown clustering features for token, token class an…
ragerri Mar 9, 2015
4f59af0
OPENNLP-714 as an aside, removing unused imports in TokenNameFinderMo…
ragerri Mar 9, 2015
4b6db8f
OPENNLP-716 adding local features that combine well with Brown cluste…
ragerri Mar 9, 2015
b223045
OPENNLP-715 removing html tags from javadoc to avoid jenkins build er…
ragerri Mar 9, 2015
8619fb9
OPENNLP-715 extending word cluster feature generator to also process …
ragerri Mar 9, 2015
6fcc881
OPENNLP-763 Parser is now using the new methods of the POS Tagger and…
kottmann Mar 9, 2015
27599fe
OPENNLP-715 refactoring from specific word2vec naming to wordcluster …
ragerri Mar 9, 2015
3a4ad3e
OPENNLP-619 Changed heap size to 1024M to match heap size in opennlp …
kottmann Mar 9, 2015
14f680c
OPENNLP-764 - applied patch from Pablo Duboue, clearing adaptive data…
tteofili Apr 1, 2015
7f82168
OPENNLP-764 - reverted previous commit as adaptive data is already cl…
tteofili Apr 1, 2015
f9fe194
OPENNLP-751 Corrected default beam size (copy and paste error) and be…
kottmann Apr 7, 2015
cd992e5
OPENNLP-765 Added CONLL-X Pos Tagger performance tests
kottmann Apr 15, 2015
4c3275c
OPENNLP-766 Added automated name finder evaluation test using CONLL 2…
kottmann Apr 15, 2015
b976f90
OPENNLP-767 Organized imports
kottmann Apr 17, 2015
9c72b0d
OPENNLP-767 Organized imports
kottmann Apr 17, 2015
e71d166
OPENNLP-767 Organized imports
kottmann Apr 17, 2015
960a6e3
OPENNLP-767 Correct indentation
kottmann Apr 17, 2015
2b94d08
OPENNLP-767 Removed trailing white spaces on all lines
kottmann Apr 17, 2015
2e7e8a6
OPENNLP-767 Removed trailing white spaces on all lines
kottmann Apr 17, 2015
720a54b
OPENNLP-767 Removed trailing white spaces on all lines
kottmann Apr 17, 2015
0a35219
OPENNLP-767 Correct indentation
kottmann Apr 17, 2015
8111caf
OPENNLP-768 Added cross validation support for the parser.
kottmann Apr 17, 2015
a0e81d3
OPENNLP-769 First draft of evaluation tests using OntoNotes4
kottmann Apr 17, 2015
aefca10
OPENNLP-770 Evaluation using CONLL 2000
wcolen Apr 30, 2015
4578397
OPENNLP-771 Evaluation using Arvores Deitadas
wcolen Apr 30, 2015
8282a31
[maven-release-plugin] prepare release opennlp-1.6.0-rc3
wcolen Apr 30, 2015
2cbde4e
[maven-release-plugin] prepare for next development iteration
wcolen Apr 30, 2015
c3ff2a0
OPENNLP-769 Adjusted score of the parser to the actual value
kottmann Apr 30, 2015
7c66a35
OPENNLP-774 Corrected expected scores
kottmann May 13, 2015
6e45ef8
OPENNLP-768 Renamed ParserCrossEvaluator to ParserCrossValidator to f…
kottmann May 13, 2015
24ccf75
Updated year from 2014 to 2015
kottmann May 13, 2015
47ad612
Fixed a typo
kottmann May 13, 2015
5a74d3c
No jira, removed main. Contained some debug code which really shouldn…
kottmann May 20, 2015
ab629c4
OPENNLP-778 Added compatibility code path to deal with 1.5.x models
kottmann May 20, 2015
0601d15
OPENNLP-779 The hash is now computed correctly
kottmann May 21, 2015
6c3cf22
No jira, removed a comment and some commented left over code
kottmann May 22, 2015
44b28c4
OPENNLP-781 Removed old unused maxent util classes
kottmann May 22, 2015
6a8739d
OPENNLP-747 Removed version from compiler plugin. Default from parent…
kottmann May 22, 2015
6c80eea
OPENNLP-782 Added snowball stemmer notice, removed jwnl notice
kottmann May 22, 2015
9c5094e
OPENNLP-775 add support for lowercased word cluster dictionaries
ragerri May 22, 2015
a4d5bc7
OPENNLP-783 Updated OSGi java dependency to Java 7
kottmann May 22, 2015
634bc4a
OPENNLP-715 cleaning up old w2vclass after fixing this issue
ragerri May 22, 2015
1267687
[maven-release-plugin] prepare release opennlp-1.6.0-rc4
wcolen May 23, 2015
1223e76
[maven-release-plugin] prepare for next development iteration
wcolen May 23, 2015
73b3be4
Removed the known osgi issue, not a problem anymore
kottmann May 27, 2015
3dc1b8f
Removed JWNL and added snowball stemmer license
kottmann May 27, 2015
44a1e66
OPENNLP-784 Now recognizes all kinds of white spaces to separate a li…
kottmann May 28, 2015
bfcd00c
No jira, removed left over debug main method.
kottmann May 28, 2015
76cccdc
OPENNLP-768 Removed old EventStream related classes. Those are either…
kottmann May 28, 2015
68c559e
OPENNLP-785 Factory is now included in the model
kottmann May 29, 2015
512f297
OPENNLP-787 Cluster id String objects are now reused
kottmann Jun 3, 2015
93f8b99
[maven-release-plugin] prepare release opennlp-1.6.0-rc5
wcolen Jun 11, 2015
e3a1062
[maven-release-plugin] prepare for next development iteration
wcolen Jun 11, 2015
3e13fa9
[maven-release-plugin] prepare release opennlp-1.6.0-rc6
wcolen Jun 16, 2015
8269965
[maven-release-plugin] prepare for next development iteration
wcolen Jun 16, 2015
a1aa116
OPENNLP-799
kottmann Jul 29, 2015
4a6ccf1
OPENNLP-777 - naive bayes classifier (patch from Cohan Sujay Carlos)
tteofili Aug 11, 2015
6c0854f
OPENNLP-777 - fixed javadoc causing failures with java8
tteofili Aug 12, 2015
9096572
OPENNLP-810 POSTagger incorrectly tries to set pos probability to po…
kottmann Sep 3, 2015
2b2b85f
OPENNLP-811 update namefinder documentation
ragerri Sep 3, 2015
2158d31
OPENNLP-811 minor formatting in namefinder documentation
ragerri Sep 3, 2015
1177f53
OPENNLP-690 adding parser evaluation documentation
ragerri Sep 7, 2015
1950e00
OPENNLP-219 adding training API documentation for Parser
ragerri Sep 8, 2015
905fddf
OPENNLP-812 added lemmatizer documentation section; currently only fo…
ragerri Sep 8, 2015
9c974c1
OPENNLP-777 - added model RW test, minor javadoc comment tweaks
tteofili Sep 17, 2015
e0afa85
No jira, added javadoc
kottmann Sep 17, 2015
e58e451
OPENNLP-819 Now reads multiple files from a directory and extracts th…
kottmann Sep 17, 2015
aff8625
No jira, removed unused imports to fix compiler warnings
kottmann Sep 18, 2015
6f63a5b
OPENNLP-777 Added sample params file for Naive Bayes classifier
kottmann Sep 18, 2015
6abf00f
OPENNLP-818 Added external resource dependency support to the Diction…
kottmann Sep 23, 2015
5dcde71
OPENNLP-822 The model now always includes the default name finder con…
kottmann Oct 20, 2015
cf1d0bf
OPENNLP-822 Deprecated createFeatureGenerator and added a comment.
kottmann Oct 20, 2015
c9b2916
OPENNLP-822 Fixed a bug which prevented custom configuration from bei…
kottmann Nov 4, 2015
9869036
OPENNLP-823 Removed deprecated constructors
kottmann Nov 4, 2015
6c2e67f
OPENNLP-823 Now uses non-deprecated constructor to create a name finder
kottmann Nov 5, 2015
ab68702
OPENNLP-777 - NBModel always smoothed, removed DoccatNB as NB's to be…
tteofili Jan 8, 2016
a3f2c52
OPENNLP-829 - added javadoc to DocumentCategorizer and DoccatModel
tteofili Jan 17, 2016
fa00621
OPENNLP-829 - added some javadocs
tteofili Jan 18, 2016
cffaed7
OPENNLP-659 - added support for language models
tteofili Feb 8, 2016
14fe547
OPENNLP-659 - added missing package info
tteofili Feb 8, 2016
a0bed44
OPENNLP-760 first commit of statistical lemmatizer: features and sample
ragerri Feb 18, 2016
56e6a6d
OPENNLP-760 adding factory and string utils to induce lemma classes
ragerri Feb 18, 2016
d2c514a
OPENNLP-760 adding learnable lemmatizer and model
ragerri Feb 18, 2016
95ccd11
OPENNLP-760 modifying dictionary lemmatizer to use general lemmatizer…
ragerri Feb 18, 2016
0526adb
OPENNLP-488 - applied patch from Jeff Zemerick to avoid NPE
tteofili Mar 9, 2016
0236fe9
OPENNLP-659 - added missing javadocs, minor tweaks
tteofili Mar 9, 2016
cac4db6
OPENNLP-837 - applied patch from Jeff Zemerick to throw an exception …
tteofili Mar 14, 2016
87b7c14
OPENNLP-835 Fix early termination, reset behavior and minor memory leak
kottmann Apr 27, 2016
3119500
OPENNLP-847 Change visibility of deprecated train methods.
kottmann Apr 27, 2016
411d061
OPENNLP-760 adding APL header to lemmatizer component classes
ragerri Apr 27, 2016
e35eb55
OPENNLP-847 Update to use non-deprecated train method
kottmann Apr 28, 2016
1643314
OPENNLP-844 ngram feature range in doccat now as parameter
ragerri Apr 29, 2016
81891ea
OPENNLP-848 Print training data summary at end of training.
kottmann May 21, 2016
218361e
OPENNLP-849 Improve handling of InputStreams
kottmann May 21, 2016
0035671
OPENNLP-830 Replace the IndexHashMap with java.util.HashMap
kottmann May 24, 2016
f695920
creation of OPENNLP-855 contributed by amensiko
Jul 3, 2016
534027e
.DS_Store removed
Jul 3, 2016
4fb05c7
Cleaned up
Jul 4, 2016
1394869
Working Evaluator and CrossValidator created
Jul 10, 2016
7fb9c62
Code cleaned up, Javadoc created
Jul 11, 2016
208 changes: 207 additions & 1 deletion KEYS
@@ -70,4 +70,210 @@ v1VPKPblEwJfuqC3DQh3XWzs3AwjKLXXfwznF7slqBRT48BwdLsietnovoTsZXYg
1cqtTc96tSwb2XW3iA2uZlD4aTkrOmm3FKbauC/rFmCjkpvwpvqcIdpib4M2DgNx
zAZ2cJnxw3f57qc9Yh5qvhDUephwOAlAy8ekc1AmX14F+mwYE3GjcqeGdEbLNw==
=GLHu
-----END PGP PUBLIC KEY BLOCK-----

-----------------------------------------------------------------------------------
pub 2048R/44DC2602 2010-12-03 [expires: 2015-12-02]
Key fingerprint = 00E3 DDC8 D514 81CE A29A 01AC C25E 6698 44DC 2602
uid James Kosin <james.kosin@gmail.com>
uid James Kosin <jkosin@apache.org>
sub 2048R/6131CA9C 2010-12-03 [expires: 2015-12-02]

-----BEGIN PGP PUBLIC KEY BLOCK-----
Version: GnuPG v1.4.11 (MingW32)

mQENBEz4aAgBCAC092PLZqCzabrpoSORtz6asDeXqB2ORDmh8dQJ7x4pcjV6kNac
TnfNj4/xAHEKv6RFFhbvLWhB+wlYrh55VybQBmvxdiG9YlnwS9FicGIPS5no0TQb
EEyDE77ZXlUVli7vrpEWZ4ziO4FuuqJqRU+DNwRCC4RFJo27ULQQmOJwyS66J+1L
PSdO5DS5eQj/kg7Qqk8qSXdfk9QJpgFC9Bw+DM9tojrQ96wGmdUH/l47AFUmLqvw
04uKQGfWm1NVuVCqafoQ1LZAjwNpI81GcbYuFNo2oXJ3leRO+9K8VqHe8Ba0Q5i6
yP4Qv5fBrJYE1Xhf7bqMxlqYpNyEr3VjcCL/ABEBAAG0H0phbWVzIEtvc2luIDxq
a29zaW5AYXBhY2hlLm9yZz6JAT4EEwECACgFAkz4abICGyMFCQlmAYAGCwkIBwMC
BhUIAgkKCwQWAgMBAh4BAheAAAoJEMJeZphE3CYCuYQIALKG5NlRqs5QavwG8nnX
gl56yrVsN3P9GV153I3HCO57nxFqvOAQ1SDXMn5LijOxGDi5e9Ik4+KDq4K7I9Gv
W9AOXqPCZZsNW9Nc+PCK1cs/LosQRuYaPib1kBM4z5pk4U86IFo7DvALnG4bmpgF
WBEQ54/OdOdJBS2sFMFNOlpujFo8yntuq8NSGrhGGu90z/sIWkzlTlBiWWFAJAm3
hAbir1by3x2U+bTVmi48ZZMGwlaxCY9Di2nHwN5yFZJHl0b4CbdhwOocnCntBY0M
Pamh0XBdcFduPZAyFvn8aBChdRUJsaceGRpoJjnGkKw/A2nh6rBUaLfv6MWLpPi8
fVG0I0phbWVzIEtvc2luIDxqYW1lcy5rb3NpbkBnbWFpbC5jb20+iQFBBBMBAgAr
AhsjBQkJZgGABgsJCAcDAgYVCAIJCgsEFgIDAQIeAQIXgAUCTPhpywIZAQAKCRDC
XmaYRNwmArh5CACc2GUgc5jO2K+sir4bW151k68lM+dEdTm1XV9rxUzk4mv2vwFS
ABTnrtAyZxA4ubLAS1pfsV7uXY5TSNVRFUTHebdHL5dP24SNvAAWKhS0s0j2ojbe
amzi9BEt3SZt5i2TbkfPFyCYTNaSEfIbkuYoq142iNmmNdf2VBFqLBDA/Y5hOXOS
y28yeGeMKmj2LUYVamuvFnsUjBIvUgSqyaR/wfwnBdqwKde9VVUlQtrENAHT+fcX
9mV6qxTu0f7Ykna/xOyGrc6tvp0+cSryguWE0z0GikeT9QKsexuiRRrz2rA1gyPR
yugpy4p6/t4TSEQtM+O14EAMx+sbtivuWxQLuQENBEz4aAgBCADGtWJtGMK1777F
q4N85JLKgFoKHJDHA3xHkdFidxY4cooSExOBOuFpuH6PdU8D76X8tSE86YpVYmJo
8SgdgzNFO+Trwlv/oe3xGC+O0i/teCCmEn1pNlMMvJ7pNRMQKFh7FfS5ObYKRVeF
IXKx6JQsjVwmKjgW/tnyhFnDP3jJqEZDmesFD61E2/5rNX+a8qMddfpYR0RHpRR4
WJd6SqKwflkbXkW+t/ln52GHgCkx+WPfbRjE4Zh5KZqmaIkAByraWegLJgQZ+syD
/JCa+2flF1ydB+q+sKBRYMPbEDBavB4naNPQzA+Iut/7aD5Ht81uCOBXZDzRMf33
YEqPcsTXABEBAAGJASUEGAECAA8FAkz4aAgCGwwFCQlmAYAACgkQwl5mmETcJgKm
Mgf9HB9Jg6WiIYzgmelBAmLlbITlbH5iMVvm3TbBSxN/AD14VFx9Gyd8AiHPdH7B
kpQet0URZZmDhL++4uD71wtdb/v1NQlZt8Dyw2BHN8w8iwIk36pxgMLuxdp+Xcs9
L1OZwGEWgAO/PPKWEy6bWcO3MvAU+gEjKKKQMNkkUymbDAr0J7K7qCrH8lhErFTs
S9Xv/cxLewQs921l/LUIS+vxL5hUaArhF7yvbLx8OzbAyoaMzDljTbSKH9zM2Ryp
gPPQm2kx4WCL1OLc9faqJmYXwsa0O7zq3uJEZx/nCoSgF+u9uqkjdFcwEarBtM6Y
M3lA67tQpl6feMswEOgsEql7Bg==
=jXKL
-----END PGP PUBLIC KEY BLOCK-----

-----------------------------------------------------------------------------------
pub 1024D/91DDAC20 2008-01-25
uid Jörn Kottmann <joern@apache.org>
sig 3 91DDAC20 2011-02-22 Jörn Kottmann <joern@apache.org>
sub 2048g/7B06114B 2008-01-25
sig 91DDAC20 2008-01-25 Jörn Kottmann <joern@apache.org>

pub 4096R/5EE31F7F 2011-02-22
uid Jörn Kottmann <joern@apache.org>
sig 3 5EE31F7F 2011-02-22 Jörn Kottmann <joern@apache.org>
sig 91DDAC20 2011-02-22 Jörn Kottmann <joern@apache.org>
sub 4096R/87CFF9D9 2011-02-22
sig 5EE31F7F 2011-02-22 Jörn Kottmann <joern@apache.org>

-----BEGIN PGP PUBLIC KEY BLOCK-----
Version: GnuPG/MacGPG2 v2.0.16 (Darwin)

mQGiBEeZsXMRBACW7VN2XbSW2IiAECQbECh3a54Kq7K4nct04zDBerjwxXRYBJaR
dGLkZ1iuto/fTWf9LedYctT5teRuLTw+hJNT3GmPl2RKsnQoCYnlrzXfQ8dkGvwH
d9RBx/3ax9BE2z9aQW+MbCDFPzJLEKk4XGsoRWBRy+DVFKG8CbANCJiSkwCgvPIn
v7y19Yk4XFKorfR012I6LH8D/1DeePwUEBrphORoDn7KInvZuDhjOLxGNp1puU5l
PLnhAFDRMN9VXuG4GFy7DFmyhcXv/AwI7AKA1sX5gQT3dQ5m+vTCmBbMX0bKAgud
gMXtkqSlnBJwM6M00dZOtngxYV6ocsoONKzkBYyNWGCinUhDxtTs+rR7V3LbEMz+
73TUA/9lGAsg1lOVjqZbDiYC0AVQIM+SFVlRRloTQaVyLdwaaLrIsGtDq/bwsVv7
S+gTgQtrwjxLwF0vL2rjgK4IDI6zZqAgdBagK9GKxmdKh3wizkWMPSaNkfQ7IXqp
Z9M1Gkgbvr4o/x1Al64ZbiXihjMw1wVatHlS/TFbvQOJDQMBzLQhSsO2cm4gS290
dG1hbm4gPGpvZXJuQGFwYWNoZS5vcmc+iGEEExECACECGwMCHgECF4AFAk1kGDMF
CwkIBwMFFQoJCAsFFgIDAQAACgkQ2kbYYZHdrCDRxgCgmVHGB6yD0OJU1yxKtaoJ
R9mBQKgAoISW2Z3xbTufMrlXc9hAUNLRxHgHuQINBEeZsaQQCAC+cK1uFagdbUQo
65cfKeVQOMaWA46W63BpX+ZOuQ7AvuV0w+5TAzh/VCEoaS9G9lwhXmBG9eKpSLRz
cLv9rj7OOPWLYU9HRhMJ7A9inhx1uOOAbwzhmAbEYjiDTutz9c4cjF9dxM0adboI
/nDNV92FhL3i4GFS+mkVPrPYnjtOJmrQnsIFKmLkq//va/Hy7X/Unjr3HVVWWYvm
Up7R/5YcDpK+J/a04KBK1E59mVKO8D3XKa2+nyzRwu9PgT8AhGPESC/YLG/Eq+Xx
nLuO+Th0oe8t6gWhhhtkMawttzx22LeS6OXagK5wO8I8AqokhuAOtnto5sf3mODF
37rPW4QnAAMFB/9IIv7BDRimEr707yxty8YeEn6+wJgO93lWZXvoz3yTUXw0w9ug
abQNYkJoVK2eDAzazC2m9cw1F2rVrP1tD0L3bFhKqnsp8rEWPYEsDNtLwTkBXYz8
7BSgIrFQFoVJM0gQAgWsvJy2PdubYqJzOEhVAzVq7hhvsMvcgI/3kwTbkNaRrODw
RX+66I6JSUtuxWLqMpX4MYV3LG6gp0dVA+yWZjPgWKDFfWh4SdA2dbYFpSHpZIRn
Ou5OmwxpNb429nz4iZmB4+qSqU+Y1JYrtSdwA0BgF1OSwEJe+piwbAqTv1UUNJoo
GtYLuAuqezckLTe281eGQNtOoukAt5El1OvJiEkEGBECAAkFAkeZsaQCGwwACgkQ
2kbYYZHdrCDW+ACdGhmTNDDusXBzUJIjDhVDoFvigsYAniLz783Y6+1ic8DdTfqR
CAffspdh
=HErw
-----END PGP PUBLIC KEY BLOCK-----

-----BEGIN PGP PUBLIC KEY BLOCK-----
Version: GnuPG/MacGPG2 v2.0.16 (Darwin)

mQINBE1kFMQBEADLi0EsQnl2Ntttis+lN8FcyeEilETbRRiT3QNnse/OXKjJx5jn
3I8qTPUQE765ZxVceX1f6qbcz8pzGtq6qwdn/pp68CL9OCkU4mbGssr1mZJyg40R
vEXfpuYPBLx//nruXWvlShhqWg1OnN/VriuCG0yogYiWLiGGj7uGAcg9EMufN+hZ
GdHZZyQ99lsI1V6/fbpPdWxDNoW9C/mu9pouUMej2VoZOhquluDXrZrlO5BNcVhB
8uvvkwL5OxM7JVsoKTCQEBNrCQw8WQtKZNCHs04cS8srAIxSq72GIFfyruX+73Gv
qWeOzxxw0Gzs2R0A3Xu5XqxkqiRty2QPeGXQrym5jptV7W6rVzIXcUSaWi/SANxb
ruS1xBhNCXl/+n786N91G+iEFFDI4IAMPFauf+bMAkz22wYJeefL+fJdda4pTCw1
OyGSytwju6YBayoQ5PdceiXb/POug/fmTut+N1QNuVzoyRyXn8WcJujUvjcQW+in
739Pq3VrnGnsNgccTtzN0ySLaaPF7xrhC5Mq+sO9CReRmbF5LAbL0tnVeCGEStXi
tsWUzHfURPVK+BnQy77AJUcnF4+tfJHFSvnVgbVh32Safdjs+CwrhqTlPWLn5E8n
naF7SUMbDihvoPtGXlqnyy2wUOx0WtFgqZMf99It5eJM821J/AQVeN4pNwARAQAB
tCFKw7ZybiBLb3R0bWFubiA8am9lcm5AYXBhY2hlLm9yZz6JAjYEEwEKACEFAk1k
FMQCGwMFCwkIBwMFFQoJCAsFFgIDAQACHgECF4AACgkQ6tKtml7jH39QWQ/zBi18
aAtshZThYqF2Pj80hQ604MsFvlmhqG9/puJYMkKEXyMuhepan1cIJJceCw7GXB+P
M8wVwePkMo5adzcr/0wnH++hC6P75IimWo2kf0hL2l1z0zkq9LQQt0JRjY9ZqxDw
REj+6WSZl2l3JJVnTycpbzDzuLSAxHcmdIG9bkqnwrZ77GwO8J/MN+MUycUMT1fd
7iWPJpvjqnOWNE1+L3cFReUt6VgzVidJ7jHrOz49rTmqlF97+yjUy1tyRIDeLl3C
9V+oZM17ATlw5PJ04iB/q2Lg/caY9H5FsAcvjDmCNN6Eutyx0K0DC6DvZA9y5c5v
OacJhtXR2jXukMnHs2p2WW2E6XoS4KQomC/F6qZngVpH18+bzMzsQu7ASud3epBd
9iDbiJ+UgjtUj6ec+FWc56qKhdmvdtqmzdvgSsQN+JaOpyCsWJg77sBM6/Z76BFL
1RC63mhuUd67bAdUNKKiw4oZ9QMJCd6KP/Fm+e98huetZcPTf//BglyePTkl4yfV
iL5tk0T/kn2YNEG/mjev5HNROg0KoF0REl6FZ5+WwFnjmdjAw9BZRMNDwTDFSj3c
Sl0PUZ2+R2V0gf3TBdxVhYCzPzLujpYjp//UgrJk6XWYmMNUGMQJq7P+LuUye0h2
TTz0eGDn/i1JDf89ET2v6+xg3GrVSSKHCTRzgIhGBBARCgAGBQJNZBd1AAoJENpG
2GGR3awgWjYAnjYtcttIduCQA191T4DvQYzdWuK7AJ4qAUXX4lbkzeLGEgNolVJd
jtKD77kCDQRNZBTEARAA6cF+kZaIb4IlMO1FbTXowCOkEYOWhOP+5eXCcc1q3ZB0
HNV3kRCxKH1dbaDOhAxmuP0nMLwSDkQcFFGxfXAIfV5Miwtdtlwc7+jrCYMRzqZ4
zRCWmEdJ5bTE3bdiYc1Wo4/8dPfB9hn6sv/MGjlWy/DB4tElFYA1JDOyCdQ0SSwh
yc15Yv+i78URjUf7q1WUGHhH7YN1lSldXvAiU4ZYioHLiLxMuhmGtXMoiE2+jRFt
E8x9RwQCvDUjBMXqZ82FM/aqVeqMqcYi8F3iELDbZrVGoGBQ2cJh9a/rSvUTBEPg
OSrPieDQqTlNK36isGBdLETDU1gPIXtHoUlbGpe9CvK6wazAjeEV+ck8mvRz20iC
i3RzkkvlN2TV+s0VNGQTztK2DQblwuv0yeEGuiq2GlakMnhsSJLWgYYcPDirJR/j
2qsFIOZOxtWqqPfB3wJyl2wmBXjSIfpb7BCulCSP1QVHos1OITbFB0QLST9twWPw
+cSF8tezJ3rbGUvvkBQQgpdDMUMceaTL84PGvUW1z5uz0HW3jo5ISihNRUN+zcQR
63q5+/Mw3Oar2dutmFxogC5iuIj+jZjRUewVaa+BC/YRNNeDEWSm63NVco+NRlS5
+Zsd2+831HuTtwuaSavSOJuZCx5HBA5578OEfwpFQRSYDIVJ1en09D/4K3GrvdEA
EQEAAYkCHwQYAQoACQUCTWQUxAIbDAAKCRDq0q2aXuMff0Y8D/48gN/HXOurJhES
78hZsjkoIE1CUzL0MDtil5RI2XUqdRn2bIsVhsMtEK81aZjM66XLsRSuFaaAZwn0
QecxTI8gd4U6VV0DRP0R8Yq1Dcg8vaPAZVwPXlC0SDQaGrlR4bpW1mO51nSUnxlq
la7zM1vzmboVn3nD+OjOshSPMQRnxWN/8L0pyQep7IA68UeJdRo/9DsoIJXU9vMF
14YIfE78jiXlv0MmDtQgQTv3amP4ktm6fcbXlTrr4tgiWYDbRXeerMY473pLLYtq
7UgtRSSZvQupBBB6KflojCfHrX53VItive2QcW0Grz7Rcz4/E3Rjr2Rhv5RRrpeg
2gF3gi0PP0Wl5k5sgMmF2Xx78SGv5eww1JD/ZCXQDYyzwV2+6En8BlOvUcSVkSfo
9dAel7PpcV59RZcc/oKWh7hZO2sbUUNwQGEAz1Rfz8s5HzQHB5Y90n80XjPsDYg9
XJ88V2f4lU+/dQjasBXph0e7LvlkZrn50ji/sfwpuBT/6++Jf2dr1330VukWXyDg
4U0dMVq7wNbB10sLJIdBPOvWb8jEOsv6hA28M9WOLM8fd6petg7n4zkRAjU9Hlk2
UPF+BhTMjtxCA7+XTVIHXkOBEWiA6b9WRyK9y3T2pLFvwQi8qhCk0DgY4tUX3Yoz
K8lv1puwHj4laJEwSV7NpnveVzKRIw==
=cn0A
-----END PGP PUBLIC KEY BLOCK-----

-----------------------------------------------------------------------------------
pub 4096R/BC0737A6 2013-02-05
uid William Daniel Colen de Moura Silva (CODE SIGNING KEY) <colen@apache.org>
sig 3 BC0737A6 2013-02-05 William Daniel Colen de Moura Silva (CODE SIGNING KEY) <colen@apache.org>
sub 4096R/EC30C4FB 2013-02-05
sig BC0737A6 2013-02-05 William Daniel Colen de Moura Silva (CODE SIGNING KEY) <colen@apache.org>

-----BEGIN PGP PUBLIC KEY BLOCK-----
Version: GnuPG/MacGPG2 v2.0.18 (Darwin)
Comment: GPGTools - http://gpgtools.org

mQINBFEQszABEACpumcwNdeLSnleW2/a7oVGxIvIRQueFKUoIME+nnnazPhcXfxT
CCefEQkG5PsVZrFT3Koop3jgcof6v/Jo8XHF2RMnxZH+wgehxxBY+co6N3jxjNq4
WsKRYJXKMOO1/+vMGdJI1Dc7KINNDAC2T32UOaQSzdke5tOmLlIRFYH1+u8QweB5
z5YDOZK5ZY4nsfUv0qxnw1rkUDuv73/SEVR5YIbzuuTsZ6fOJj1NLXVq7yYuHjbJ
ZFumaS0YlJ5QE4IqU4LEgDIdcIdwjJFu8YK7xsURdWwc/5TNHy5jcp9mwFycRmq1
WndO9yeNx0hysbVkvmWQtnN4i62bT3tex34KvH9X2JEi6aGuSDOSMuSiwVsYiqj9
DZGA+bVuzyTfcBU3tNVPT4Pr9b5T4yrxoYbFUeAzVpOH6PC5EyRzjceLNGWhivgE
s+jJZ9neAy9OBKjg1t0DR2XpA4A3oViuO82EEs5i23l2SUTa3oyd3VoYBDjtZMUT
SqShRP+XCO4VhRXPh46cTMcWtf33Db9szPq6NZJwN4NBYSC3cEZiFjY/QcdU3mUZ
ZztIVkX5272CHjgVREFwvS+PV75daV0G6sIOMvxO9saQCjtTq+zplJOCJUQJCaOe
Lmo0Vl6m7h+qSLBLO7EXRbpM56HbHdT8qWMueiITPak/mS6c0ELsrYD6wQARAQAB
tElXaWxsaWFtIERhbmllbCBDb2xlbiBkZSBNb3VyYSBTaWx2YSAoQ09ERSBTSUdO
SU5HIEtFWSkgPGNvbGVuQGFwYWNoZS5vcmc+iQI3BBMBAgAhAhsDAh4BAheABQJR
ELUhBQsJCAcDBRUKCQgLBRYCAwEAAAoJEB0+sPi8Bzem5ZoP/0FrWwhbIRFngxP6
IMlWpz7Mjlz3pF2cSfdoouF6SjJvm+CPbxrd1DNsWrpeuMS29R9yPMzsrq0q4LB3
dmqofNphwcg2G0y9zxysyC15w5m5ZKI9zSvBksLdo8dEFRKW3Ko/AgwypWi2IaKI
Q7w7LSMRAGR5/j96z/5bbfUEjDFOBFXyXnANadGULMGGh3b6NTxDz7OG2hJve1tr
5eo4/Y9bKz7ISLvAr33v/QOzHnuyjZo9iLXrpw2vIhWh6wRfZmFUg9Z5Tk3DUs+t
CspkUY4q8EKj+A9RHKRLAjGFLsOHzRttHh1GVrRizQNYV5V4+YtwmaMqLizeABXL
g5wrKtHxGLp34KFDe4N7UO5yUf0ZyhBlFBRQjT0dQhPVnYNHU0ODpmnyOZJ+fgT9
65emPQGHerLRosEDdji2xEIpZg3kARUDimi2r+LxNcSa62SWohKr5jYjJSXlSvfX
9aZa9DfHUgDIK0n+8v+SgO1S+wtpmpXMe88NYm8MTNCyoK/Az1IhaMWga9hwhkc7
ReNJamumqjp5YJ6gWOr+0ARENJQDSxw8lXIjKzpPPn1uf3RwijtTxepImAygU7aQ
kOiAJlfTpR3vwKlu4BMVrNKJRoRWpiD1pB4RO45QME6JJ30UgFBCRhv+hGjccofJ
Dm/+ftWZIQt5NBDm4v4NdSODII8NuQINBFEQszABEACggwZ1EfGmA/i2Q2MiHLGz
Wg4KxaXrEpQ7zLiMYVhcWkdiYqs9U4ca5o4aFPpHXoQ1YNyHP1IrqWzP1JqI7z8t
7tO4HlB3ww5/jMQQJsRzYGsndBhES1KI4hO9E7YZdMP8JfI9mQlDAfHQwb1GRSjB
Z3NsnD+C0MhozPf5pbBZKP70JUvaQ1taeGQr1VoGOt6+UG/uH0M4dEcptWEVUlH6
LrUI3ZNgb4semArikA2/aTp25kcMRqHT+KLN9WvELe/vSJQEdWr6DCggvKMwS4Hz
k1kFkWFp/GVAR6/TrDxA7hljKPXRKvOL2jP0AJeXapIDHOaWyuRVIm3uPLTwdHgz
O3lno7a7tZpX4vgdqMTovWnhheh07XXeUQNJ0Cy/fGi8cvlce3i/5Xm905PlBiIj
zgLx+yojfIpoLcZZ8MDx0c6t6Inbi70qPSfx0McCm4APTksttaja7wPj2bsPG873
YwEHgzyaAO4xqcR7mhx+LrRzrghhyr2Ra3VmqEKsDG6Pdz2qxegIkKAzpuqVvrAC
e89BynltFu+2/HkrwoQvevdYNgueNlL772m7dqqEwYrUc7dOGbLBjJ2uJhpvZeok
P6XA/GODTkW5Vi7qPqTvQQJKrvQ8/jczS7L8RW8zOipNIsa7434KtNZT6kvfporS
zTYWgCpTmQ6hwb036WmM8QARAQABiQIfBBgBAgAJBQJRELMwAhsMAAoJEB0+sPi8
Bzemy9wQAIRWWEzVF4ZK8Wq5KRBJzfT+57jc0qP+dvhjORFIHton0sFJIqKxTF/s
jkbIilnkR0FcwtcNIduXNfSbqOuQjfpXmmOOjQfww3aeStY5i5r0tD4yGtXAK90P
q+R5fG5Gnw3P1dB7GXRZ+bQjBfJ/XpjX5qn3z6vrhj6DHu0oqx5n2rZhIOXfofVs
Otmk8BoeYFJPg6VExJRXiWmZHLLTqEJsZQDupv3+btmRb7++sjp/krCHjffZiiyd
eJDUVQwpY5nRedjYrgLQhVcMR2o/g5xS5lfW7khqXQ4sqg5PILNJFeXiEjJo4ZQP
2dcsINvlARtnu++WXMbDwsS4R7baFFFWQ3BsMoS1/O7lXNq+7aesRH5cPCFzj1XF
Q0wz4SgieylOc5VYQff2QWKWg+3gNsiaYnmJzD2lhaw9Ye0tht3Q5PYmp8g9J6Sd
QT7axwnrWq2IbEk2EOOln1C22KNXciPBEUoZ6Vt+BfXKdSKP5Bgc65+YMPX3N9Ws
Bwcmc5PmowPW2SEq1ngpU21WJhLZdoPuiimE6645T2QuNRVOzyY+3QHTmtBEzATA
O8DxQXqXzYERWC/iPa6px/QLon+6CPcuZnHEkvwajqb0yxB7hzK8YNYQgBrCcFMM
+bCJaQ7cLaC4frKoo2YMGTOteL2xG+shakapJPFsdBLnrbifFcrd
=FZ1t
-----END PGP PUBLIC KEY BLOCK-----
29 changes: 29 additions & 0 deletions opennlp-tools/LICENSE → LICENSE
@@ -200,3 +200,32 @@
WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
See the License for the specific language governing permissions and
limitations under the License.

The following license applies to the Snowball stemmers:

Copyright (c) 2001, Dr Martin Porter
Copyright (c) 2002, Richard Boulton
All rights reserved.

Redistribution and use in source and binary forms, with or without
modification, are permitted provided that the following conditions are met:

* Redistributions of source code must retain the above copyright notice,
* this list of conditions and the following disclaimer.
* Redistributions in binary form must reproduce the above copyright
* notice, this list of conditions and the following disclaimer in the
* documentation and/or other materials provided with the distribution.
* Neither the name of the copyright holders nor the names of its contributors
* may be used to endorse or promote products derived from this software
* without specific prior written permission.

THIS SOFTWARE IS PROVIDED BY THE COPYRIGHT HOLDERS AND CONTRIBUTORS "AS IS"
AND ANY EXPRESS OR IMPLIED WARRANTIES, INCLUDING, BUT NOT LIMITED TO, THE
IMPLIED WARRANTIES OF MERCHANTABILITY AND FITNESS FOR A PARTICULAR PURPOSE ARE
DISCLAIMED. IN NO EVENT SHALL THE COPYRIGHT OWNER OR CONTRIBUTORS BE LIABLE
FOR ANY DIRECT, INDIRECT, INCIDENTAL, SPECIAL, EXEMPLARY, OR CONSEQUENTIAL
DAMAGES (INCLUDING, BUT NOT LIMITED TO, PROCUREMENT OF SUBSTITUTE GOODS OR
SERVICES; LOSS OF USE, DATA, OR PROFITS; OR BUSINESS INTERRUPTION) HOWEVER
CAUSED AND ON ANY THEORY OF LIABILITY, WHETHER IN CONTRACT, STRICT LIABILITY,
OR TORT (INCLUDING NEGLIGENCE OR OTHERWISE) ARISING IN ANY WAY OUT OF THE USE
OF THIS SOFTWARE, EVEN IF ADVISED OF THE POSSIBILITY OF SUCH DAMAGE.
12 changes: 12 additions & 0 deletions NOTICE
@@ -0,0 +1,12 @@
Apache OpenNLP
Copyright 2010, 2015 The Apache Software Foundation

This product includes software developed at
The Apache Software Foundation (http://www.apache.org/).


The snowball stemmers in
opennlp-tools/src/main/java/opennlp/tools/stemmer/snowball
were developed by Martin Porter and Richard Boulton.
The full snowball package is available from
http://snowball.tartarus.org/
44 changes: 44 additions & 0 deletions opennlp-distr/README
@@ -0,0 +1,44 @@
Apache OpenNLP ${pom.version}
===============================


Building from the Source Distribution
-------------------------------------

At least Maven 3.0.0 is required for building.

To build everything execute the following command in the root folder:
mvn clean install

The results of the build will be placed in:
opennlp-distr/target/apache-opennlp-[version]-bin.tar.gz (or .zip)

What is new in Apache OpenNLP ${pom.version}
---------------------------------------

This release introduces many new features, improvements and bug fixes. The API
has been made more consistent, and the methods deprecated in 1.4 were
removed. Java 1.7 is now required.

Additionally the release contains the following noteworthy changes:

- Added evaluation support to the parser and doccat components
- Added support for the Evalita 07/09, Brat and OntoNotes corpus formats
- L-BFGS is now stable
- Added Snowball to the Stemmer package
- NameFinder now supports a user defined factory
- Added pluggable machine learning support
- Added a lemmatizer module
- Added Cluster, Document Begin and Clark feature generators to the Name Finder
- Added Liblinear as a Machine Learning addon
- Entity Linker now has a command line interface
- Added sequence classification support

A detailed list of the issues related to this release can be found in the release
notes.

Requirements
------------
Java 1.7 is required to run OpenNLP.
Maven 3.0.0 or later is required to build it.

76 changes: 76 additions & 0 deletions opennlp-distr/RELEASE_NOTES.html
@@ -0,0 +1,76 @@
<!--
***************************************************************
* Licensed to the Apache Software Foundation (ASF) under one
* or more contributor license agreements. See the NOTICE file
* distributed with this work for additional information
* regarding copyright ownership. The ASF licenses this file
* to you under the Apache License, Version 2.0 (the
* "License"); you may not use this file except in compliance
* with the License. You may obtain a copy of the License at
*
* http://www.apache.org/licenses/LICENSE-2.0
*
* Unless required by applicable law or agreed to in writing,
* software distributed under the License is distributed on an
* "AS IS" BASIS, WITHOUT WARRANTIES OR CONDITIONS OF ANY
* KIND, either express or implied. See the License for the
* specific language governing permissions and limitations
* under the License.
***************************************************************
-->

<html>
<head>
<title>Apache OpenNLP ${pom.version} Release Notes</title>
</head>
<body>
<h1>Apache OpenNLP ${pom.version} Release Notes</h1>

<h2>Contents</h2>
<p>
<a href="#what.is.opennlp">What is Apache OpenNLP?</a><br/>
<a href="#major.changes">Major Changes in this Release</a><br/>
<a href="#get.involved">How to Get Involved</a><br/>
<a href="#report.issues">How to Report Issues</a><br/>
<a href="#list.issues">List of JIRA Issues Fixed in this Release</a><br/>
</p>

<h2><a name="what.is.opennlp">1. What is Apache OpenNLP?</a></h2>
<p>
The Apache OpenNLP library is a machine learning based toolkit for the processing of natural language text.
It supports the most common NLP tasks, such as tokenization, sentence segmentation,
part-of-speech tagging, named entity extraction, chunking, parsing, and coreference resolution.
These tasks are usually required to build more advanced text processing services.
OpenNLP also includes maximum entropy and perceptron based machine learning.
</p>
<p>
The goal of the Apache OpenNLP project is to create a mature toolkit for the above mentioned tasks.
An additional goal is to provide a large number of pre-built models for a variety of languages,
as well as the annotated text resources that those models are derived from.
</p>
<h2><a name="major.changes">Major Changes in this Release</a></h2>
<p>
Please see the <a href="README">README</a> for this information.
</p>

<h2><a name="get.involved">How to Get Involved</a></h2>
<p>
The Apache OpenNLP project really needs and appreciates any contributions,
including documentation help, source code and feedback. If you are interested
in contributing, please visit <a href="http://opennlp.apache.org/">http://opennlp.apache.org/</a>
</p>

<h2><a name="report.issues">How to Report Issues</a></h2>
<p>
The Apache OpenNLP project uses JIRA for issue tracking. Please report any
issues you find at
<a href="http://issues.apache.org/jira/browse/opennlp">http://issues.apache.org/jira/browse/opennlp</a>
</p>

<h2><a name="list.issues">List of JIRA Issues Fixed in this Release</a></h2>
<p>
Click <a href="issuesFixed/jira-report.html">issuesFixed/jira-report.html</a> for the list of
issues fixed in this release.
</p>
</body>
</html>