# GermaNER - Free Open German Named Entity Recognition Tool

GermaNER is licensed under ASL 2.0 and other lenient licenses, allowing its use for academic and commercial purposes without restrictions.

## GermaNER in three lines

To tag German texts:

  1. Download the binary from here or, if you don't have enough memory, use GermaNER without Freebase features from [here](https://github.com/tudarmstadt-lt/GermaNER/releases/download/germaNER0.9.1/GermaNER-nofb-09-09-2015.jar).
  2. Tokenize your text so that there is one token per line. Separate sentences with a blank line. Read the details [here](https://github.com/tudarmstadt-lt/GermaNER/blob/master/germaner/src/main/java/de/tu/darmstadt/lt/ner/doc/File-Format.md).
  3. Run the jar file as follows (see details here):

java -Xmx4g -jar GermaNER-09-09-2015.jar -t YourTokenizedTestFile -o OutputFileName

OR (if you have less memory)

java -Xmx1300m -jar GermaNER-nofb-09-09-2015.jar -t YourTokenizedTestFile -o OutputFileName

The tagged document will be written to output/result.tsv.
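
The input format from step 2 and the command from step 3 can be sketched in a few lines of Python. This is only an illustration, not part of GermaNER: the helper names `to_one_token_per_line` and `build_command` are hypothetical, the regex tokenizer is deliberately naive (it will split abbreviations such as `z.B.`), and sentence splitting is assumed to have been done already. The linked file format description remains the authoritative reference.

```python
import re

# Naive tokenizer (an assumption for illustration, not GermaNER's own):
# a token is a run of word characters or a single punctuation symbol.
TOKEN_RE = re.compile(r"\w+|[^\w\s]")


def to_one_token_per_line(sentences):
    """Render pre-split sentences as one token per line,
    with a blank line between sentences."""
    blocks = []
    for sentence in sentences:
        tokens = TOKEN_RE.findall(sentence)
        if tokens:  # skip empty or whitespace-only sentences
            blocks.append("\n".join(tokens))
    return "\n\n".join(blocks) + "\n" if blocks else ""


def build_command(jar, test_file, output_file, max_heap="4g"):
    """Build the java call from step 3 as an argument list.
    Only the -Xmx, -jar, -t and -o options shown above are used."""
    return ["java", f"-Xmx{max_heap}", "-jar", jar,
            "-t", test_file, "-o", output_file]


if __name__ == "__main__":
    sentences = [
        "Angela Merkel besucht Berlin.",
        "Sie trifft Vertreter der TU Darmstadt.",
    ]
    print(to_one_token_per_line(sentences))
    print(" ".join(build_command(
        "GermaNER-nofb-09-09-2015.jar",
        "YourTokenizedTestFile",
        "OutputFileName",
        max_heap="1300m",
    )))
```

Write the returned string to a file and pass its name as `-t`. The argument list from `build_command` can be handed to `subprocess.run` if you want to start the tagger from Python.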

Contents