- Text extraction from PDFs, Microsoft Word DOCX and text-based formats
- Disk-persistence of inverted index
- Validation of inverted index
- Command-line utility
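
For readers new to the term, the inverted index mentioned above can be pictured as a map from each token to the set of documents containing it. The following is only a minimal sketch of that idea, not this project's implementation: the class name InvertedIndexSketch, its methods, and the tokenization rule (lowercase, split on anything that is not a letter or digit) are all assumptions made for illustration.

```java
import java.util.Collections;
import java.util.HashMap;
import java.util.Locale;
import java.util.Map;
import java.util.Set;
import java.util.TreeSet;

// Hypothetical illustration only; not the project's actual classes.
public class InvertedIndexSketch {

    // token -> sorted set of IDs of the documents that contain it
    private final Map<String, Set<Integer>> postings = new HashMap<>();

    // Tokenize the text and record docId under every token found.
    public void add(int docId, String text) {
        // Assumed tokenization: lowercase, split on non-letter/non-digit runs.
        for (String token : text.toLowerCase(Locale.ROOT).split("[^\\p{L}\\p{N}]+")) {
            if (!token.isEmpty()) {
                postings.computeIfAbsent(token, k -> new TreeSet<>()).add(docId);
            }
        }
    }

    // Return the IDs of documents containing the term (empty set if none).
    public Set<Integer> search(String term) {
        return postings.getOrDefault(term.toLowerCase(Locale.ROOT), Collections.emptySet());
    }

    public static void main(String[] args) {
        InvertedIndexSketch index = new InvertedIndexSketch();
        index.add(1, "The quick brown fox");
        index.add(2, "A quick search over text");
        System.out.println(index.search("quick")); // prints [1, 2]
        System.out.println(index.search("fox"));   // prints [1]
    }
}
```

A lookup is then a single map access per query term, which is why this structure suits full-text search; persistence and validation would operate on the postings map.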
Make sure Java is installed on your system, with JAVA_HOME
pointing to a JDK installation.
Clone the project from the GitHub repository and build it with the gradlew script in the root of the repository,
$> git clone https://github.com/shubham0204/full-text-search
$> cd full-text-search
$> ./gradlew build
To execute tests,
$> ./gradlew test
To build the fat/uber JAR,
$> ./gradlew shadowJar
$> java -jar fulltextsearch.jar index build [dir]
$> java -jar fulltextsearch.jar index info [dir]
$> java -jar fulltextsearch.jar index rm [dir]
Use fulltextsearch index --help for a description of each command.
$> java -jar fulltextsearch.jar query [dir]