Skip to content

VincentRoma/spark-topics-extraction

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

7 Commits
 
 
 
 
 
 
 
 

Repository files navigation

How it works

  1. Configure HDFS input files path in config.properties (Only Parquet for now)

  2. Configure input text column in config.properties

  3. Configure HDFS output path in config.properties

  4. spark-submit --class org.opentools.extraction.ExtractTopics --master yarn --deploy-mode cluster ExtractTopics-1.0.jar

Compilation

mvn clean compile

Uber Jar

mvn compile assembly:single

References

About

No description, website, or topics provided.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages