Shingho is a robust, Python and Spark based statistical library designed for Big Data applications.
Special features of the Shingho statistical library:
- Multithreading capabilities for greater parallelization
- Leverages both SQL and MapReduce operations for faster processing
- Python 2.7+
- Spark 1.6+ (2.0.0+ recommended)
- Anaconda 4.3+
python setup.py --install
- Tutorials for using Shingho can be found here.
- Refer to the Developer's Guide to help you get started on contributing to our project.