    Repositories list

    • A practical approach to learning machine learning.
      Jupyter Notebook
      MIT License
      5.9k000 Updated Dec 17, 2018
    • Directory of tutorials and open-source code repositories for working with Keras, the Python deep learning library
      888100 Updated Dec 2, 2018
    • Identifying Fraudulent Automobile Insurance Claims using R
      R
      1100 Updated Oct 29, 2018
    • Detect dents and scratches in cars. Mask R-CNN for object detection and instance segmentation on Keras and TensorFlow.
      Jupyter Notebook
      MIT License
      52300 Updated Oct 18, 2018
    • ImageAI (Public)
      A Python library built to empower developers to build applications and systems with self-contained Computer Vision capabilities
      Python
      MIT License
      2.2k100 Updated Oct 2, 2018
    • A list of back-end related questions you can be inspired from to interview potential candidates, test yourself or completely ignore
      GNU General Public License v2.0
      1.9k000 Updated Sep 28, 2018
    • Answers to 120 commonly asked data science interview questions.
      1.3k100 Updated Sep 3, 2018
    • Automated Resume Screening System using Machine Learning (With Dataset)
      CSS
      200000 Updated Jun 23, 2018
    • List of resources & possible pathway for the Math of Machine Learning and AI.
      46400 Updated Apr 29, 2018
    • Open Source, Distributed, RESTful Search Engine
      Java
      Apache License 2.0
      25k000 Updated Apr 10, 2018
    • h2o-3 (Public)
      Open Source Fast Scalable Machine Learning Platform For Smarter Applications (Deep Learning, Gradient Boosting, Random Forest, Generalized Linear Modeling (Logistic Regression, Elastic Net), K-Means, PCA, Stacked Ensembles, Automatic Machine Learning (AutoML), ...)
      Java
      Apache License 2.0
      2k000 Updated Apr 9, 2018
    • presto (Public)
      Distributed SQL query engine for big data
      Java
      Apache License 2.0
      5.4k100 Updated Apr 9, 2018
    • BigDL (Public)
      BigDL: Distributed Deep Learning Library for Apache Spark
      Scala
      Apache License 2.0
      1.3k100 Updated Apr 9, 2018
    • Deep Learning for Java, Scala & Clojure on Hadoop & Spark With GPUs - From Skymind
      Java
      Apache License 2.0
      3.8k100 Updated Apr 8, 2018
    • hue (Public)
      Hue is an open source Analytics Workbench for browsing, querying and visualizing data.
      Python
      Apache License 2.0
      1.6k000 Updated Apr 8, 2018
    • luigi (Public)
      Luigi is a Python module that helps you build complex pipelines of batch jobs. It handles dependency resolution, workflow management, visualization etc. It also comes with Hadoop support built in.
      Python
      Apache License 2.0
      2.4k000 Updated Apr 8, 2018
    • keras (Public)
      Deep Learning for humans
      Python
      Other
      19k100 Updated Apr 8, 2018
    • pytorch (Public)
      Tensors and Dynamic neural networks in Python with strong GPU acceleration
      C++
      Other
      22k100 Updated Apr 8, 2018
    • pandas (Public)
      Flexible and powerful data analysis / manipulation library for Python, providing labeled data structures similar to R data.frame objects, statistical functions, and much more
      Python
      BSD 3-Clause "New" or "Revised" License
      18k100 Updated Apr 8, 2018
    • matplotlib: plotting with Python
      Python
      7.6k100 Updated Apr 8, 2018
    • xgboost (Public)
      Scalable, Portable and Distributed Gradient Boosting (GBDT, GBRT or GBM) Library, for Python, R, Java, Scala, C++ and more. Runs on single machine, Hadoop, Spark, Flink and DataFlow
      C++
      Other
      8.7k000 Updated Apr 8, 2018
    • scikit-learn: machine learning in Python
      Python
      Other
      25k000 Updated Apr 8, 2018
    • hadoop (Public)
      Mirror of Apache Hadoop
      Java
      Apache License 2.0
      8.8k000 Updated Apr 8, 2018
    • zookeeper (Public)
      Mirror of Apache Hadoop ZooKeeper
      Java
      Apache License 2.0
      7.2k000 Updated Apr 8, 2018
    • CNTK (Public)
      Microsoft Cognitive Toolkit (CNTK), an open source deep-learning toolkit
      C++
      Other
      4.3k000 Updated Apr 8, 2018
    • TensorLayer: A Deep Learning and Reinforcement Learning Library for Researchers and Engineers.
      Python
      Other
      1.6k000 Updated Apr 8, 2018
    • Computation using data flow graphs for scalable machine learning
      C++
      Apache License 2.0
      74k000 Updated Apr 8, 2018
    • Impala (Public)
      Real-time Query for Hadoop; mirror of Apache Impala
      C++
      Apache License 2.0
      807000 Updated Apr 6, 2018
    • 🐘 Elasticsearch real-time search and analytics natively integrated with Hadoop
      Java
      Apache License 2.0
      989000 Updated Apr 5, 2018
    • Theano (Public)
      Theano is a Python library that allows you to define, optimize, and evaluate mathematical expressions involving multi-dimensional arrays efficiently. It can use GPUs and perform efficient symbolic differentiation.
      Python
      Other
      2.5k000 Updated Apr 5, 2018