Skip to content
@cerndb

CERN Database and Analytics Group

Popular repositories Loading

  1. dist-keras dist-keras Public archive

    Distributed Deep Learning, with a focus on distributed training, using Keras and Apache Spark.

    Python 624 169

  2. spark-dashboard spark-dashboard Public

    Spark-Dashboard is a solution for monitoring Apache Spark jobs. This repository provides the tooling and configuration for deploying an Apache Spark Performance Dashboard using containers technology.

    Dockerfile 111 23

  3. SparkPlugins SparkPlugins Public

    Code and examples of how to write and deploy Apache Spark Plugins. Spark plugins allow runnig custom code on the executors as they are initialized. This also allows extending the Spark metrics syst…

    Scala 84 15

  4. hdfs-metadata hdfs-metadata Public

    Tool for gathering blocks and replicas meta data from HDFS. It also builds a heat map showing how replicas are distributed along disks and nodes.

    Java 56 19

  5. SparkDLTrigger SparkDLTrigger Public

    Code and links to the data for the article "Machine Learning Pipelines with Modern Big DataTools for High Energy Physics"

    Jupyter Notebook 29 13

  6. grafana-mimir-cardinality-dashboards grafana-mimir-cardinality-dashboards Public

    Grafana Mimir dashboards used for cardinality exploration

    26 5

Repositories

Showing 10 of 66 repositories
  • opentelemetry-collector-contrib Public Forked from open-telemetry/opentelemetry-collector-contrib

    Contrib repository for the OpenTelemetry Collector

    cerndb/opentelemetry-collector-contrib’s past year of commit activity
    Go 0 Apache-2.0 2,382 0 0 Updated Oct 19, 2024
  • hadoop-xrootd Public

    Mirror of CERN db/hadoop-xrootd. Hadoop-XRootD Filesystem Connector

    cerndb/hadoop-xrootd’s past year of commit activity
    Java 6 Apache-2.0 3 3 1 Updated Sep 25, 2024
  • spark-dashboard Public

    Spark-Dashboard is a solution for monitoring Apache Spark jobs. This repository provides the tooling and configuration for deploying an Apache Spark Performance Dashboard using containers technology.

    cerndb/spark-dashboard’s past year of commit activity
    Dockerfile 111 Apache-2.0 23 1 0 Updated Aug 13, 2024
  • SparkDLTrigger Public

    Code and links to the data for the article "Machine Learning Pipelines with Modern Big DataTools for High Energy Physics"

    cerndb/SparkDLTrigger’s past year of commit activity
    Jupyter Notebook 29 Apache-2.0 13 0 0 Updated Jun 11, 2024
  • argo-helm Public Forked from argoproj/argo-helm

    ArgoProj Helm Charts

    cerndb/argo-helm’s past year of commit activity
    Mustache 0 Apache-2.0 1,876 0 0 Updated May 28, 2024
  • SparkTraining Public

    Material for the course "Introduction to Apache Spark APIs for Data Processing" https://sparktraining.web.cern.ch/

    cerndb/SparkTraining’s past year of commit activity
    Jupyter Notebook 12 CC-BY-4.0 5 0 0 Updated May 23, 2024
  • NotebooksExamples Public

    This repository contains Jupyter notebook examples, intended to be linked with the SWAN Gallery

    cerndb/NotebooksExamples’s past year of commit activity
    Jupyter Notebook 1 Apache-2.0 0 0 0 Updated May 16, 2024
  • SparkPlugins Public

    Code and examples of how to write and deploy Apache Spark Plugins. Spark plugins allow runnig custom code on the executors as they are initialized. This also allows extending the Spark metrics systems with user-provided monitoring probes.

    cerndb/SparkPlugins’s past year of commit activity
    Scala 84 Apache-2.0 15 3 1 Updated Apr 2, 2024
  • sparkMeasure Public

    This is a mirror of https://github.com/LucaCanali/sparkMeasure - sparkMeasure is a tool for performance troubleshooting of Apache Spark workloads. It simplifies the collection and analysis of Spark task metrics.

    cerndb/sparkMeasure’s past year of commit activity
    Scala 14 Apache-2.0 3 0 0 Updated Mar 11, 2024
  • jdbc-connector-for-apache-kafka Public Forked from Aiven-Open/jdbc-connector-for-apache-kafka

    Aiven's JDBC Sink and Source Connectors for Apache Kafka®

    cerndb/jdbc-connector-for-apache-kafka’s past year of commit activity
    Java 0 Apache-2.0 60 0 0 Updated Nov 8, 2023