Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Feature/1416 S3 input/output for Enceladus on EMR (S2.4, H2.8.5) #1483

Merged
merged 7 commits into from
Aug 24, 2020

Commits on Aug 13, 2020

  1. Configuration menu
    Copy the full SHA
    60d2948 View commit details
    Browse the repository at this point in the history
  2. #1416 - enceladus on S3:

     - all directly-hdfs touching stuff disabled (atum, performance measurements, info files, output path checking)
    
    # Add menasfargate into hosts
    sudo nano /etc/hosts
    # paste
    20.0.63.69 menasfargate
    # save & exit (ctrl+O, ctrl+X)
    
    # Running standardization works via:
    spark-submit --class za.co.absa.enceladus.standardization.StandardizationJob --conf "spark.driver.extraJavaOptions=-Dmenas.rest.uri=http://menasfargate:8080 -Dstandardized.hdfs.path=s3://euw1-ctodatadev-dev-bigdatarnd-s3-poc/enceladusPoc/ao-hdfs-data/stdOutput/standardized-{0}-{1}-{2}-{3}" ~/enceladusPoc/spark-jobs-2.11.0-SNAPSHOT.jar --menas-credentials-file ~/enceladusPoc/menas-credentials.properties --dataset-name dk_test1_emr285 --raw-format json --dataset-version 1 --report-date 2019-11-27 --report-version 1 2> ~/enceladusPoc/stderr.txt
    dk1844 committed Aug 13, 2020
    Configuration menu
    Copy the full SHA
    d41cc7e View commit details
    Browse the repository at this point in the history
  3. #1416 - enceladus on S3 - (crude) conformance works on s3 (s3 std inp…

    …ut, s3 conf output)
    
     0- all directly-hdfs touching stuff disabled (atum, performance measurements, info files, output path checking)
    
    # Add menasfargate into hosts
    sudo nano /etc/hosts
    # paste
    20.0.63.69 menasfargate
    # save & exit (ctrl+O, ctrl+X)
    
    # Running conformance works via:
    spark-submit --class za.co.absa.enceladus.conformance.DynamicConformanceJob --conf "spark.driver.extraJavaOptions=-Dmenas.rest.uri=http://menasfargate:8080 -Dstandardized.hdfs.path=s3://euw1-ctodatadev-dev-bigdatarnd-s3-poc/enceladusPoc/ao-hdfs-data/stdOutput/standardized-{0}-{1}-{2}-{3}" ~/enceladusPoc/spark-jobs-2.11.0-SNAPSHOT.jar --menas-credentials-file ~/enceladusPoc/menas-credentials.properties --dataset-name dk_test1_emr285 --dataset-version 1 --report-date 2019-11-27 --report-version 1 2> ~/enceladusPoc/conf-log.txt
    dk1844 committed Aug 13, 2020
    Configuration menu
    Copy the full SHA
    67e4012 View commit details
    Browse the repository at this point in the history
  4. ref issue = 1416

    dk1844 committed Aug 13, 2020
    Configuration menu
    Copy the full SHA
    ac0785d View commit details
    Browse the repository at this point in the history
  5. Configuration menu
    Copy the full SHA
    a8d53f9 View commit details
    Browse the repository at this point in the history

Commits on Aug 17, 2020

  1. PR updates

    dk1844 committed Aug 17, 2020
    Configuration menu
    Copy the full SHA
    85570d7 View commit details
    Browse the repository at this point in the history

Commits on Aug 24, 2020

  1. Merge branch 'aws-poc' into feature/1416-aws-emr-poc

    % Conflicts:
    %	pom.xml
    dk1844 committed Aug 24, 2020
    Configuration menu
    Copy the full SHA
    0bd704c View commit details
    Browse the repository at this point in the history