CGCloud deploy docs #1279

jpdna · 2016-11-17T18:49:42Z

No description provided.

fnothaft

Couple of small nits, otherwise looks great! Thanks @jpdna!

fnothaft · 2016-11-17T18:59:39Z

docs/source/40_deploying_ADAM.md

+
+#### Launch a cluster
+
+Spin up a Spark cluster with one master and two slave nodes with the command:


Prefer leader/worker to master/slave.

Also, I would note in the documents that you're setting up a cluster where the workers are m3.large. Somewhat obvious, I concede, but it's useful to note that you can set a different leader node type. Also, doesn't this command need you to provide a cluster name?

fnothaft · 2016-11-17T18:59:55Z

docs/source/40_deploying_ADAM.md

-export MY_KEYFILE="?????.pem"
-export MY_CLUSTER_NAME="adam_cluster"
-export MY_CLUSTER_SIZE=10
+[CGCloud](https://github.com/BD2KGenomics/cgcloud) lets you automate the creation, management and provisioning of VMs and clusters of VMs in Amazon EC2.


Can you wrap lines at 80 characters throughout?

fnothaft · 2016-11-17T19:00:23Z

docs/source/40_deploying_ADAM.md

+```
+cgcloud ssh spark-master
+```
+


Nit: extra whitespace.

fnothaft · 2016-11-17T19:00:54Z

docs/source/40_deploying_ADAM.md

-Export the path to your `spark-ec2` script,
+To use the ADAM application on top of Spark, we need to download and install ADAM on `spark-master`
+From the command line on `spark-master` download a release from:
+https://github.com/bigdatagenomics/adam/releases


Nit: missing period at EOL.

fnothaft · 2016-11-17T19:02:13Z

docs/source/40_deploying_ADAM.md

-alias spark_ec2_login="$SPARK_EC2_SCRIPT -k $MY_KEYPAIR -i $MY_KEYFILE login $MY_CLUSTER_NAME"
+The typical flow of data to and from your ADAM application on EC2 will be:
+- Upload data to AWS S3
+- Use Conductor (described below) or otherwise transfer from S3 to the HDFS on your cluster


Can you add an anchor link {#conductor} in the section where conductor is described, and link from here (described below) -> [(described below)](#conductor). This'll make navigation a bit easier.

fnothaft · 2016-11-17T19:03:07Z

docs/source/40_deploying_ADAM.md

+To transfer large amounts of data back and forth from S3, we suggest using [Conductor](https://github.com/BD2KGenomics/conductor).
+
+Its also possible to directly use AWS S3 as a distributed file system, but with some loss of performance.
+( example to be added )


Nit: I might drop the example to be added bit and remove the paragraph break between this paragraph and the conductor paragraph.

AmplabJenkins · 2016-11-17T19:31:13Z

Test PASSed.
Refer to this link for build results (access rights to CI server needed):
https://amplab.cs.berkeley.edu/jenkins//job/ADAM-prb/1622/
Test PASSed.

jpdna · 2016-11-17T20:45:57Z

ready for further review or merge

AmplabJenkins · 2016-11-17T20:47:11Z

Test PASSed.
Refer to this link for build results (access rights to CI server needed):
https://amplab.cs.berkeley.edu/jenkins//job/ADAM-prb/1623/
Test PASSed.

AmplabJenkins · 2016-11-17T21:06:35Z

Test PASSed.
Refer to this link for build results (access rights to CI server needed):
https://amplab.cs.berkeley.edu/jenkins//job/ADAM-prb/1624/
Test PASSed.

AmplabJenkins · 2016-11-17T21:26:33Z

Test PASSed.
Refer to this link for build results (access rights to CI server needed):
https://amplab.cs.berkeley.edu/jenkins//job/ADAM-prb/1625/
Test PASSed.

AmplabJenkins · 2016-11-17T21:46:49Z

Test PASSed.
Refer to this link for build results (access rights to CI server needed):
https://amplab.cs.berkeley.edu/jenkins//job/ADAM-prb/1626/
Test PASSed.

fnothaft

Few small nits, otherwise LGTM!

fnothaft · 2016-11-17T21:49:42Z

docs/source/40_deploying_ADAM.md

-alias spark_ec2_destroy="$SPARK_EC2_SCRIPT destroy $MY_CLUSTER_NAME"
-alias spark_ec2_login="$SPARK_EC2_SCRIPT -k $MY_KEYPAIR -i $MY_KEYFILE login $MY_CLUSTER_NAME"
+Spin up a Spark cluster named `cluster1` with one leader and two workers nodes 
+of instance type `m3.large`with the command:


Space between words in m3.largewith

fnothaft · 2016-11-17T21:50:13Z

docs/source/40_deploying_ADAM.md

+#### Install ADAM
+
+To use the ADAM application on top of Spark, we need to download and install 
+ADAM on `spark-master`


period at EOL

fnothaft · 2016-11-17T21:50:29Z

docs/source/40_deploying_ADAM.md

+To use the ADAM application on top of Spark, we need to download and install 
+ADAM on `spark-master`
+From the command line on `spark-master` download a release from:
+https://github.com/bigdatagenomics/adam/releases


Punctuation at EOL? Maybe remove paragraph break.

fnothaft · 2016-11-17T21:50:46Z

docs/source/40_deploying_ADAM.md

+As of this writing, CGCloud supports Spark 1.6.2, not Spark 2.x, so download
+the Spark 1.x Scala2.10 release:
+```
+wget https://repo1.maven.org/maven2/org/bdgenomics/adam/\


I would remove the \ed linebreak here.

AmplabJenkins · 2016-11-17T22:05:55Z

Test PASSed.
Refer to this link for build results (access rights to CI server needed):
https://amplab.cs.berkeley.edu/jenkins//job/ADAM-prb/1627/
Test PASSed.

cgcloud doc edits edits to cgcloud docs more cgcloud edits more cgcloud docs edits more cgcloud docs edits edit cgcloud docs more cgcloud doc edits

jpdna · 2016-11-17T23:10:43Z

ready again for more review or merge

AmplabJenkins · 2016-11-17T23:32:08Z

Test PASSed.
Refer to this link for build results (access rights to CI server needed):
https://amplab.cs.berkeley.edu/jenkins//job/ADAM-prb/1628/
Test PASSed.

fnothaft · 2016-11-18T05:17:51Z

Merged! Thanks @jpdna!

jpdna changed the title ~~CGCloud deply docs~~ CGCloud deploy docs Nov 17, 2016

fnothaft requested changes Nov 17, 2016

View reviewed changes

jpdna force-pushed the cgcloud_doc branch 4 times, most recently from e8bed96 to 4d4ab71 Compare November 17, 2016 20:43

jpdna force-pushed the cgcloud_doc branch from 4d4ab71 to f887597 Compare November 17, 2016 21:06

fnothaft requested changes Nov 17, 2016

View reviewed changes

Add CGCloud deploy doc

fdfee7c

cgcloud doc edits edits to cgcloud docs more cgcloud edits more cgcloud docs edits more cgcloud docs edits edit cgcloud docs more cgcloud doc edits

jpdna force-pushed the cgcloud_doc branch from f887597 to fdfee7c Compare November 17, 2016 23:08

fnothaft approved these changes Nov 18, 2016

View reviewed changes

fnothaft merged commit 20a0eb2 into bigdatagenomics:master Nov 18, 2016

fnothaft mentioned this pull request Nov 18, 2016

Update usage docs running for EC2 and CDH #493

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

CGCloud deploy docs #1279

CGCloud deploy docs #1279

jpdna commented Nov 17, 2016

fnothaft left a comment

fnothaft Nov 17, 2016

fnothaft Nov 17, 2016

fnothaft Nov 17, 2016

fnothaft Nov 17, 2016

fnothaft Nov 17, 2016

fnothaft Nov 17, 2016

AmplabJenkins commented Nov 17, 2016

jpdna commented Nov 17, 2016

AmplabJenkins commented Nov 17, 2016

AmplabJenkins commented Nov 17, 2016

AmplabJenkins commented Nov 17, 2016

AmplabJenkins commented Nov 17, 2016

fnothaft left a comment

fnothaft Nov 17, 2016

fnothaft Nov 17, 2016

fnothaft Nov 17, 2016

fnothaft Nov 17, 2016

AmplabJenkins commented Nov 17, 2016

jpdna commented Nov 17, 2016

AmplabJenkins commented Nov 17, 2016

fnothaft commented Nov 18, 2016


		#### Launch a cluster

		Spin up a Spark cluster with one master and two slave nodes with the command:

CGCloud deploy docs #1279

CGCloud deploy docs #1279

Conversation

jpdna commented Nov 17, 2016

fnothaft left a comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

AmplabJenkins commented Nov 17, 2016

jpdna commented Nov 17, 2016

AmplabJenkins commented Nov 17, 2016

AmplabJenkins commented Nov 17, 2016

AmplabJenkins commented Nov 17, 2016

AmplabJenkins commented Nov 17, 2016

fnothaft left a comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

AmplabJenkins commented Nov 17, 2016

jpdna commented Nov 17, 2016

AmplabJenkins commented Nov 17, 2016

fnothaft commented Nov 18, 2016