Enable EMR cluster configuration through the InstanceGroup param #1071

Merged · 6 commits · Aug 24, 2017

Conversation

@apetresc (Contributor) commented Jul 6, 2017

This fixes #869 and #389.

EMR has three ways to configure cluster resources: the coreInstanceCount/coreInstanceType params, the InstanceGroup structure, and the InstanceFleet structure (which only exists in very new EMR releases). At the moment, Terraform only supports the first of those (coreInstanceCount). This is extremely limiting; among other things, it doesn't allow the user to set EBS options or use spot prices.

This commit adds support for InstanceGroup to aws_emr_cluster resources, and makes the masterInstanceCount optional since there is now an alternative.

I have not added any tests yet, but manual testing indicates all the new features work as expected. Existing unit tests continue to pass.
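
For illustration, here is a sketch of the kind of configuration this enables (the instance_group attribute names are the ones added by this PR; the instance types, counts, and bid price are placeholder values):

  resource "aws_emr_cluster" "example" {
    # ... other cluster arguments omitted ...

    instance_group {
      instance_role  = "MASTER"
      instance_type  = "m3.xlarge"
      instance_count = 1
    }

    instance_group {
      instance_role  = "CORE"
      instance_type  = "m3.xlarge"
      instance_count = 2
      bid_price      = "0.05"    # spot pricing, previously not expressible

      # EBS options, previously not expressible
      ebs_config {
        size = 100
        type = "gp2"
      }
    }
  }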

apetresc added 2 commits July 6, 2017 11:25
apetresc added a commit to apetresc/terraform that referenced this pull request Jul 10, 2017
@radeksimko radeksimko added the enhancement Requests to existing resources that expand the functionality or scope. label Jul 20, 2017
@jennyfountain

+2!!

@synhershko

+1

@fosskers (Contributor)

If anything needs a merge, it's this PR.

@fosskers (Contributor)

I've just tested and confirmed that this works. Instead of, say:

  master_instance_type = "m3.xlarge"
  core_instance_type   = "m3.xlarge"
  core_instance_count  = 2               

We can use:

  instance_group {
    bid_price = "0.05"
    instance_count = 1
    instance_role = "MASTER"
    instance_type = "m3.xlarge"
  }

  instance_group {
    bid_price = "0.05"
    instance_count = 2
    instance_role = "CORE"
    instance_type = "m3.xlarge"
  }

Results on EMR: (screenshot)

Note: the aws_emr_cluster resource documentation will have to be updated as well.

@jennyfountain

What about disk size?

@fosskers (Contributor) commented Jul 26, 2017

There seem to be more options than what I used here; check the PR's diff for everything that's available.

Otherwise, disk size of the instances is controlled by the instance_type, no?

@fosskers (Contributor)

Also, how do these changes relate to the existing aws_instance_group resource?

@apetresc (Contributor, Author)

@jennyfountain: Disk size is controlled through the ebs_config block. So something like:

  instance_group {
    bid_price      = "0.05"
    instance_count = 2
    instance_role  = "CORE"
    instance_type  = "m3.xlarge"

    ebs_config {
      size = 200
      type = "gp2"
    }
  }

@grubernaut (Contributor)

Hey @apetresc, thank you a million for this PR! Awesome contribution.
However, would you be able to add an additional acceptance test to the aws_emr_cluster tests, using this new configuration? Feel free to ping me afterwards, and I'll prioritize a review.

@apetresc (Contributor, Author)

@grubernaut Yes, absolutely :) I'm on vacation until the end of next week, but I'll add those tests as soon as I'm back.

@apetresc (Contributor, Author)

Hey @grubernaut, sorry for the delay! As requested, I've added an acceptance test for the instance_group field. The test passes successfully on my account. A few notes:

  • I opted for a single instance_group test with all of the different moving parts enabled at once (bid_price, ebs_config, etc.), instead of a separate test exercising each capability. I figure that since these tests cost literal money to run, this is the more prudent approach. Let me know if you want them broken up a bit.
  • I'm not doing many actual attribute checks in the test, because I can't figure out how I'm supposed to determine the index of a particular entry in a TypeSet. I tried looking for examples in other resource tests, but they all either hardcode it (which seems super-fragile to me) or avoid it altogether and only test TypeList fields. Any advice on how to do this properly? Do I just need to write a helper function that iterates over the resource?

Thanks!

@apetresc (Contributor, Author)

Oh, one more note:

  • I didn't run the full acceptance test suite to make sure I'm not introducing a regression, since I can't really justify the 💰 cost to my company. I assume your CI servers will run it and report any failures, right? :)

@grubernaut (Contributor)

@apetresc, yup, not a problem! Pulling down your branch now to run through the test suite :)

@grubernaut (Contributor)

$ make testacc TEST=./aws TESTARGS="-run=TestAccAWSEMRCluster"                                
==> Checking that code complies with gofmt requirements...                                    
TF_ACC=1 go test ./aws -v -run=TestAccAWSEMRCluster -timeout 120m                             
=== RUN   TestAccAWSEMRCluster_basic           
--- PASS: TestAccAWSEMRCluster_basic (597.51s) 
=== RUN   TestAccAWSEMRCluster_instance_group  
--- PASS: TestAccAWSEMRCluster_instance_group (643.15s)                                       
=== RUN   TestAccAWSEMRCluster_security_config 
--- PASS: TestAccAWSEMRCluster_security_config (617.01s)                                      
=== RUN   TestAccAWSEMRCluster_bootstrap_ordering                                             
--- PASS: TestAccAWSEMRCluster_bootstrap_ordering (612.38s)                                   
=== RUN   TestAccAWSEMRCluster_terminationProtected                                           
--- PASS: TestAccAWSEMRCluster_terminationProtected (600.88s)                                 
=== RUN   TestAccAWSEMRCluster_visibleToAllUsers                                              
--- PASS: TestAccAWSEMRCluster_visibleToAllUsers (528.26s)                                    
=== RUN   TestAccAWSEMRCluster_s3Logging       
--- PASS: TestAccAWSEMRCluster_s3Logging (1174.00s)                                           
=== RUN   TestAccAWSEMRCluster_tags            
--- PASS: TestAccAWSEMRCluster_tags (645.54s)  
PASS                                           
ok      github.com/terraform-providers/terraform-provider-aws/aws       5418.745s

@grubernaut (Contributor) left a review comment

Awesome work here! Just needs documentation and it's ready to go!

@apetresc (Contributor, Author)

Great. I guess this just means documenting the new fields in website/docs/r/emr_cluster.html.md, right? I can put that together very quickly.

@grubernaut (Contributor)

@apetresc yup exactly!

@apetresc (Contributor, Author)

@grubernaut Done!

@grubernaut (Contributor) left a review comment

LGTM, thanks!

@grubernaut grubernaut merged commit 930a682 into hashicorp:master Aug 24, 2017
@grubernaut (Contributor)

@apetresc should be released with the next release of the AWS provider, which is v1.0.0, thanks!

@apetresc apetresc deleted the emr-improvements branch August 24, 2017 17:30
@dthauvin commented Sep 7, 2017

Hi @apetresc
I looked at the source code of this PR and I did not find anything about AutoScalingPolicy for the CORE or TASK instance groups.
Does your PR support an instance group AutoScalingPolicy?
Implementing a scaling policy for EMR instance groups could fix #1147 and #713.

What about the aws_emr_instance_group Terraform resource? Is it now deprecated?

Thanks for your contribution!

nbaztec pushed a commit to nbaztec/terraform-provider-aws that referenced this pull request Sep 26, 2017
Enable EMR cluster configuration through the InstanceGroup param
@vishnuravi3186 commented Sep 26, 2017

Is there a way we can specify the root volume size for both master and core instances? Thanks for all the contributions; ebs_config is working perfectly.

@apetresc (Contributor, Author)

@vishnuravi3186 Yes, just create multiple instance_group entries, one with instance_role = "CORE" and one with instance_role = "MASTER", each with its own ebs_config block. See @fosskers's comment above for an example.
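
Roughly like this (a sketch only, reusing the attribute names from the examples above; the sizes and instance types are placeholders):

  instance_group {
    instance_role  = "MASTER"
    instance_type  = "m3.xlarge"
    instance_count = 1

    ebs_config {
      size = 100
      type = "gp2"
    }
  }

  instance_group {
    instance_role  = "CORE"
    instance_type  = "m3.xlarge"
    instance_count = 2

    ebs_config {
      size = 200
      type = "gp2"
    }
  }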

@vishnuravi3186 commented Sep 26, 2017

@apetresc Thanks for the quick reply. Yep, I implemented it the same way, and the EBS volume attached to each node has the size I specified. My question is about the root volume, which defaults to 10 GB. Is there a way we can increase it? In the Amazon UI we can set a root volume size of up to 100 GB, and I saw somewhere that a feature has been added to implement an EBS root volume size.

Example:

ebs_root_volume_size = 100

Let me know if this can be enabled.
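
If the provider does pick this up, it would presumably be a top-level argument on aws_emr_cluster rather than part of ebs_config; a purely hypothetical sketch:

  resource "aws_emr_cluster" "example" {
    # ... other cluster configuration ...

    # hypothetical top-level argument for the root EBS volume size, in GiB
    ebs_root_volume_size = 100
  }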

@apetresc (Contributor, Author)

Ooh sorry, you're right, I misunderstood.

Let me take a look at the API docs and see how easy this would be to implement...

@vishnuravi3186

@apetresc For your reference e6166eb

@vishnuravi3186

@apetresc Any updates on this feature?

@ghost commented Apr 11, 2020

I'm going to lock this issue because it has been closed for 30 days ⏳. This helps our maintainers find and focus on the active issues.

If you feel this issue should be reopened, we encourage creating a new issue linking back to this one for added context. Thanks!

@ghost ghost locked and limited conversation to collaborators Apr 11, 2020