Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Gov Cloud AMI #895

Closed
haviland-nh opened this issue Feb 16, 2022 · 12 comments
Closed

Gov Cloud AMI #895

haviland-nh opened this issue Feb 16, 2022 · 12 comments
Labels
cloud-aws Amazon Web Services cml-image Subcommand cml-runner Subcommand documentation Markdown files

Comments

@haviland-nh
Copy link

Hello,

Our team is looking to use CML, but can't execute the AWS provider because we're working out of gov-cloud-west-1 and the AMI isn't available. Could we help you to bring it into the region?
Appreciated,

Nearly Human Team

@haviland-nh
Copy link
Author

Or if there is a better way to do that, please let us know - happy to work with you.

@dacbd
Copy link
Contributor

dacbd commented Feb 16, 2022

It looks to be partially implemented, the option to specify an image/ami exists on the runner resource here(the underlying system)
but the option isn't exposed yet via cml?

@casperdcl casperdcl transferred this issue from iterative/terraform-provider-iterative Feb 16, 2022
@casperdcl casperdcl added cloud-aws Amazon Web Services cml-runner Subcommand documentation Markdown files labels Feb 16, 2022
@0x2b3bfa0
Copy link
Member

👋🏼 Hello! We don't have access to AWS GovCloud regions, but you can just copy our AMI to your region and CML will should detect it automatically.

aws ec2 copy-image \
  --region gov-cloud-west-1 \
  --source-region us-east-1 \
  --source-image-id ami-0a8e187584f56101e
  --name iterative-cml

@0x2b3bfa0
Copy link
Member

It looks to be partially implemented, the option to specify an image/ami exists on the runner resource here(the underlying system)
but the option isn't exposed yet via cml?

CML searches for the default image using the name, so it should be enough to copy the AMI from a public region. 🙈

@0x2b3bfa0
Copy link
Member

can't execute the AWS provider

We're supposed to fallback to the official Ubuntu 18.04 AMI if the CML one is not available. Didn't it work automatically? 🤔

@haviland-nh
Copy link
Author

haviland-nh commented Feb 17, 2022 via email

@haviland-nh
Copy link
Author

haviland-nh commented Feb 17, 2022

Hello just a few things after attempting to try these items out

  • I can't copy the AMI into the region us-gov-west-1. The aws cli outputs the error below
aws ec2 copy-image --region us-gov-west-1 --source-region us-east-1   --source-image-id ami-0a8e187584f56101e   --name iterative-cml

An error occurred (InvalidRequest) when calling the CopyImage operation: Copy image not allowed from specified region.
  • Ubuntu 18 can't fall back because the image isn't being found. I believe the owner id is different.
aws ec2 describe-images --filters "Name=name,Values=ubuntu/images/hvm-ssd/ubuntu-bionic-18.04*"
{
    "Images": [
        {
            "Architecture": "arm64",
            "CreationDate": "2021-06-04T21:39:19.000Z",
            "ImageId": "ami-001a1191bd4ab2d1a",
            "ImageLocation": "aws-marketplace/ubuntu/images/hvm-ssd/ubuntu-bionic-18.04-arm64-server-20210604-10d15682-9d0b-41dd-93c2-5e7de4f74165",
            "ImageType": "machine",
            "Public": true,
            "OwnerId": "345084742485",
            "PlatformDetails": "Linux/UNIX",
            "UsageOperation": "RunInstances",
  • Our runner is hitting the error
    Creating...\n\n\t╷\n│ Error: Failed creating the machine: Nor iterative-cml nor Ubuntu Server 18.04 are available in your region

@dacbd
Copy link
Contributor

dacbd commented Feb 17, 2022 via email

@haviland-nh
Copy link
Author

haviland-nh commented Feb 17, 2022 via email

@DavidGOrtega DavidGOrtega added the cml-image Subcommand label Feb 17, 2022
@haviland-nh
Copy link
Author

That worked, just a heads up to those who might run into this, we needed to copy without an encrypted snapshot, but we successfully ran a training after following that last idea @dacbd, thank you!

Just to tie a bow in this discussion, in our trial and error, we attempted to use our existing EKS cluster with the kubernetes provider implementation, which is a nice option and maybe even preferable longer term. The documentation on the mechanics for Kubernetes is a bit small and we were wondering where to go to ask more about this topic?
Thanks again,

Nearly Human team

@0x2b3bfa0
Copy link
Member

The documentation on the mechanics for Kubernetes is a bit small and we were wondering where to go to ask more about this topic?

Yes, documentation is still a bit scarce. 🙈 Feel free to ask in the #cml channel on our Discord server.

@dacbd
Copy link
Contributor

dacbd commented Oct 17, 2022

I feel like the only remaining action point is some documentation on using cml runner with k8s iterative/cml.dev#343

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
cloud-aws Amazon Web Services cml-image Subcommand cml-runner Subcommand documentation Markdown files
Projects
None yet
Development

No branches or pull requests

5 participants