Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

feat(aws-fsx): L2 construct for FSx for Lustre #6653

Merged
merged 10 commits into from
May 4, 2020
149 changes: 148 additions & 1 deletion packages/@aws-cdk/aws-fsx/README.md
Original file line number Diff line number Diff line change
Expand Up @@ -13,8 +13,155 @@
---
<!--END STABILITY BANNER-->

This module is part of the [AWS Cloud Development Kit](https://github.com/aws/aws-cdk) project.
[Amazon FSx](https://docs.aws.amazon.com/fsx/?id=docs_gateway) provides fully managed third-party file systems with the
native compatibility and feature sets for workloads such as Microsoft Windows–based storage, high-performance computing,
machine learning, and electronic design automation.

Amazon FSx supports two file system types: [Lustre](https://docs.aws.amazon.com/fsx/latest/LustreGuide/index.html) and
[Windows](https://docs.aws.amazon.com/fsx/latest/WindowsGuide/index.html) File Server.

## FSx for Lustre

Amazon FSx for Lustre makes it easy and cost-effective to launch and run the popular, high-performance Lustre file
system. You use Lustre for workloads where speed matters, such as machine learning, high performance computing (HPC),
video processing, and financial modeling.

The open-source Lustre file system is designed for applications that require fast storage—where you want your storage
to keep up with your compute. Lustre was built to solve the problem of quickly and cheaply processing the world's
ever-growing datasets. It's a widely used file system designed for the fastest computers in the world. It provides
submillisecond latencies, up to hundreds of GBps of throughput, and up to millions of IOPS. For more information on
Lustre, see the [Lustre website](http://lustre.org/).

As a fully managed service, Amazon FSx makes it easier for you to use Lustre for workloads where storage speed matters.
Amazon FSx for Lustre eliminates the traditional complexity of setting up and managing Lustre file systems, enabling
you to spin up and run a battle-tested high-performance file system in minutes. It also provides multiple deployment
options so you can optimize cost for your needs.

Amazon FSx for Lustre is POSIX-compliant, so you can use your current Linux-based applications without having to make
any changes. Amazon FSx for Lustre provides a native file system interface and works as any file system does with your
Linux operating system. It also provides read-after-write consistency and supports file locking.

### Installation

Import to your project:

```ts
import * as fsx from '@aws-cdk/aws-fsx';
```

### Basic Usage

Setup required properties and create:

```ts
const stack = new Stack(app, 'Stack');
const vpc = new Vpc(stack, 'VPC');

const fileSystem = new LustreFileSystem(stack, 'FsxLustreFileSystem', {
lustreConfiguration: { deploymentType: LustreDeploymentType.SCRATCH_2 },
storageCapacityGiB: 1200,
vpc,
vpcSubnet: vpc.privateSubnets[0]});
rix0rrr marked this conversation as resolved.
Show resolved Hide resolved
```

### Connecting

To control who can access the file system, use the `.connections` attribute. FSx has a fixed default port, so you don't
need to specify the port. This example allows an EC2 instance to connect to a file system:

```ts
fileSystem.connections.allowDefaultPortFrom(instance);
```

### Mounting

The LustreFileSystem Construct exposes both the DNS name of the file system as well as its mount name, which can be
used to mount the file system on an EC2 instance. The following example shows how to bring up a file system and EC2
instance, and then use User Data to mount the file system on the instance at start-up:

```ts
const app = new App();
const stack = new Stack(app, 'AwsCdkFsxLustre');
const vpc = new Vpc(stack, 'VPC');

const lustreConfiguration = {
deploymentType: LustreDeploymentType.SCRATCH_2,
};
const fs = new LustreFileSystem(stack, 'FsxLustreFileSystem', {
lustreConfiguration,
storageCapacityGiB: 1200,
vpc,
vpcSubnet: vpc.privateSubnets[0]});

const inst = new Instance(stack, 'inst', {
instanceType: InstanceType.of(InstanceClass.T2, InstanceSize.LARGE),
machineImage: new AmazonLinuxImage({
generation: AmazonLinuxGeneration.AMAZON_LINUX_2,
}),
vpc,
vpcSubnets: {
subnetType: SubnetType.PUBLIC,
},
});
fs.connections.allowDefaultPortFrom(inst);

// Need to give the instance access to read information about FSx to determine the file system's mount name.
inst.role.addManagedPolicy(ManagedPolicy.fromAwsManagedPolicyName('AmazonFSxReadOnlyAccess'));

const mountPath = '/mnt/fsx';
const dnsName = fs.dnsName;
const mountName = fs.mountName;

inst.userData.addCommands(
'set -eux',
'yum update -y',
'amazon-linux-extras install -y lustre2.10',

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

The lustre version will change over time. Maybe worth thinking how you would want to version control you cdk to meet with the lustre version you need.

Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

This is example code in a README, not managed on behalf of the user.

Looks like the current iteration of this is punting the version management to the user.

Is there a way the latest version could be automatically retrieved, which we can recommend or make available to users instead?

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I tried looking into it a little more but couldn't find any documentation about why 2.10 is being used rather than a more recent version. There's also no way to tie it to the latest version, unless amazon-linux-extras ins't used. This will at least always pick up the most recent release of 2.10.

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I got the following information about why AL2 is distributing Lustre 2.10:

FSx for Lustre currently serves only Lustre 2.10 and we only recommend Lustre 2.10 clients for our customers. The Lustre 2.10 kernel modules are baked directly into the AL2 kernel, which is why we currently only vend Lustre 2.10 in the amazon-linux-extras repo (only the Lustre 2.10 userspace tools are included in the Lustre2.10 package in the repo).
I think FSx for Lustre may be unique here. Using a Lustre 2.12 client, for example, may give you trouble when using FSx for Lustre backed by a 2.10 server.

// Set up the directory to mount the file system to and change the owner to the AL2 default ec2-user.
`mkdir -p ${mountPath}`,
`chmod 777 ${mountPath}`,
`chown ec2-user:ec2-user ${mountPath}`,
// Set the file system up to mount automatically on start up and mount it.
`echo "${dnsName}@tcp:/${mountName} ${mountPath} lustre defaults,noatime,flock,_netdev 0 0" >> /etc/fstab`,
'mount -a');
```

### Importing

An FSx for Lustre file system can be imported with `fromLustreFileSystemAttributes(stack, id, attributes)`. The
following example lays out how you could import the SecurityGroup a file system belongs to, use that to import the file
system, and then also import the VPC the file system is in and add an EC2 instance to it, giving it access to the file
system.

```ts
const app = new App();
const stack = new Stack(app, 'AwsCdkFsxLustreImport');

const sg = SecurityGroup.fromSecurityGroupId(stack, 'FsxSecurityGroup', '{SECURITY-GROUP-ID}');
const fs = LustreFileSystem.fromLustreFileSystemAttributes(stack, 'FsxLustreFileSystem', {
dnsName: '{FILE-SYSTEM-DNS-NAME}'
fileSystemId: '{FILE-SYSTEM-ID}',
securityGroup: sg
});

const vpc = Vpc.fromVpcAttributes(stack, 'Vpc', {
availabilityZones: ['us-west-2a', 'us-west-2b'],
publicSubnetIds: ['{US-WEST-2A-SUBNET-ID}', '{US-WEST-2B-SUBNET-ID}'],
vpcId: '{VPC-ID}'
});
const inst = new Instance(stack, 'inst', {
instanceType: InstanceType.of(InstanceClass.T2, InstanceSize.LARGE),
machineImage: new AmazonLinuxImage({
generation: AmazonLinuxGeneration.AMAZON_LINUX_2
}),
vpc,
vpcSubnets: {
subnetType: SubnetType.PUBLIC,
}
});
fs.connections.allowDefaultPortFrom(inst);
```

## FSx for Windows File Server

The L2 construct for the FSx for Windows File Server has not yet been implemented. To instantiate an FSx for Windows
file system, the L1 constructs can be used as defined by CloudFormation.
98 changes: 98 additions & 0 deletions packages/@aws-cdk/aws-fsx/lib/file-system.ts
Original file line number Diff line number Diff line change
@@ -0,0 +1,98 @@
import {Connections, IConnectable, ISecurityGroup, IVpc} from '@aws-cdk/aws-ec2';
import {IKey} from '@aws-cdk/aws-kms';
import {Resource} from '@aws-cdk/core';

/**
* Interface to implement FSx File Systems.
*/
export interface IFileSystem extends IConnectable {
/**
* The ID of the file system, assigned by Amazon FSx.
* @attribute
*/
readonly fileSystemId: string;
}

/**
* Properties for the FSx file system
*
* @see https://docs.aws.amazon.com/AWSCloudFormation/latest/UserGuide/aws-resource-fsx-filesystem.html
*/
export interface FileSystemProps {
/**
* The VPC to launch the file system in.
*/
readonly vpc: IVpc;

/**
* The ID of the backup. Specifies the backup to use if you're creating a file system from an existing backup.
*
* @default - no backup will be used.
*/
readonly backupId?: string;

/**
* The KMS key used for encryption to protect your data at rest.
*
* @default - the aws/fsx default KMS key for the AWS account being deployed into.
*/
readonly kmsKey?: IKey;

/**
* Security Group to assign to this file system.
*
* @default - creates new security group which allows all outbound traffic.
*/
readonly securityGroup?: ISecurityGroup;

/**
* The storage capacity of the file system being created.
* For Windows file systems, valid values are 32 GiB to 65,536 GiB.
* For SCRATCH_1 deployment types, valid values are 1,200, 2,400, 3,600, then continuing in increments of 3,600 GiB.
* For SCRATCH_2 and PERSISTENT_1 types, valid values are 1,200, 2,400, then continuing in increments of 2,400 GiB.
*/
readonly storageCapacityGiB: number;
}

/**
* A new or imported FSx file system.
*/
export abstract class FileSystemBase extends Resource implements IFileSystem {
/**
* The security groups/rules used to allow network connections to the file system.
* @attribute
*/
public abstract readonly connections: Connections;

/**
* The DNS name assigned to this file system.
* @attribute
*/
public abstract readonly dnsName: string;

/**
* The ID of the file system, assigned by Amazon FSx.
* @attribute
*/
public abstract readonly fileSystemId: string;
}

/**
* Properties that describe an existing FSx file system.
*/
export interface FileSystemAttributes {
/**
* The DNS name assigned to this file system.
*/
readonly dnsName: string;

/**
* The ID of the file system, assigned by Amazon FSx.
*/
readonly fileSystemId: string;

/**
* The security group of the file system.
*/
readonly securityGroup: ISecurityGroup;
}
4 changes: 3 additions & 1 deletion packages/@aws-cdk/aws-fsx/lib/index.ts
Original file line number Diff line number Diff line change
@@ -1,2 +1,4 @@
// AWS::FSx CloudFormation Resources:
export * from './file-system';
export * from './fsx.generated';
export * from './lustre-file-system';
export * from './maintenance-time';
Loading