title | parent | description | thumbnail | author | tags | category | date | mrm | xredirect | slug | |||||
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
Deploy a modern data lake on OCI |
|
Using Terraform to create a data lake on Oracle Cloud Infrastructure. |
assets/datalakeocichart_4481-9331b237844018b1.png |
Ali Mukadam |
|
clouddev |
2021-10-29 12:00 |
WWMK211125P00021 |
create-modern-data-lake-oci-readme |
{% imgx aligncenter assets/datalakeocichart_4481-9331b237844018b1.png "OCI Datalake Architecture Overview" %}
What is a data lake? Simply, a data lake is a place to store both your structured and unstructured data. It's also a great method for organizing large volumes of diverse data from diverse sources.
In this article, we'll guide you through deploying a data lake in OCI and quickly get you up and running so you can explore its many benefits!
For more information, see:
- Signing Up for Oracle Cloud Infrastructure
- Getting started with Terraform
- Getting started with OCI Cloud Shell
- What is a data lake?
In order to successfully complete this tutorial, you'll need:
- An Oracle Cloud Infrastructure (OCI) Free Tier account. [Start for Free]({{ site.urls.always_free }}).
- A MacOS, Linux, or Windows computer with
ssh
support installed. - Access to the OCI Cloud Shell - It provides a great platform for quickly working with Terraform as well as a host of other OCI interfaces and tools.
- The OCI Resource Manager (ORM) - This Quick Start uses the ORM to make deployment easy.
- The ORM stack - Select the button below to download the
master.zip
file:
After logging into the console you'll be taken through the same steps described in the Deploy section below.
NOTE: If you use this template to create another repo you'll need to change the link for the button to point at your repo. {:.notice}
Make sure your credentials are defined in $HOME/.oci/config
file since Terraform takes takes the default value from the .oci/config
file.
For example:
user=ocid1.user.oc1..aaaaaxxxwf3a \
fingerprint=de:50:15:13:...:d6 \
key_file=/Users/shadab/.oci/oci_api_key.pem \
tenancy=ocid1.tenancy.oc1..aaaaaaaa2txfa \
compartment=ocid1.compartment.oc1..aaaa5pti7sq \
region=us-ashburn-1
git clone https://github.com/oracle-quickstart/oci-datalake && cd oci-datalake
Initialize the Terraform provider for OCI and Random:
terraform init
terraform plan -var-file=config.tfvars -out oci_datalake.out
terraform apply "oci_datalake.out"
terraform destroy -var-file=config.tfvars
-
Import the stack - Log in to OCI to import the stack:
Home > Solutions & Platform > Resource Manager > Stacks > Create Stack
-
Upload stack - Upload the
master.zip
and provide a name and description for the stack:{% imgx aligncenter assets/datalakeoci_4a6e6eb3_bbfb_d66353a189bc.png "Create Stack Dialog" %}
-
Configure the Stack - The UI will present the variables to the user dynamically, based on their selections.