Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Databricks fundamentals #80

Merged
merged 29 commits into from
Sep 6, 2024
Merged

Databricks fundamentals #80

merged 29 commits into from
Sep 6, 2024

Conversation

dfe-nt
Copy link
Collaborator

@dfe-nt dfe-nt commented Aug 29, 2024

Overview of changes

A few high level pages on some fundamental concepts in DataBricks and how it is different to how we are used to working with traditional computing.

Three new pages

  • DataBricks Fundamentals: How does DataBricks differ from usual computing (distributed and decoupled compute and storage). Covers at a high level the different kinds of storage and different kinds of compute available on the DataBricks platform.

  • Notebooks: Some high level information on what notebooks are and how they are used.

  • Workflows: Some high level information on what workflows are and how they can be used.

Why are these changes being made?

DataBricks is a bit of a paradigm shift from everything that analysts within the Department are used to. These pages attempt to start from what we're used to (traditional computers) and explain the difference of distributed cloud computing. They then give a bit of information about some of the core concepts that will be new to analysts.

Detailed description of changes

A page on how DataBricks is different from using a laptop/server for analysis, and some detail on the types of storage used and the different types of compute.

A page on notebooks and how they can be used.

A page on workflows and how they can be used.

None of these are instruction guides as such (with a bit of an exception for linking a repo to a users DataBricks workspace - this may be better suited elsewhere) but just an attempt to familiarise people with the concepts.

Links to each page added to the index page.

Issue ticket number/s and link

N/a

Checklist before requesting a review

  • I have checked the contributing guidelines
  • I have checked for and linked any relevant issues that this may resolve
  • I have checked that these changes build locally
  • I understand that if merged into main, these changes will be publicly available

@jen-machin
Copy link
Contributor

ADA/databricks_fundamentals.qmd Outdated Show resolved Hide resolved
ADA/databricks_fundamentals.qmd Outdated Show resolved Hide resolved
ADA/databricks_fundamentals.qmd Show resolved Hide resolved
ADA/databricks_fundamentals.qmd Outdated Show resolved Hide resolved
ADA/databricks_fundamentals.qmd Show resolved Hide resolved
ADA/databricks_workflows.qmd Outdated Show resolved Hide resolved
ADA/databricks_workflows.qmd Show resolved Hide resolved
ADA/databricks_workflows.qmd Outdated Show resolved Hide resolved
ADA/databricks_workflows.qmd Outdated Show resolved Hide resolved
ADA/databricks_workflows.qmd Outdated Show resolved Hide resolved
…R. Page for doing it natively in Databricks and one for RStudio.
Copy link
Contributor

@jen-machin jen-machin left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Last few comments on some of the pages I hadn't got around to reviewing yet!

ADA/databricks_workflows.qmd Outdated Show resolved Hide resolved
ADA/databricks_workflows.qmd Outdated Show resolved Hide resolved
ADA/databricks_notebooks.qmd Outdated Show resolved Hide resolved
ADA/databricks_notebooks.qmd Outdated Show resolved Hide resolved
ADA/databricks_notebooks.qmd Show resolved Hide resolved
ADA/databricks_notebooks.qmd Show resolved Hide resolved
ADA/databricks_notebooks.qmd Show resolved Hide resolved
ADA/databricks_workflow_script_databricks.qmd Show resolved Hide resolved
ADA/databricks_workflow_script_databricks.qmd Show resolved Hide resolved
ADA/databricks_workflow_script_rstudio.qmd Show resolved Hide resolved
@jen-machin jen-machin merged commit 02bed1d into main Sep 6, 2024
@jen-machin jen-machin deleted the databricks_fundamentals branch September 6, 2024 13:42
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

2 participants