Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

docs: docs for dwh cost estimates and savings #3554

Open
wants to merge 7 commits into
base: main
Choose a base branch
from
Open
Show file tree
Hide file tree
Changes from all commits
Commits
File filter

Filter by extension

Filter by extension

Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
58 changes: 58 additions & 0 deletions docs/cloud/features/costs_savings.md
Original file line number Diff line number Diff line change
@@ -0,0 +1,58 @@
# Data Warehouse Costs and Savings with Tobiko

Cloud data warehouses can be expensive, and finance departments are paying attention. When they come asking about your cloud budget, the best preparation is a granular understanding of how your project's data pipelines map to those costs.

Cloud warehouse costs are determined by how much computation occurs. Clouds report computation/cost for each executed query, but it is challenging to align every query with a specific model in your project.

Tobiko Cloud solves this problem for you. It tracks data warehouse cost estimates per model for BigQuery and Snowflake projects that use a supported pricing plan.

This granular cost information allows you to directly explain/justify your cloud warehouse spend. It also uncovers the models that could most benefit from efficiency improvements.

Beyond tracking your warehouse costs, Tobiko Cloud estimates how much money it saved you! Its advanced column-level impact analysis saves you even more money than open-source SQLMesh.

## Supported Data Warehouse Pricing Plans

Tobiko Cloud supports these cloud pricing plans:

- BigQuery [On Demand](https://cloud.google.com/bigquery/pricing#on_demand_pricing)
- Snowflake [Credits](https://docs.snowflake.com/en/user-guide/cost-understanding-compute#label-what-are-credits)

## Data Warehouse Cost Configuration

Configure Tobiko Cloud to show cost information by navigating to the Settings page.

![Image highlighting location of the Settings link in the left site navigation](./costs_savings/costs-navigation.png)

On the General settings page (1), select your pricing plan (2), enter your costs, and then save (3).
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

What are default pricing plans? Include configs in writing or screenshots for those

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I think I need a little clarification on this one - are you asking what the default values are for the pricing plans, or just what the form looks like? I could update the costs-steps.png file to include the full form with either bigquery or snowflake selected that would show the "cost per TB" or "cost per credit" field as well, or is there something else you are looking for?

Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

"full form with either bigquery or snowflake selected that would show the "cost per TB" or "cost per credit" field as well"

Can you include screenshots of both and explicitly write out the defaults? Ex: BigQuery charges $6.25 per TB processed

image

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Is this along the lines of what you were thinking?

image

Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I worry the code and documentation will get out of sync. It will be hard to remember the docs need to be updated if the defaults change.

Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Yes, these are the screenshots I was looking for. And @crericha fair point, so @caiters can you add a note along the lines of "These are the current cost defaults at the time of setup and update in Tobiko Cloud as warehouse cost defaults change"


![Annotated image showing locations of the general settings link, pricing plan form fields, and save button](./costs_savings/costs-steps.png)

For the BigQuery On Demand plan, we supply BigQuery's default cost per terabyte processed of $6.25:
![Costs form for BigQuery On-Demand](./costs_savings/costs-bigquery-on-demand.png)

For the Snowflake Credits plan, we supply Snowflake's default cost per credit of $3.00:
![Costs form for Snowflake Credits](./costs_savings/costs-snowflake-credits.png)

If your organization has negotiated a different cost, please enter that cost into the "Cost per" field instead so your organization's data warehouse cost estimates will be more accurate.

## Where to find cost and savings estimates information

Estimated costs and savings are displayed on the Tobiko Cloud homepage, production environment page, runs and plans pages, and individual model pages.

Cost information on each page will look similar to:

![Example of costs and savings data as seen on the Tobiko Cloud homepage](./costs_savings/costs-example.png)

## Savings Categories

When calculating your data warehouse costs, Tobiko Cloud also estimates how much money it saved you!

Cost savings are broken up into three main categories:
caiters marked this conversation as resolved.
Show resolved Hide resolved

- **Prevented Reruns**: If SQLMesh has already run an execution for a change in one environment, we won't need to rerun it in another environment (such as when a model change is backfilled on a development environment and then applied to prod).
- **Unaffected Downstream**: Because SQLMesh understands SQL, we know if a downstream model is not affected by an upstream change and can avoid re-execution.
- **Virtual Environments**: Using the Virtual data environments feature means new environments can be created without backfilling models within the new environment.

### Where to find cost savings information

Cost savings are shown in most of the places data warehouse costs are displayed. You can find how much Tobiko Cloud has saved you by viewing the homepage, production environment page, or individual model pages.
Loading
Sorry, something went wrong. Reload?
Sorry, we cannot display this file.
Sorry, this file is invalid so it cannot be displayed.
Loading
Sorry, something went wrong. Reload?
Sorry, we cannot display this file.
Sorry, this file is invalid so it cannot be displayed.
Loading
Sorry, something went wrong. Reload?
Sorry, we cannot display this file.
Sorry, this file is invalid so it cannot be displayed.
Loading
Sorry, something went wrong. Reload?
Sorry, we cannot display this file.
Sorry, this file is invalid so it cannot be displayed.
Binary file added docs/cloud/features/costs_savings/costs-steps.png
Loading
Sorry, something went wrong. Reload?
Sorry, we cannot display this file.
Sorry, this file is invalid so it cannot be displayed.
3 changes: 2 additions & 1 deletion mkdocs.yml
Original file line number Diff line number Diff line change
Expand Up @@ -106,9 +106,10 @@ nav:
- Cloud Features:
# - "Alerts & Notifications": cloud/features/alerts_notifications.md
- cloud/features/debugger_view.md
# - cloud/features/data_catalog.md
- cloud/features/data_catalog.md
# - cloud/features/model_freshness.md
# - cloud/features/scheduler.md
- "Cost & Savings Estimates": cloud/features/costs_savings.md
# - Observability:
# - cloud/observability/monitoring.md
# - "Measures & Dashboards": cloud/observability/measures_dashboards.md
Expand Down
Loading