Workshop: Maximizing Confidence in Your Data Model Changes with dbt and PipeRider

To learn how to use PipeRider together with dbt for detecting changes in model and data, sign up for a workshop

Homework

The following questions follow on from the original Week 4 homework, and so use the same data as required by those questions:

Yellow taxi data - Years 2019 and 2020 Green taxi data - Years 2019 and 2020 fhv data - Year 2019.

What is the distribution between vendor id filtering by years 2019 and 2020 data?

You will need to run PipeRider and check the report

What is the composition of total amount (positive/zero/negative) filtering by years 2019 and 2020 data?

You will need to run PipeRider and check the report

What is the numeric statistics (average/standard deviation/min/max/sum) of trip distances filtering by years 2019 and 2020 data?

You will need to run PipeRider and check the report

Form for submitting: https://forms.gle/WyLQHBu1DNwNTfqe8
You can submit your homework multiple times. In this case, only the last submission will be used.

Deadline: 20 March, 22:00 CET

We will publish the solution here