Skip to content

Commit

Permalink
Merge branch 'main' into sql-5
Browse files Browse the repository at this point in the history
  • Loading branch information
gildedgardenia committed May 21, 2024
2 parents 1554783 + e0b308a commit 3a9522e
Show file tree
Hide file tree
Showing 51 changed files with 948 additions and 33 deletions.
6 changes: 3 additions & 3 deletions content/_index.md
Original file line number Diff line number Diff line change
Expand Up @@ -8,9 +8,9 @@ draft = false

This course is an introduction to Data Analysis concepts and tools, such as Google Sheets, Python, SQL, and Tableau. Data Analysis is a multi-faceted field that includes working with business leaders, writing code, and digging through data to find solutions to issues a business is facing. As more and more companies turn to data to assist in making decisions, Data Analysis has grown exponentially. Not only is this a field that many companies need, but the analytical and learning skills you pick up in this course will serve you as you navigate today's technical landscape.

We use Google Sheets to master spreadsheets as it is a tool that is available to all our students and it is robust enough for us to dive into complex datasets and start to seek out answers to important questions. To further dive into the data, we will be using Python. Python is a programming language that is used by many data analysts worldwide to perform complex functions and navigate very large datasets. Python also has other uses beyond data analystics so after learning the basics of programming and Python, you will find that you can adapt that skillset to many other stages of your learning journey. SQL is a different type of programming language that helps us navigate databases. We will learn more about what a database is later, but for now, you should know that a database holds large quantities of data. Companies store their data different ways, but by diving into the basics with SQL, we hope to give you the foundation you need to confidently explore a company's data storage and get the numbers you require. Finally, we will be using Tableau to put together visuals and graphics to help us communicate our findings to our colleagues. Tableau is one of the industry's most powerful visualization tools and we will use this tool to foster conversation about what makes a visual effective and how we can communicate our findings to the appropriate team.
We use Google Sheets to master spreadsheets as it is a tool that is available to all our students and it is robust enough for us to dive into complex datasets and start to seek out answers to important questions. To further dive into the data, we will be using Python. Python is a programming language that is used by many data analysts worldwide to perform complex functions and navigate very large datasets. Python also has other uses beyond data analytics so after learning the basics of programming and Python, you will find that you can adapt that skillset to many other stages of your learning journey. SQL is a different type of programming language that helps us navigate databases. We will learn more about what a database is later, but for now, you should know that a database holds large quantities of data. Companies store their data different ways, but by diving into the basics with SQL, we hope to give you the foundation you need to confidently explore a company's data storage and get the numbers you require. Finally, we will be using Tableau to put together visuals and graphics to help us communicate our findings to our colleagues. Tableau is one of the industry's most powerful visualization tools and we will use this tool to foster conversation about what makes a visual effective and how we can communicate our findings to the appropriate team.

As you work through this course, we encourage you to remember that learning data analytics is not only valuable and challenging, it can also be fun! Every moment inspires us to keep going forward and to learn new things. You may find some concepts difficult to understand at first, but these will also be the skills you may take the most pride in mastering making the journey long and winding and rewarding.
As you work through this course, we encourage you to remember that learning data analytics is not only valuable and challenging, it can also be fun! Every moment inspires us to keep going forward and to learn new things. You may find some concepts difficult to understand at first, but these will also be the skills you may take the most pride in mastering, making the journey long and winding and rewarding.

From the moment you started reading this book, you became a data analyst. We hope you enjoy your journey with us!

Expand All @@ -31,6 +31,6 @@ There are no academic pre-requisites for this course.
In order to participate, you will need to bring your own laptop. A Chromebook or tablet will not be sufficient. Your laptop should meet the following requirements:

1. Be younger than 4 years old and able to support the latest operating system.
1. You have administrator privileges and am able to install new software.
1. You have administrator privileges and are able to install new software.
1. Has an i5 or i7 Intel chip or an M1 or newer Apple processor.
1. Has at least 8 GB of RAM.
8 changes: 4 additions & 4 deletions content/assignments/assignment0/hello-spreadsheets/_index.md
Original file line number Diff line number Diff line change
Expand Up @@ -9,11 +9,11 @@ Once you have the Google Sheet open in your browser window, have a look around t

## Car Info Tab

Below you will find a screenshot of what the `Car Info` spreadsheet currently looks like:
Below, you will find a screenshot of what the `Car Info` spreadsheet currently looks like:

![Image of car info spreadsheet tab within the Hello, Google Sheets! google sheet](pictures/car-info-spreadsheet.png?classes=border)

As mentioned in the [Project Introduction]({{% relref "../project-introduction/_index.md#car-info" %}}) section The `Car Info` tab contains a monthly and yearly estimate for a total of four cars. You can see in cell K11 the insurance cost for the 2023 year was 1400. However insurace premiums have been raised by `12%`. Complete the following steps in order to do so:
As mentioned in the [Project Introduction]({{% relref "../project-introduction/_index.md#car-info" %}}) section, the `Car Info` tab contains a monthly and yearly estimate for a total of four cars. In cell K11, you will see that the insurance cost for the 2023 year was $1400. However, insurace premiums have been raised by 12%. Complete the following steps in order to update the insurance costs to the new 2024 price:

1. Within cell J12, type in "Insurance Rate Increase".
1. In cell K12, type in 1.12 to represent the 12% rate increase.
Expand All @@ -27,7 +27,7 @@ Below you will find a screenshot of what the `Grocery Info` spreadsheet currentl

![Image of the grocery info spreadsheet tab within the Hello, Google Sheets! google sheet](pictures/grocery-info-spreadsheet.png?classes=border)

As mentioned in the [Project Introduction]({{% relref "../project-introduction/_index.md#grocery-info" %}}) section The `Grocery Info` tab contains categories for groceries purchased at the store, including a weekly cost breakdown for each category. You will notice in cells J12 and K12 the 2023 inflation rates are referenced. In this scenario, inflation had risen by 5% for the 2023 year. However, in 2024 inflation is now at 8%. You will need to adjust the monthly grocery estimate so that it matches the new rates of inflation.
As mentioned in the [Project Introduction]({{% relref "../project-introduction/_index.md#grocery-info" %}}) section, the `Grocery Info` tab contains categories for groceries purchased at the store, including a weekly cost breakdown for each category. You will notice in cells J12 and K12 the 2023 inflation rates are referenced. In this scenario, inflation had risen by 5% for the 2023 year. However, in 2024 inflation is now at 8%. You will need to adjust the monthly grocery estimate so that it matches the new rates of inflation.

1. Within cell J9, type in "2024 Inflation Rates"
1. In cell K9, type in "0.08" to represent the 8% inflation rate.
Expand All @@ -43,7 +43,7 @@ Below is an image of the `Overview` spreadsheet.

![Image of the overview spreadsheet tab within the Hello, Google Sheets! google sheet](pictures/overview-spreadsheet.png?classes=border)

Now that the data within the `car-info` tab and the `grocery-info` tab has been updated we can calculate the monthly and yearly estimates in the `overview` tab.
Now that the data within the `Car Info` tab and the `Grocery Info` tab has been updated we can calculate the monthly and yearly estimates in the `Overview` tab.

1. Within cell B2, reference the Total Monthly Estimate from the `Car Info` spreadsheet using the appropriate forumla.
1. In cell B3, reference the Monthly Grocery Estimate from the `Grocery Info` spreadsheet using the appropriate forumla.
Expand Down
22 changes: 11 additions & 11 deletions content/assignments/assignment0/project-introduction/_index.md
Original file line number Diff line number Diff line change
Expand Up @@ -5,44 +5,44 @@ draft = false
weight = 1
+++

Your first assignment is mostly an exercise in following instructions. We'll ask you to modify an existing google sheet so that, when updated, will return a specific value within the given cell.
Your first assignment is mostly an exercise in following instructions. We'll ask you to modify an existing Google Sheet so that, when updated, will return a specific value within the given cell.

{{% notice orange Warning "rocket" %}}
In order to complete this assignment you will need to have created a google account! If you have not done so you can find the link below:
In order to complete this assignment you will need to have created a Google account! If you have not done so you can find the link below:

1. [Create a google account](https://support.google.com/accounts/answer/27441?hl=en)
1. [Create a Google account](https://support.google.com/accounts/answer/27441?hl=en)
{{% /notice %}}

## Requirements
1. Sign in to your associated google account.
1. Adjust data and formulas within the provided google sheet.
1. Sign in to your associated Google account.
1. Adjust data and formulas within the provided Google Sheet.
1. Submit your assignment link for grading.

### Google Account
1. If you have not already done so, [sign in to your google account](https://support.google.com/accounts/answer/27441?hl=en) before moving forward.
1. If you have not already done so, [sign in to your Google account](https://support.google.com/accounts/answer/27441?hl=en) before moving forward.

{{% notice blue Note "rocket" %}}
You can learn more about google accounts and a general overview here: [Google Account overview and features](https://account.google.com/)
You can learn more about Google accounts and a general overview here: [Google Account overview and features](https://account.google.com/)
{{% /notice %}}

## Assignment

The following google sheet link will be pre-populated with existing data. You will need to complete the following tasks:
The following Google Sheet link will be pre-populated with existing data. You will need to complete the following tasks:
1. Adjust insurance rates based on the new premiums in 2024 for car insurace.
1. Update the monthly and yearly grocery budget to account for inflation.
1. Create a dashboard of combined data from the car and grocery budgets.

Below is some additional information about the data included within the spreadsheet:

### Car Info
The `Car Info` tab contains a monthly and yearly estimate for a total of four cars with other various information related to each vehicle. However, the `Monthly Insurace Cost` will need to be updated as it is based on the previous years insurance premiums and the rates have gone up `12%`.
The `Car Info` tab contains a monthly and yearly estimate for a total of four cars with other various information related to each vehicle. However, the `Monthly Insurace Cost` will need to be updated as it is based on the previous year's insurance premiums and the rates have gone up 12%.

### Grocery Info
The `Grocery Info` tab contains categories for groceries purchased at the store, including a weekly cost breakdown for each category. The sheet also contains an average weekly spend, a monthly category spend, and a monthly total spend. The existing spreadsheet also has a 2023 Grocery budget including an inflation rate adjustment. You will need to add cell that includes the 2024 inflation rate of `8%` and adjust the existing formulas to reference the new cell.

### Overview
Lastly there is an `Overview` tab, combining data from the Car Info sheet and the Grocery Info sheet. The goal of this sheet is to provide one centralized place to view the total combined costs for Car and Grocery expenses. You will need to reference the cells containing the monthly and yearly totals for both car and grocery expenses, and display the total for both.

First, find Assignment #0 in Canvas and click on the starter google sheet link.
First, find Assignment #0 in Canvas and click on the starter Google Sheet link.

Once you have the sheet open you are ready to move on!
Once you have the sheet open, you are ready to move on!
8 changes: 4 additions & 4 deletions content/assignments/assignment0/submission/_index.md
Original file line number Diff line number Diff line change
Expand Up @@ -7,16 +7,16 @@ weight = 1

## Submitting your work

Once you are done with your assignment and have verified that you have the correct outcomes, navigate to the Canvas assignment and paste the link to your google sheet!
Once you are done with your assignment and have verified that you have the correct outcomes, navigate to the Canvas assignment and paste the link to your Google Sheet!

### Proper Sharing Options

When sharing a google sheet you want to ensure you provide the correct sharing permissions. For this assignment you want to only allow individuals with the link to be able to view the sheet.
When sharing a google sheet you want to ensure you provide the correct sharing permissions. For this assignment, you want to only allow individuals with the link to be able to *view* the sheet.

1. Click on the "Share" button in the top right corner.
1. Under "General Access" click it may still be "Restricted" access, meaning the only people that have been directly added can view the google sheet.
1. Under "General Access", it may still be "Restricted" access, meaning the only people that have been directly added can view the google sheet.
1. Click on the "Restricted" button and select "Anyone with the link"
1. To the right, make sure the "Viewer" option is selected. This ensures that no one can edit or make changes to your sheet, even if they have access.
1. To the right, make sure the "Viewer" option is selected. This ensures that no one can edit or make changes to your workbook, even if they have access.
1. Click the *Copy link* button and paste the link within the submission box on the assignment page within Canvas!

If you have completed the above steps, your sharing options should look similar to the image below:
Expand Down
27 changes: 27 additions & 0 deletions content/assignments/assignment4/checkpoint-2/_index.md
Original file line number Diff line number Diff line change
@@ -0,0 +1,27 @@
+++
title = "Checkpoint 2: EDA"
date = 2023-05-25T12:55:09-05:00
draft = false
weight = 2
+++

## Before You Start

If you have not received any feedback from Checkpoint 1, check in with your mentors before working on Checkpoint 2.

## Getting Started

Fork [this GitHub repository](https://github.com/launchcodeeducation/eda-checkpoint) and clone it to your computer. Follow the instructions within the notebook using your chosen dataset. Add notes about your EDA where directed.

### Examples
{{% notice blue Note %}}
Checkpoint 2 examples can be found here: [Checkpoint 2 Examples](https://github.com/LaunchCodeEducation/finalProjectDAExamples/tree/main/Checkpoint%202)
{{% /notice %}}

## Submitting Your Work

When finished with your EDA, make sure to push your changes up to GitHub. Copy the link to your GitHub repository and paste it into the submission box in Canvas for Graded Assignment 4: Checkpoint 2 and click *Submit*.

[Back to Final Project Overview]({{% relref "./../" %}})


33 changes: 33 additions & 0 deletions content/assignments/assignment4/checkpoint-4/_index.md
Original file line number Diff line number Diff line change
@@ -0,0 +1,33 @@
+++
title = "Checkpoint 4: Tableau Story"
date = 2023-05-25T12:55:09-05:00
draft = false
weight = 4
+++

## Before You Start

You want to first check to see if you have received any feedback from Checkpoints 2 and 3. This
feedback could influence the direction of your work on Checkpoint 4. If you want to change anything
about what you have done so far in earlier checkpoints, you do not have to re-submit any previous
checkpoints unless your IA or instructor requests you do so. You can simply add any updated work and notes to
the current checkpoint.

## Getting Started

For this checkpoint, you will need to manipulate your data and produce a Tableau story that shows off skills from class, such as filtering and table calculations. You may find yourself wanting to use pandas and Jupyter notebooks for data manipulation. If you do, make sure to add code comments explaining your thought process and push your work up to Github.

No matter what visualizations you add to your Tableau story, all of your captions should include explanations as to your thought process for each visualization. The first caption should include a link to your dataset and the final story point should include links to any supporting materials, such as the Github repository if you used a Jupyter notebook for this checkpoint.

## Examples

{{% notice blue Note "rocket" %}}
Checkpoint 4 examples can be found here: [Checkpoint 4 Examples](https://github.com/LaunchCodeEducation/finalProjectDAExamples/tree/main/Checkpoint%204).
{{% /notice %}}

## Submitting Your Work

When finished paste the link to your Tableau story into the submission box in Canvas for Graded
Assignment #4: Checkpoint 4 and click *Submit*.

[Back to Final Project Overview]({{% relref "./../" %}})
Original file line number Diff line number Diff line change
Expand Up @@ -169,7 +169,7 @@ Now that you have seen how to pull aside a single column's data let's take a loo
# import pandas
import pandas as pd

movies = {'Name': ["Interstellar", "Pride and Prejudice", "Inception", "Barbie"],'Release': [2014, 2005, 2010, 2003], 'Genre': ["Science Fiction", "Novel", "Science Fiction", "Comedy"]}
movies = {'Name': ["Interstellar", "Pride and Prejudice", "Inception", "Barbie"],'Release': [2014, 2005, 2010, 2023], 'Genre': ["Science Fiction", "Novel", "Science Fiction", "Comedy"]}

movies_dataframe = pd.DataFrame(movies)

Expand All @@ -194,4 +194,4 @@ True or False: Column names cannot be changed in a DataFrame.
True or False: A DataFrame column is a Series.

<!-- Solution: True -->
{{% /notice %}}
{{% /notice %}}
Original file line number Diff line number Diff line change
Expand Up @@ -22,6 +22,14 @@ The pandas library is incredibly powerful and was built specifically for data an
We will use pandas to create, manipulate, and view data structures based on certain conditions. We will also cover some of the most common functions used when exploring data with pandas that we can use to our advantage during the exploration process.

{{% notice blue Note "rocket" %}}
To install pandas, you will need to run the following command:

```python
pip install pandas
```

If the above command does not work, you may need to specify `pip3` in the command.

Once pandas is installed, it can be imported into your workspace in the following way:

```python
Expand Down
14 changes: 14 additions & 0 deletions content/how-programs-work/reading/data-analysis-projects/index.md
Original file line number Diff line number Diff line change
@@ -0,0 +1,14 @@
+++
title = "GitHub Repository: Data Analysis Projects"
date = 2024-01-08T10:24:35-06:00
draft = false
weight = 7
+++

Throughout this course, you will be utilizing a GitHub repository that holds starter code for exercises, studios, and in some lessons, example code from readings.

Fork the following repository and clone it to your machine: [LaunchCode Education: Data Analysis Projects](https://github.com/launchcodeeducation/data-analysis-projects)

{{% notice blue Note "rocket" %}}
You will begin using the `data-analysis-projects` repository beginning in the next chapter. Do not move on to the next page without it!
{{% /notice %}}
Loading

0 comments on commit 3a9522e

Please sign in to comment.