Skip to content

Latest commit

 

History

History
69 lines (36 loc) · 4.7 KB

README.md

File metadata and controls

69 lines (36 loc) · 4.7 KB

School District Analysis

Analyze data from a school district and provide insights

Overview of Analysis

Due to potential academic dishonesty, the Freshman class at Thomas High School (THS) has had its test scores voided. The purpose of this analysis will be to determine the impact of having these scores thrown out.

Results of Analysis

The results of the School District Analysis changed significantly due to the potential academic dishonesty over at THS. We'll take a deep dive into several different metrics below and make before-and-after comparisons.

  • The district summary did not change all that much. This is due to the relatively small amount of test scores that were thrown out (461) compared to the total number of student in the district (39,170).

    Before Potential Academic Dishonesty (PAD):

    district_summary_before

    After PAD:

    district_summary_after

  • The school summary was not significantly impacted by the PAD. The per school statistics did not change at all, aside from THS, who experienced marginal decreases across the board in scores and passing rates.

  • Despite the marginal decreases in each metric for THS, their overall performance relative to other schools was unaffected when measured by overall passing rate. Below are images of the school summary DataFrame, sorted by overall passing rate:

    Before PAD:

    school_summary_before

    After PAD:

    school_summary_after

  • Math and reading scores by grade were only affected for the THS freshman class. Since none of the THS freshman class' scores were counted, they do not have valid averages and the "after" DataFrame returns nan.

Math Reading
Before PAD math_by_grade_before read_by_grade_before
After PAD math_by_grade_after read_by_grade_after
  • Scores by school spending were unaffected up to the level of precision included in the DataFrame, that is, to the tenths place for Average Scores and the nearest whole number for Passing Percentages. Both the "before PAD" and "after PAD" look like this:

    scores_by_spending

  • Scores by school size were also unaffected up to the level of precision included in the DataFrame:

    scores_by_size

  • Scores by school type were also unaffected up to the level of precision included in the DataFrame:

    scores_by_type

Summary of Analysis

Despite having 461 students' scores removed from the dataset, the impact on the school district's summary was minimal. This is because the average of the 9th grade scores at THS that were thrown out were almost identical to the average of the 10th-12th grade scores at THS.

When we make comparisons at the district level (scores by spending/school size/school type), Thomas High School is considered as a part of a whole. Since THS's averages remained almost the same, the absence of the 9th grade data became inconsequential at the macro level. There were, however, a few things that changed:

  1. THS's average math score dropped by about 0.07%.
  2. THS's average reading score rose by about 0.05%.
  3. The percentage of THS students that passed math dropped by about 0.09%.
  4. The percentage of THS students that passed reading dropped by about 0.29%.
  5. The percentage of THS students that passed both math and reading dropped by about 0.32%.