A collection of datasets on the Alone survival TV series in tidy format. Included in the package are 4 datasets detailed below.
For non-Rstats users, here is the link to the Google Sheets doc.
Install from CRAN:

```r
install.packages("alone")
```

Install from GitHub:

```r
devtools::install_github("doehm/alone")
```
A data frame of survivalists across all 9 seasons detailing name and demographics, location and profession, result, days lasted, reasons for tapping out (detailed and categorised), page URL.
Dataset features:
- `version`: The version country code (currently only US, but being prepared for other versions)
- `season`: The season number
- `id`: Survivalist unique ID
- `name`: Name of the survivalist
- `first_name`: First name of the survivalist
- `last_name`: Last name of the survivalist
- `age`: Age of the survivalist
- `gender`: Gender
- `city`: City
- `state`: State
- `country`: Country
- `result`: Place the survivalist finished in the season
- `days_lasted`: The number of days lasted in the game before tapping out or winning
- `medically_evacuated`: Logical. Whether the survivalist was medically evacuated from the game
- `reason_tapped_out`: The reason the survivalist tapped out of the game. `NA` means they were the winner
- `reason_category`: A simplified category of the reason for tapping out
- `team`: The team they were associated with (only for season 4)
- `day_linked_up`: Day the team members linked up
- `profession`: Profession
```r
library(alone)
library(tidyverse)

# Proportion of survivalists still in the game on each day, by gender
df <- expand_grid(
  days_lasted = 0:max(survivalists$days_lasted),
  gender = unique(survivalists$gender)
) |>
  left_join(
    survivalists |>
      count(days_lasted, gender),
    by = c("days_lasted", "gender")
  ) |>
  left_join(
    survivalists |>
      count(gender, name = "N"),
    by = "gender"
  ) |>
  group_by(gender) |>
  mutate(
    n = replace_na(n, 0),
    n_lasted = N - cumsum(n),
    p = n_lasted / N
  )

# Kaplan-Meier survival curves
df |>
  ggplot(aes(days_lasted, p, colour = gender)) +
  geom_line()

# boxplots (gender on x, days lasted on y; geom_jitter needs both aesthetics)
survivalists |>
  ggplot(aes(gender, days_lasted, fill = gender)) +
  geom_boxplot(alpha = 0.5) +
  geom_jitter(width = 0.2, pch = 1, size = 3) +
  theme_minimal()
```
Each survivalist is allowed to take with them 10 items. This dataset includes information on each survivalist’s loadout. It has detailed item descriptions and a simplified version for easier aggregation and analysis.
- `version`: Country code for the version of the show
- `season`: The season number
- `id`: Survivalist unique ID
- `name`: Name of the survivalist
- `item_number`: Item number
- `item_detailed`: Detailed loadout item description
- `item`: Loadout item. Simplified for aggregation
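```r
library(alone)
library(dplyr)
library(ggplot2)
library(forcats)

# Count how often each simplified item appears across all loadouts
loadouts |>
  count(item) |>
  mutate(item = fct_reorder(item, n, max)) |>
  ggplot(aes(item, n)) +
  geom_col() +
  # label each bar with its count, nudged past the end of the bar
  geom_text(aes(item, n + 3, label = n), size = 3) +
  coord_flip()
```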
This dataset contains details of each episode including the title, number of viewers, beginning quote and IMDb rating. New episodes will be added at the end of future seasons.
- `version`: Country code for the version of the show
- `season`: The season number
- `episode_number_overall`: Episode number across seasons
- `episode`: Episode number
- `title`: Episode title
- `air_date`: Date the episode originally aired
- `viewers`: Number of viewers in the US (millions)
- `quote`: The beginning quote
- `author`: Author of the beginning quote
- `imdb_rating`: IMDb rating of the episode
- `n_ratings`: Number of ratings given for the episode
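As a rough illustration of working with the episode-level columns listed above, the sketch below averages the IMDb rating per season. `season_ratings()` is a hypothetical helper, not a function from the package, and it assumes only that the input has the `season` and `imdb_rating` columns described here. It uses base R so that it runs without extra packages.

```r
# Hypothetical helper (not part of the alone package):
# mean IMDb rating per season for any data frame that has
# `season` and `imdb_rating` columns.
season_ratings <- function(episodes) {
  # The formula interface drops rows with a missing rating
  # and returns one row per season, sorted by season.
  aggregate(imdb_rating ~ season, data = episodes, FUN = mean)
}

# Usage, assuming the package exports the dataset as `episodes`:
# season_ratings(alone::episodes)
```

The result is a two-column data frame (`season`, `imdb_rating`) that can be passed straight to a plot or joined to other season-level data.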
The season summary dataset includes location, latitude and longitude, and other season level information. It includes the date of drop off where the information exists.
- `version`: Country code for the version of the show
- `season`: The season number
- `location`: Location
- `country`: Country
- `n_survivors`: Number of survivalists. In season 4 there were 7 teams of 2
- `lat`: Latitude
- `lon`: Longitude
- `date_drop_off`: Date the survivalists were dropped off
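Because the season-level and survivalist-level data share the `version` and `season` columns, the two can be joined to compare outcomes across locations. The sketch below is one possible way to do that in base R. `days_by_location()` is a hypothetical helper, not part of the package, and it assumes only the column names documented in this README.

```r
# Hypothetical helper (not part of the alone package):
# median days lasted per filming location.
days_by_location <- function(survivalists, seasons) {
  # Keep only the columns needed so the shared `country`
  # column does not get duplicated by the join.
  joined <- merge(
    survivalists[, c("version", "season", "days_lasted")],
    seasons[, c("version", "season", "location")],
    by = c("version", "season")
  )
  aggregate(days_lasted ~ location, data = joined, FUN = median)
}

# Usage, assuming the datasets are exported under these names:
# days_by_location(alone::survivalists, alone::seasons)
```

Joining on both `version` and `season` keeps the result correct if other country versions of the show are added later.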
If there is any data you would like to include, please get in touch.
- History: https://www.history.com/shows/alone/cast
- Wikipedia: https://en.wikipedia.org/wiki/Alone_(TV_series)
- Wikipedia (episodes): https://en.wikipedia.org/wiki/List_of_Alone_episodes#Season_1_(2015)_-_Vancouver_Island