PsyTeachR
diff --git a/‎04-summary.qmd
+169-170 b/‎04-summary.qmd
+169-170
diff --git a/‎09-wrangle.qmd
+2-2 b/‎09-wrangle.qmd
+2-2
diff --git a/‎app-dates.qmd
+38-2 b/‎app-dates.qmd
+38-2
diff --git a/‎data/12.1_delivery.csv
+97,078 b/‎data/12.1_delivery.csv
+97,078
diff --git a/‎data/date_formats.csv
+2 b/‎data/date_formats.csv
+2
diff --git a/‎data/rep_gho_mortality-metadata.pdf
-253 KB b/‎data/rep_gho_mortality-metadata.pdf
-253 KB
diff --git a/‎data/rep_gho_mortality.xlsx
-15.7 MB b/‎data/rep_gho_mortality.xlsx
-15.7 MB
diff --git a/‎data/weekly_ae_activity_20240303.csv
+14,190 b/‎data/weekly_ae_activity_20240303.csv
+14,190
@@ -8,7 +8,7 @@
 
 ## Walkthrough video {#sec-walkthrough-wrangle .unnumbered}
 
-There is a walkthrough video of this chapter available via [Echo360.](https://echo360.org.uk/media/dc1e2869-a6c2-45d8-ab40-cb85cdb67f43/public) Please note that there may have been minor edits to the book since the video was recorded. Where there are differences, the book should always take precedence.
+There is a walkthrough video of this chapter available via [Echo360](https://echo360.org.uk/media/dc1e2869-a6c2-45d8-ab40-cb85cdb67f43/public). Please note that there may have been minor edits to the book since the video was recorded. Where there are differences, the book should always take precedence.
 
 ## Set-up {#sec-setup-wrangle}
 
@@ -284,7 +284,7 @@ Note that `str_detect()` is case sensitive so it would not return values of "Hig
 `filter()` is incredibly powerful and can allow you to select very specific subsets of data. But, it is also quite dangerous because when you start combining multiple criteria and operators, it's very easy to accidentally specify something slightly different than what you intended. **Always check your output**. If you have a small dataset, then you can eyeball it to see if it looks right. With a larger dataset, you may wish to compute summary statistics or count the number of groups/observations in each variable to verify your filter is correct. There is no level of expertise in coding that can substitute knowing and checking your data. 
 :::
 
-### Arrange
+### Arrange #sec-arrange
 
 You can sort your dataset using `arrange()`. You will find yourself needing to sort data in R much less than you do in Excel, since you don't need to have rows next to each other in order to, for example, calculate group means. But `arrange()` can be useful when preparing data for display in tables. `arrange()` works on character data where it will sort alphabetically, as well as numeric data where the default is ascending order (smallest to largest). Reverse the order using `desc()`.
 
 
@@ -10,9 +10,46 @@ library(tidyverse)
 library(lubridate)
 ```
 
+## Formats
+
+While there is only one correct way to write date (The ISO 8601 format of "YYYY-MM-DD"), dates can be found in many formats. When you are reading a data file, you might need to specify the date format so it can be read properly. Date format specification uses abbreviations to represent the different ways people can write. the year, month, and day (as well as hours, minutes, and seconds). For example, the date `2023-01-03` is represented by the formatting string `"%Y-%m-%d`. The fastest way to find the list of formatting abbreviations is to look in the help for the function `col_date()`.
+
+```{r, filename = "Run in the console"}
+?col_date
+```
+
+
+```{r}
+# create a table with some different date formats
+date_formats <- tibble(
+  best = "2022-01-03",
+  ok = "2022 January 3",
+  bad = "January 3, 2022",
+  terrible = "Mon is 3 22 1"
+)
+
+# save it as a CSV file
+write_csv(date_formats, "data/date_formats.csv")
+
+# read it in
+df <- read_csv("data/date_formats.csv")
+```
+
+You can see that only the first column read as a date, and the rest read as characters. You can set the date format using the `col_types` argument and two helper functions, `cols()` and `col_date()`.
+
+```{r}
+ct <- cols(ok = col_date("%Y %B %d"),
+           bad = col_date("%B %d, %Y"),
+           terrible = col_date("%a is %m %y %d"))
+
+read_csv("data/date_formats.csv", 
+         col_types = ct)
+```
+
+
 ## Parsing
 
-Dates can be in many formats. The `ymd` functions can deal with almost all of them, regardless of the punctuation used in the format. All of the examples below produce a date in the standard format "2022-01-03".
+The `ymd` functions can deal with almost all date formats, regardless of the punctuation used in the format. All of the examples below produce a date in the standard format "2022-01-03".
 
 ```{r ymd, results='hide'}
 # year-month-day orders
@@ -45,7 +82,6 @@ The date/time functions can also take a timezone argument. If you don't specify
 ymd_hm("2022-01-03 18:05", tz = "GMT")
 ```
 
-
 ## Get Parts
 
 You frequently need to extract parts of a date/time for plotting. The following functions extract specific parts of a date or datetime object. This is a godsend for those of us who never have a clue what week of the year it is today.
 
@@ -0,0 +1,2 @@
+best,ok,bad,terrible
+2022-01-03,2022 January 3,"January 3, 2022",Mon is 3 22 1
Original file line number	Diff line number	Diff line change
`@@ -0,0 +1,2 @@`
	`1`	`+best,ok,bad,terrible`
	`2`	`+2022-01-03,2022 January 3,"January 3, 2022",Mon is 3 22 1`