This document explores the claim that universities are increasingly
hiring more part-time faculty and less full-time faculty. It starts with
a rather poorly designed plot, which shows the trend but only at the
cost of some considerable effort on the part of the reader, and improves
the visualization, which shows the trend more clearly.
Analysing the Original Visualization
The American Association of University Professors (AAUP), a nonprofit
membership association of faculty and other academic professionals,
created this
report. The report discusses trends in instructional staff employees
between 1975 and 2011, and contains a visualization very similar to this
one:
🚧 Analyze this plot: what type of plot is it? What variables are
mapped to what aesthetics? What is easy to see, what is hard to see?
Designing Alternate Visualizations
🚧 Sketch on paper two alternative designs for this plot that might
do a better job of illustrating the hiring levels different types of
faculty over time. Describe the alternatives in the same way as you just
did above (type, aesthetic mappings) and explain how they might be
improvements of the original.
Rebuilding the Visualization
We now update the visualization incrementally, starting with an
approximation of the original.
1. Reproduce the original plot.
🚧 Remake the original plot, starting with this
dataset and trying to make it look as follows.
Notes:
- The dataset will need to be pivoted, so that you have columns for
faculty_type, year, and
percentage.
- Use a column chart with dodged bars
(
geom_col(position = "dodge")).
- Use the Viridis color scale
(
scale_fill_viridis_d())
- Include labels (
labs()) for: x,
y, fill, title,
caption
2. Convert to a stacked bar plot.
We’d like to more directly compare the hiring levels of the different
faculty types.
🚧 Update the plot to look like this.
Notes:
- See if you can do this without repeating the
pivot_longer.
- We used
theme_minimal().
3. Convert to a line plot.
Line plots tend to be good for view values over time.
🚧 Update the plot to look like this.
Notes:
- Note the use of a redundant encoding (
shape). I had to
add a geom_point layer to draw those.
- Note that, at first, the lines don’t actually show up. You can hack
you way to make it work by adding
group = year to the
aesthetic mappings; a Google search on the error message you get will
probably get you there. But that yields the plot shown above. If you
look carefully, you’ll see something badly wrong with it.
- Once you notice what’s wrong with the line plot, then fix things by
making sure that the year is treated
as.numeric. (And once
you do that, you don’t need the group = year hack anymore
anyway.)
- If you made changes to earlier code, make sure your earlier plots
still work. (You may need to replace
year with
factor(year) in those plots, depending on how you did
it.)
4. Just show the numbers.
Sometimes, simply displaying numbers is as effective as visualizing
them. Let’s make the table below, which shows the changes in
hiring levels, i.e., the 2011 percentage minus the 1975 percentage.
🚧
- First, make the table the easy way, starting with
staff. Tip: 2011 isn’t normally a valid
variable name, but you can use backticks (next to the 1 key):
`2011`. You’ll probably need to select some
columns.
- Once you have the resulting table, can you now make the same thing
but starting with your long-format data?
- Yes, you’ll need to pivot.
- Can you do write this pipeline without using
select()?
Hint: I think that select and filter should
have been called select_columns and
select_rows.
|
Full-Time Tenured Faculty
|
29.0
|
16.7
|
-12.3
|
|
Full-Time Tenure-Track Faculty
|
16.1
|
7.4
|
-8.7
|
|
Full-Time Non-Tenure-Track Faculty
|
10.3
|
15.4
|
5.1
|
|
Part-Time Faculty
|
24.0
|
41.3
|
17.3
|
|
Graduate Student Employees
|
20.5
|
19.3
|
-1.2
|
Notes:
- You can format the table by piping the resulting dataframe through
knitr::kable().
Drawing Conclusions
🚧 In conclusion, are universities hiring more part-time faculty and
less full-time faculty? Why might this be?
---
title: "Lab 7.1 - Faculty Hiring Patterns"
author: "Author Goes Here"
date: "Semester Goes Here"
output:
  html_document:
    code_download: true
---

```{r setup, include = FALSE}
# Set some useful chunk options for code chunks.
knitr::opts_chunk$set(
  echo = TRUE,
  error = TRUE,    # display errors but keep going.
  comment = "",    # don't add '##' characters before output.
  message = FALSE  # don't include messages in the knitted output (check them in RStudio)
  )
library(tidyverse)
theme_set(theme_bw())
```

This document explores the claim that universities are increasingly hiring more part-time faculty and less full-time faculty. It starts with a rather poorly designed plot, which shows the trend but only at the cost of some considerable effort on the part of the reader, and improves the visualization, which shows the trend more clearly.

## Analysing the Original Visualization

The American Association of University Professors (AAUP), a nonprofit membership association of faculty and other academic professionals, created [this report](https://www.aaup.org/sites/default/files/files/AAUP_Report_InstrStaff-75-11_apr2013.pdf). The report discusses trends in instructional staff employees between 1975 and 2011, and contains a visualization very similar to this one:

![Instructional staff employment trends](https://cs.calvin.edu/courses/info/601/07tidy/lab/images/staff-employment.png){width="700px"}

🚧
Analyze this plot: what type of plot is it? What variables are mapped to what aesthetics? What is easy to see, what is hard to see?

## Designing Alternate Visualizations

🚧
Sketch on paper two alternative designs for this plot that might do a better job of illustrating the hiring levels different types of faculty over time. Describe the alternatives in the same way as you just did above (type, aesthetic mappings) and explain how they might be improvements of the original.

## Rebuilding the Visualization

We now update the visualization incrementally, starting with an approximation of the original.

### 1. Reproduce the original plot.

🚧
Remake the original plot, starting with [this dataset](https://cs.calvin.edu/courses/info/601/07tidy/lab/data/instructional-staff.csv) and trying to make it look as follows.

![aaup-1](https://cs.calvin.edu/courses/info/601/07tidy/lab/images/aaup-1.png)

Notes:

- The dataset will need to be pivoted, so that you have columns for `faculty_type`, `year`, and `percentage`.
- Use a column chart with dodged bars (`geom_col(position = "dodge")`).
- Use the Viridis color scale (`scale_fill_viridis_d()`)
- Include labels (`labs()`) for: `x`, `y`, `fill`, `title`, `caption`

### 2. Convert to a stacked bar plot.

We'd like to more directly compare the hiring levels of the different faculty types.

🚧
Update the plot to look like this.

![aaup-2](https://cs.calvin.edu/courses/info/601/07tidy/lab/images/aaup-2.png)

Notes:

- See if you can do this without repeating the `pivot_longer`.
- We used `theme_minimal()`.

### 3. Convert to a line plot.

Line plots tend to be good for view values over time.

🚧
Update the plot to look like this.

![aaup-3](https://cs.calvin.edu/courses/info/601/07tidy/lab/images/aaup-3.png)

Notes:


- Note the use of a redundant encoding (`shape`). I had to add a `geom_point` layer to draw those.
- Note that, at first, the lines don't actually show up. You can hack you way to make it work by adding `group = year` to the aesthetic mappings; a Google search on the error message you get will probably get you there. But that yields the plot shown above. If you look carefully, you'll see something badly wrong with it.
- Once you notice what's wrong with the line plot, then fix things by making sure that the year is treated `as.numeric`. (And once you do that, you don't need the `group = year` hack anymore anyway.)
- If you made changes to earlier code, make sure your earlier plots still work. (You may need to replace `year` with `factor(year)` in those plots, depending on how you did it.)

### 4. Just show the numbers.

Sometimes, simply displaying numbers is as effective as visualizing them. Let's make the *table* below, which shows the changes in hiring levels, i.e., the 2011 percentage minus the 1975 percentage.

🚧

- First, make the table the easy way, starting with `staff`. Tip: `2011` isn't normally a valid variable name, but you can use backticks (next to the 1 key): `` `2011` ``. You'll probably need to `select` some columns.
- Once you have the resulting table, can you now make the same thing but starting with your *long*-format data?
  - Yes, you'll need to pivot.
  - Can you do write this pipeline without using `select()`? Hint: I think that `select` and `filter` should have been called `select_columns` and `select_rows`.

<table>
<thead>
<tr class="header">
<th align="left">faculty_type</th>
<th align="right">1975</th>
<th align="right">2011</th>
<th align="right">change</th>
</tr>
</thead>
<tbody>
<tr class="odd">
<td align="left">Full-Time Tenured Faculty</td>
<td align="right">29.0</td>
<td align="right">16.7</td>
<td align="right">-12.3</td>
</tr>
<tr class="even">
<td align="left">Full-Time Tenure-Track Faculty</td>
<td align="right">16.1</td>
<td align="right">7.4</td>
<td align="right">-8.7</td>
</tr>
<tr class="odd">
<td align="left">Full-Time Non-Tenure-Track Faculty</td>
<td align="right">10.3</td>
<td align="right">15.4</td>
<td align="right">5.1</td>
</tr>
<tr class="even">
<td align="left">Part-Time Faculty</td>
<td align="right">24.0</td>
<td align="right">41.3</td>
<td align="right">17.3</td>
</tr>
<tr class="odd">
<td align="left">Graduate Student Employees</td>
<td align="right">20.5</td>
<td align="right">19.3</td>
<td align="right">-1.2</td>
</tr>
</tbody>
</table>

Notes:

- You can format the table by piping the resulting dataframe through `knitr::kable()`.

## Drawing Conclusions

🚧
In conclusion, are universities hiring more part-time faculty and less full-time faculty? Why might this be?
