10:00
Explore the bears
data frame:
glimpse(bears)head(bears)
plot()
plot()
General syntax:
plot(x = x_vector, y = y_vector)
plot()
General syntax:
plot(x = x_vector, y = y_vector)
Example:
var1 <- bears$yearvar2 <- bears$ageplot(x = var1, y = var2)
plot()
x
and y
must have the same length!plot()
x
and y
must have the same length!var2 <- var2[-1]
plot()
x
and y
must have the same length!var2 <- var2[-1]
length(var1) == length(var2)
#> [1] FALSE
plot()
x
and y
must have the same length!var2 <- var2[-1]
length(var1) == length(var2)
#> [1] FALSE
plot(x = var1, y = var2)
#> Error in xy.coords(x, y, xlabel, ylabel, log): 'x' and 'y' lengths differ
plot()
prettyplot( x = bears$year, y = bears$age, col = 'darkblue', # Point color pch = 19, # Point shape main = "Age of victims over time", xlab = "Year", ylab = "Age of victim")
10:00
plot()
Does the annual number of bird impacts appear to be changing over time?
Make a plot using the birds
data frame to justify your answer.
Hint: You may need to create a summary data frame to answer this question!
Bonus: Make your plot pretty!
hist()
General syntax:
hist(x = x_vector)
hist()
General syntax:
hist(x = x_vector)
Example:
hist(bears$month)
hist()
prettyhist( x = bears$month, breaks = 12, col = 'darkred', main = "Bear killings by month", xlab = "Month", ylab = "Count")
10:00
hist()
Make plots using the birds
data frame to answer these questions
Bonus: Make your plots pretty!
Concept developed by Leland Wilkinson (1999)
ggplot2 package developed by Hadley Wickham (2005)
bears
)The ggplot()
function initializes the plot with whatever data you're using
ggplot(data = bears)
The aes()
function determines which variables will be mapped to the geometries
(e.g. the axes)
ggplot( data = bears, mapping = aes(x = year, y = age))
Use +
to add geometries (e.g. points)
ggplot( data = bears, mapping = aes(x = year, y = age)) + geom_point()
geom_point()
: scatter plotsgeom_line()
: lines connecting data pointsgeom_col()
: bar chartsgeom_boxplot()
: boxes for boxplotsgeom_point()
Add points:
ggplot( data = bears, mapping = aes(x = year, y = age)) + geom_point()
geom_point()
Change the color of all points:
ggplot( data = bears, mapping = aes(x = year, y = age)) + geom_point(color = 'blue')
geom_point()
Map the point color to a variable:
ggplot( data = bears, mapping = aes(x = year, y = age)) + geom_point(aes(color = gender))
Note that color = gender
is inside aes()
geom_point()
Adjust labels with labs()
layer:
ggplot( data = bears, mapping = aes(x = year, y = age)) + geom_point(aes(color = gender)) + labs( x = "Year", y = "Age", title = "Bear victim age over time", color = "Gender" )
10:00
geom_point()
Use the birds
data frame to create the following plots
05:00
geom_col()
With bar charts, you'll often need to create summary variables to plot
geom_col()
With bar charts, you'll often need to create summary variables to plot
Step 1: Summarize the data
bear_months <- bears %>% count(month)
Step 2: Make the plot
ggplot(data = bear_months) + geom_col(aes(x = month, y = n))
Example: count of attacks by month
geom_col()
Alternative approach: piping directly into ggplot
bears %>% count(month) %>% # Pipe into ggplot ggplot() + geom_col(aes(x = month, y = n))
geom_col()
vs. geom_bar()
geom_col()
Map both x
and y
bears %>% count(month) %>% ggplot() + geom_col(aes(x = month, y = n))
geom_bar()
Only map x
(y
is computed)
bears %>% ggplot() + geom_bar(aes(x = month))
geom_col()
Another example:
Mean age of victim in each year
bears %>% filter(!is.na(age)) %>% group_by(year) %>% summarise(meanAge = mean(age)) %>% ggplot() + geom_col(aes(x = year, y = meanAge))
width
fill
color
bears %>% count(month) %>% ggplot() + geom_col( mapping = aes(x = month, y = n), width = 0.7, fill = "blue", color = "red" )
fill
to bearType
bears %>% count(month, bearType) %>% ggplot() + geom_col( mapping = aes( x = month, y = n, fill = bearType) )
Note that I had to summarize the count by both month
and bearType
bears %>% count(month, bearType)
#> # A tibble: 27 × 3#> month bearType n#> <dbl> <chr> <int>#> 1 1 Brown 1#> 2 1 Polar 2#> 3 2 Brown 1#> 4 3 Brown 1#> 5 4 Black 1#> 6 4 Brown 3#> 7 5 Black 15#> 8 5 Brown 2#> 9 5 Polar 1#> 10 6 Black 10#> # ℹ 17 more rows
By default, R makes numeric variables continuous
bears %>% count(month) %>% ggplot() + geom_col(aes(x = month, y = n))
The variable month
is a number
You can make a continuous variable categorical using as.factor()
bears %>% count(month) %>% ggplot() + geom_col( mapping = aes( x = as.factor(month), y = n) )
The variable month
is a factor
15:00
geom_col()
Use the bears
and birds
data frame to create the following plots
Themes change global features of your plot, like the background color, grid lines, etc.
Themes change global features of your plot, like the background color, grid lines, etc.
ggplot( data = bears, mapping = aes(x = year, y = age)) + geom_point()
Themes change global features of your plot, like the background color, grid lines, etc.
ggplot( data = bears, mapping = aes(x = year, y = age)) + geom_point() + theme_bw()
theme_bw()
ggplot( data = bears, mapping = aes(x = year, y = age)) + geom_point() + theme_bw()
theme_minimal()
ggplot( data = bears, mapping = aes(x = year, y = age)) + geom_point() + theme_minimal()
theme_classic()
ggplot( data = bears, mapping = aes(x = year, y = age)) + geom_point() + theme_classic()
theme_void()
ggplot( data = bears, mapping = aes(x = year, y = age)) + geom_point() + theme_void()
library(hrbrthemes)ggplot( data = bears, mapping = aes(x = year, y = age)) + geom_point() + theme_ipsum()
library(hrbrthemes)ggplot( data = bears, mapping = aes(x = year, y = age)) + geom_point() + theme_ft_rc()
library(ggthemes)ggplot( data = bears, mapping = aes(x = year, y = age)) + geom_point() + theme_economist()
library(ggthemes)ggplot( data = bears, mapping = aes(x = year, y = age)) + geom_point() + theme_economist_white()
ggsave()
ggsave()
First, assign the plot to an object name:
scatterPlot <- ggplot(data = bears) + geom_point(aes(x = year, y = age))
ggsave()
First, assign the plot to an object name:
scatterPlot <- ggplot(data = bears) + geom_point(aes(x = year, y = age))
Then use ggsave()
to save the plot:
ggsave( filename = here('plots', 'scatterPlot.png'), plot = scatterPlot, width = 6, # inches height = 4)
Use the mtcars
data frame to create the following plots
Use the mpg
data frame to create the following plot
10:00
Keyboard shortcuts
↑, ←, Pg Up, k | Go to previous slide |
↓, →, Pg Dn, Space, j | Go to next slide |
Home | Go to first slide |
End | Go to last slide |
Number + Return | Go to specific slide |
b / m / f | Toggle blackout / mirrored / fullscreen mode |
c | Clone slideshow |
p | Toggle presenter mode |
t | Restart the presentation timer |
?, h | Toggle this help |
o | Tile View: Overview of Slides |
Esc | Back to slideshow |
10:00
Explore the bears
data frame:
glimpse(bears)head(bears)
plot()
plot()
General syntax:
plot(x = x_vector, y = y_vector)
plot()
General syntax:
plot(x = x_vector, y = y_vector)
Example:
var1 <- bears$yearvar2 <- bears$ageplot(x = var1, y = var2)
plot()
x
and y
must have the same length!plot()
x
and y
must have the same length!var2 <- var2[-1]
plot()
x
and y
must have the same length!var2 <- var2[-1]
length(var1) == length(var2)
#> [1] FALSE
plot()
x
and y
must have the same length!var2 <- var2[-1]
length(var1) == length(var2)
#> [1] FALSE
plot(x = var1, y = var2)
#> Error in xy.coords(x, y, xlabel, ylabel, log): 'x' and 'y' lengths differ
plot()
prettyplot( x = bears$year, y = bears$age, col = 'darkblue', # Point color pch = 19, # Point shape main = "Age of victims over time", xlab = "Year", ylab = "Age of victim")
10:00
plot()
Does the annual number of bird impacts appear to be changing over time?
Make a plot using the birds
data frame to justify your answer.
Hint: You may need to create a summary data frame to answer this question!
Bonus: Make your plot pretty!
hist()
General syntax:
hist(x = x_vector)
hist()
General syntax:
hist(x = x_vector)
Example:
hist(bears$month)
hist()
prettyhist( x = bears$month, breaks = 12, col = 'darkred', main = "Bear killings by month", xlab = "Month", ylab = "Count")
10:00
hist()
Make plots using the birds
data frame to answer these questions
Bonus: Make your plots pretty!
Concept developed by Leland Wilkinson (1999)
ggplot2 package developed by Hadley Wickham (2005)
bears
)The ggplot()
function initializes the plot with whatever data you're using
ggplot(data = bears)
The aes()
function determines which variables will be mapped to the geometries
(e.g. the axes)
ggplot( data = bears, mapping = aes(x = year, y = age))
Use +
to add geometries (e.g. points)
ggplot( data = bears, mapping = aes(x = year, y = age)) + geom_point()
geom_point()
: scatter plotsgeom_line()
: lines connecting data pointsgeom_col()
: bar chartsgeom_boxplot()
: boxes for boxplotsgeom_point()
Add points:
ggplot( data = bears, mapping = aes(x = year, y = age)) + geom_point()
geom_point()
Change the color of all points:
ggplot( data = bears, mapping = aes(x = year, y = age)) + geom_point(color = 'blue')
geom_point()
Map the point color to a variable:
ggplot( data = bears, mapping = aes(x = year, y = age)) + geom_point(aes(color = gender))
Note that color = gender
is inside aes()
geom_point()
Adjust labels with labs()
layer:
ggplot( data = bears, mapping = aes(x = year, y = age)) + geom_point(aes(color = gender)) + labs( x = "Year", y = "Age", title = "Bear victim age over time", color = "Gender" )
10:00
geom_point()
Use the birds
data frame to create the following plots
05:00
geom_col()
With bar charts, you'll often need to create summary variables to plot
geom_col()
With bar charts, you'll often need to create summary variables to plot
Step 1: Summarize the data
bear_months <- bears %>% count(month)
Step 2: Make the plot
ggplot(data = bear_months) + geom_col(aes(x = month, y = n))
Example: count of attacks by month
geom_col()
Alternative approach: piping directly into ggplot
bears %>% count(month) %>% # Pipe into ggplot ggplot() + geom_col(aes(x = month, y = n))
geom_col()
vs. geom_bar()
geom_col()
Map both x
and y
bears %>% count(month) %>% ggplot() + geom_col(aes(x = month, y = n))
geom_bar()
Only map x
(y
is computed)
bears %>% ggplot() + geom_bar(aes(x = month))
geom_col()
Another example:
Mean age of victim in each year
bears %>% filter(!is.na(age)) %>% group_by(year) %>% summarise(meanAge = mean(age)) %>% ggplot() + geom_col(aes(x = year, y = meanAge))
width
fill
color
bears %>% count(month) %>% ggplot() + geom_col( mapping = aes(x = month, y = n), width = 0.7, fill = "blue", color = "red" )
fill
to bearType
bears %>% count(month, bearType) %>% ggplot() + geom_col( mapping = aes( x = month, y = n, fill = bearType) )
Note that I had to summarize the count by both month
and bearType
bears %>% count(month, bearType)
#> # A tibble: 27 × 3#> month bearType n#> <dbl> <chr> <int>#> 1 1 Brown 1#> 2 1 Polar 2#> 3 2 Brown 1#> 4 3 Brown 1#> 5 4 Black 1#> 6 4 Brown 3#> 7 5 Black 15#> 8 5 Brown 2#> 9 5 Polar 1#> 10 6 Black 10#> # ℹ 17 more rows
By default, R makes numeric variables continuous
bears %>% count(month) %>% ggplot() + geom_col(aes(x = month, y = n))
The variable month
is a number
You can make a continuous variable categorical using as.factor()
bears %>% count(month) %>% ggplot() + geom_col( mapping = aes( x = as.factor(month), y = n) )
The variable month
is a factor
15:00
geom_col()
Use the bears
and birds
data frame to create the following plots
Themes change global features of your plot, like the background color, grid lines, etc.
Themes change global features of your plot, like the background color, grid lines, etc.
ggplot( data = bears, mapping = aes(x = year, y = age)) + geom_point()
Themes change global features of your plot, like the background color, grid lines, etc.
ggplot( data = bears, mapping = aes(x = year, y = age)) + geom_point() + theme_bw()
theme_bw()
ggplot( data = bears, mapping = aes(x = year, y = age)) + geom_point() + theme_bw()
theme_minimal()
ggplot( data = bears, mapping = aes(x = year, y = age)) + geom_point() + theme_minimal()
theme_classic()
ggplot( data = bears, mapping = aes(x = year, y = age)) + geom_point() + theme_classic()
theme_void()
ggplot( data = bears, mapping = aes(x = year, y = age)) + geom_point() + theme_void()
library(hrbrthemes)ggplot( data = bears, mapping = aes(x = year, y = age)) + geom_point() + theme_ipsum()
library(hrbrthemes)ggplot( data = bears, mapping = aes(x = year, y = age)) + geom_point() + theme_ft_rc()
library(ggthemes)ggplot( data = bears, mapping = aes(x = year, y = age)) + geom_point() + theme_economist()
library(ggthemes)ggplot( data = bears, mapping = aes(x = year, y = age)) + geom_point() + theme_economist_white()
ggsave()
ggsave()
First, assign the plot to an object name:
scatterPlot <- ggplot(data = bears) + geom_point(aes(x = year, y = age))
ggsave()
First, assign the plot to an object name:
scatterPlot <- ggplot(data = bears) + geom_point(aes(x = year, y = age))
Then use ggsave()
to save the plot:
ggsave( filename = here('plots', 'scatterPlot.png'), plot = scatterPlot, width = 6, # inches height = 4)
Use the mtcars
data frame to create the following plots
Use the mpg
data frame to create the following plot