Pratice for Exploratory analysis / data visualization for Quiz 7
Create a plot with the faithful dataset
add points with geom_point
assign the variable eruptions to the x-axis
assign the variable waiting to the y-axis colour the points according to whether waiting is smaller or greater than 76
ggplot(faithful) +
geom_point(aes(x = eruptions, y = waiting, colour = waiting > 76))
faithful
datasetgeom_point
eruptions
to the x-axiswaiting
to the y-axispurple
to all the pointsggplot(faithful) +
geom_point(aes(x = eruptions, y = waiting),
colour = 'purple')
faithful
datasetgeom_histogram()
to plot the distribution of waiting
time
waiting
to the x-axisggplot(faithful) +
geom_histogram(aes(x = waiting))
See how shapes and sizes of points can be specified here: https://ggplot2.tidyverse.org/articles/ggplot2-specs.html#sec:shape-spec
Create a plot with the faithful dataset
add points with geom_point
ggplot(faithful) +
geom_point(aes(x = eruptions, y = waiting), shape = "cross", size = 7, alpha = 0.6)
faithful
datasetgeom_histogram()
to plot the distribution of the eruptions
(time)ggplot(faithful) +
geom_histogram(aes(x = eruptions, fill = eruptions > 3.2))
mpg
datasetgeom_bar()
to create a bar chart of the variable manufacturer
manufacturer
instead of class
class
to manufacturer
ggplot(mpg) +
geom_bar(aes(x = manufacturer, y = after_stat(100 * count / sum(count))))
for reference see: https://ggplot2.tidyverse.org/reference/stat_summary.html?q=stat%20_%20summary#examples
Use stat_summary()
to add a dot at the median
of each group
orange
square
9
ggplot(mpg) +
geom_jitter(aes(x = class, y = hwy), width = 0.2) + stat_summary(aes(x = class, y = hwy),fun = "median", geom = 'point', colour = 'orange', shape = "square", size = 9)