Pratice for Exploratory analysis / data visualization for Quiz 7
Create a plot with the faithful dataset
add points with geom_point
assign the variable eruptions to the x-axis
assign the variable waiting to the y-axis colour the points according to whether waiting is smaller or greater than 76
ggplot(faithful) +
geom_point(aes(x = eruptions, y = waiting, colour = waiting > 76))

faithful datasetgeom_point
eruptions to the x-axiswaiting to the y-axispurple to all the pointsggplot(faithful) +
geom_point(aes(x = eruptions, y = waiting),
colour = 'purple')

faithful datasetgeom_histogram() to plot the distribution of waiting time
waiting to the x-axisggplot(faithful) +
geom_histogram(aes(x = waiting))

See how shapes and sizes of points can be specified here: https://ggplot2.tidyverse.org/articles/ggplot2-specs.html#sec:shape-spec
Create a plot with the faithful dataset
add points with geom_point
ggplot(faithful) +
geom_point(aes(x = eruptions, y = waiting), shape = "cross", size = 7, alpha = 0.6)

faithful datasetgeom_histogram() to plot the distribution of the eruptions (time)ggplot(faithful) +
geom_histogram(aes(x = eruptions, fill = eruptions > 3.2))

mpg datasetgeom_bar() to create a bar chart of the variable manufacturermanufacturer instead of classmpg_counted <- mpg %>%
count(manufacturer, name = 'count')
ggplot(mpg_counted) +
geom_bar(aes(x = manufacturer, y = count), stat = 'identity')

class to manufacturerggplot(mpg) +
geom_bar(aes(x = manufacturer, y = after_stat(100 * count / sum(count))))

for reference see: https://ggplot2.tidyverse.org/reference/stat_summary.html?q=stat%20_%20summary#examples
Use stat_summary() to add a dot at the median of each group
orangesquare9ggplot(mpg) +
geom_jitter(aes(x = class, y = hwy), width = 0.2) + stat_summary(aes(x = class, y = hwy),fun = "median", geom = 'point', colour = 'orange', shape = "square", size = 9)
