--- title: "Building and customizing ggplot2-based diagnostic plots with the mixpoissonreg package" author: "Alexandre B. Simas" date: '`r format(Sys.time(), "%Y-%m-%d")`' output: rmarkdown::html_vignette vignette: > %\VignetteIndexEntry{Building and customizing ggplot2-based diagnostic plots with the mixpoissonreg package} %\VignetteEngine{knitr::rmarkdown} %\VignetteEncoding{UTF-8} --- ```{r, include = FALSE} knitr::opts_chunk$set( collapse = TRUE, comment = "#>" ) ``` # `ggplot2`-based diagnostic plots To obtain `ggplot2`-based diagnostic plots for `mixpoissonreg` objects one should use the `autoplot` method. In the subsequent sections we will show how to use the `autoplot` method as well as how to use its arguments to customize the plots. ## Choosing plots The first, and maybe the easiest, customizable option is the choice of plots to be displayed. The current available plots are (together with their corresponding numbers): 1. *Residuals vs Obs. number* 2. *Normal Q-Q* 3. *Cook's distance* 4. *Generalized Cook's distance* 5. *Cook's dist vs Generalized Cook's dist* 6. *Response vs Fitted means* These plots can be chosen by giving a list of the numbers of the wanted plots in the `which` argument. If the argument `which` is not provided, then, by default, the plots 1, 2, 5 and 6 will be displayed: ```{r warning = FALSE} library(mixpoissonreg) fit <- mixpoissonreg(daysabs ~ gender + math + prog | gender + math + prog, data = Attendance) autoplot(fit) ``` If we want only a single plot, we simply indicate its number. For instance, if we only want the plot of Cook's distances, we simply set `which = 3`: ```{r} autoplot(fit, which = 3) ``` If we want more than one, we provide a list with the desired plots. Suppose we want the global influence-related plots, that is, plots 3, 4 and 5. Then, we set `which = c(3,4,5)`: ```{r} autoplot(fit, which = c(3,4,5)) ``` ## Titles and subtitles In this section we will describe how to customize titles and subtitles. First of all, by default, the type of the fitted model (that is, if it is a Negative-Binomial or Poisson Inverse Gaussian regression) is included in each title. See, for instance, the plot below: ```{r} autoplot(fit, which = 1) ``` To remove the model type, simply set `include.modeltype` to `FALSE`: ```{r} autoplot(fit, which = 1, include.modeltype = FALSE) ``` One should notice that in both plots, the type of the residual was only included in the *y*-axis label. To also include the type of the residual at the title, simply set `include.residualtype` to `TRUE`: ```{r} autoplot(fit, which = 1, include.residualtype = TRUE) ``` The titles of the plots (displayed above the plot) can be altered by setting the `title` parameter to a list containing the wanted titles. One drawback is that we have to provide a list containing the titles we want in the positions of the plots we want. For example, if we only have one plot, the following example will change the caption: ```{r} my_titles <- rep("",6) my_titles[2] <- "My title 2" autoplot(fit, which = 2, title = my_titles) ``` Similarly, if we wanted to set the titles of plots 2 and 4, we could do: ```{r} my_titles <- rep("",6) my_titles[2] <- "My title 2" my_titles[4] <- "My title 4" autoplot(fit, which = c(2,4), title = my_titles) ``` Notice that, since we did not change the `include.modeltype` argument, the model type are added in the new captions. We can change the size of the titles with the argument `title.size`. So, for instance, ```{r} autoplot(fit, which = 1, title.size = 20, include.modeltype = FALSE) ``` We can also turn the title *bold* by setting the argument `title.bold` to `TRUE`: ```{r} autoplot(fit, which = 1, title.bold = TRUE) ``` We can also change the subcaption. By default, the subcaption is a simplified version of the call to the `mixpoissonreg` function that was used to fit the model. We must have some caution on describing the subcaption. If each plot is given in one window (in the case both `ncol` and `nrow` are `NULL`), then the subcaption is the caption below the *x*-axis label. However, if multiple plots are given at once, and there is space on the upper part of the plot, then the subcaption is a general caption for all the plots. We will illustrate the above description with examples to make it clearer. The subcaption can be altered by setting the `sub.caption` parameter to the desired caption. Thus, we begin by providing the situation of one plot at a time. ```{r} autoplot(fit, which = 1, sub.caption = "My subcaption") ``` Notice that the subcaption is below the *x*-axis label. The same happens even if we there is more than one plot but both `ncol` and `nrow` are `NULL`: ```{r} autoplot(fit, sub.caption = "My subcaption") ``` Now, notice the position of the subcaption when we gather multiple plots using the one of the arguments `ncol` and/or `nrow`: ```{r} autoplot(fit, sub.caption = "My subcaption", nrow = 2) ``` In the previous case, that is, the case in which we have the subcaption above all plots,notice that the common title is bold by default. One can make it not bold by setting `gpar_sub.caption` argument to `list()`: ```{r} autoplot(fit, sub.caption = "My subcaption", nrow = 2, gpar_sub.caption = list()) ``` Similarly, we can change the size of the subcaption using the argument by defining a `fontsize` element to the `gpar_sub.caption` list. For instance, ```{r} autoplot(fit, sub.caption = "My subcaption", nrow = 2, gpar_sub.caption = list(fontface = "bold", fontsize = 30)) ``` ## Colors, sizes, point types and line types In this section we show how to customize colors, sizes and types of lines and points. First of all, let us change the shape of the points. To this end we set the `shape` argument to the desired shape: ```{r} autoplot(fit, which = 1, shape = 2) ``` Now let us deal with the point colors. To change the colors of the points, we use the `colour` argument: ```{r} autoplot(fit, which = 1, colour = "red") ``` To change the point sizes, we use the `size` argument: ```{r} autoplot(fit, which = 1, size = 5) ``` We will now deal with point labels. First of all, we may change the color of the point labels by using the argument `label.colour`: ```{r} autoplot(fit, which = 1, label.colour = "red") ``` We can change the point labels' sizes by using the `label.size` argument: ```{r} autoplot(fit, which = 1, label.size = 10) ``` Another example is the title color. We can change the title color with the `title.colour` argument: ```{r} autoplot(fit, which = 1, title.colour = "red") ``` We can also change the title size, by using the `title.size` argument: ```{r} autoplot(fit, which = 1, title.size = 20) ``` Similarly, we can change the *x* and *y* label's colors by using the `x.axis.col` and `y.axis.col` arguments: ```{r} autoplot(fit, which = 1, x.axis.col = "orange", y.axis.col = "red") ``` As well as change their sizes by using the `x.axis.size` and `y.axis.size` arguments: ```{r} autoplot(fit, which = 1, x.axis.size = 20, y.axis.size = 20) ``` We can change the `sub.caption` color by using the `sub.caption.col` argument: ```{r} autoplot(fit, which = 1, sub.caption.col = "red") ``` and change its size by using the `sub.caption.size` argument: ```{r} autoplot(fit, which = 1, sub.caption.size = 20) ``` We can also change the font face by using the `sub.caption.face` argument: ```{r} autoplot(fit, which = 1, sub.caption.face = "italic") ``` We can change the position of the subcaption by changing the `sub.caption.hjust` argument. The default is 0.5 which indicates that the subcaption is centered. We can place the subcaption to the left side of the plot by setting it to 0: ```{r} autoplot(fit, which = 1, sub.caption.hjust = 0) ``` We can also place it on the right side of the plot by setting it to 1: ```{r} autoplot(fit, which = 1, sub.caption.hjust = 1) ``` Let us now customize the lines in the Cook's distance plots, namely plots 3 and 4. By default, the line type for Cook's distance plots is "linerange": ```{r} autoplot(fit, which = c(3,4)) ``` We change the line colors by using the `colour` argument: ```{r} autoplot(fit, which = c(3,4), colour = "red") ``` Let us change the line type for Cook's distance plots to points and change the point types to crosses: ```{r} autoplot(fit, which = c(3,4), cook.plot.type = "points", cook.plot.pointshape = 4) ``` Finally, let us customize the quantile-quantile plots with and without simulated envelopes, namely, plot 2. The first customization is to remove the diagonal Q-Q line in the quantile-quantile plot without simulated envelopes: ```{r} autoplot(fit, which = 2, qqline = FALSE) ``` We can change the qqline color by using the `ad.colour` argument: ```{r} autoplot(fit, which = 2, ad.colour = "red") ``` Finally, let us consider a fitting with simulated envelopes: ```{r} fit_env <- mixpoissonregML(daysabs ~ gender + math + prog | gender + math + prog, envelope = 100, data = Attendance) autoplot(fit_env, which = 2) ``` Let us first change the color of the median line of the simulated envelopes: ```{r} autoplot(fit_env, which = 2, ad.colour = "red") ``` Let us also change the fill color of the simulated envelopes: ```{r} autoplot(fit_env, which = 2, env_fill = "red") ``` We can change the transparency of the envelope by changing the `env_alpha` argument: ```{r} autoplot(fit_env, which = 2, env_fill = "red", env_alpha = 0.2) ``` ## Points to be identified By default the `autoplot.mixpoissonreg` function always identifies the 3 "most extreme" points. We can change it so that it does not identify any points by setting the argument `label.n` to 0: ```{r} autoplot(fit, label.n = 0) ``` We can also increase the number of identified points. For instance: ```{r} autoplot(fit, label.n = 5) ``` Finally, we can change the labels of the identified points with the argument `label.label`. For instance, we may want to have the value of the `prog` covariate instead: ```{r} autoplot(fit, label.label = model.frame(fit)$prog) ``` ## Multiple plots We can easily create frames of multiple plots by simply using one of the arguments `nrow` or `ncol`. For instance, ```{r} autoplot(fit, nrow = 2) ``` As previously mentioned, we can change the common title in a multiple plot by changing the `sub.caption` argument: ```{r} autoplot(fit, nrow = 2, sub.caption = "Mixed Poisson Regression") ``` To customize the common title, one can provide a list containing the "customizations" to the `gpar_sub.caption` argument. For instance: ```{r} autoplot(fit, nrow = 2, sub.caption = "Mixed Poisson Regression", gpar_sub.caption = list(fontsize = 30, col = "red", fontface = "italic")) ```