More TidierPlots and cleanup
dmbates committed May 6, 2024
1 parent 3654bc1 commit 5968b3c
Showing 5 changed files with 68 additions and 48 deletions.
10 changes: 6 additions & 4 deletions aGHQ.qmd
@@ -158,6 +158,8 @@ and define some constants
#| label: constantsA03
@isdefined(contrasts) || const contrasts = Dict{Symbol,Any}()
@isdefined(progress) || const progress = false
+TidierPlots_set("plot_show", false)
+TidierPlots_set("plot_log", false)
```
## Generalized linear models for binary data {#sec-BernoulliGLM}
@@ -404,7 +406,7 @@ Each evaluation of the deviance is fast, requiring only a fraction of a millisec
```{julia}
βopt = copy(com05fe.β)
-@b deviance(setβ!($com05fe, $βopt))
+@be deviance(setβ!($com05fe, $βopt))
```
but the already large number of evaluations for these six coefficients would not scale well as this dimension increases.
@@ -636,7 +638,7 @@ The IRLS algorithm has converged in 4 iterations to essentially the same devianc
Each iteration of the IRLS algorithm takes more time than a deviance evaluation, but still only a fraction of a millisecond on a laptop computer.
```{julia}
-@b deviance(updateβ!($com05fe))
+@be deviance(updateβ!($com05fe))
```
## GLMMs and the PIRLS algorithm {#sec-PIRLS}
@@ -867,7 +869,7 @@ pirls!(m; verbose=true);
As with IRLS, PIRLS is a fast and stable algorithm for determining the mode of the conditional distribution $(\mcU|\mcY=\bby)$ with $\bbtheta$ and $\bbbeta$ held fixed.
```{julia}
-@b pirls!($m)
+@be pirls!($m)
```
The time taken for the four iterations to determine the conditional mode of $\bbu$ is comparable to the time taken for a single call to `updateβ!`.
@@ -1143,7 +1145,7 @@ Notice that the magnitudes of the weights drop quite dramatically as the positio
```{julia}
#| code-fold: true
-#| fig-cap: Weights (logarithm base 2) and positions for the 9th order normalized Gauss-Hermite quadrature rule
+#| fig-cap: Weights (logarithm base 10) and positions for the 9th order normalized Gauss-Hermite quadrature rule
#| label: fig-ghninelog
ggplot(df9, aes(; x=:abscissae, y=:weights)) +
geom_point() +
29 changes: 16 additions & 13 deletions intro.qmd
@@ -120,6 +120,8 @@ using StatsBase # basic statistical summaries
using TidierPlots # ggplot2-like graphics in Julia
using EmbraceUncertainty: dataset # `dataset` means this one
+TidierPlots_set("plot_show", false)
+TidierPlots_set("plot_log", false)
```

A package must be attached before any of the data sets or functions in the package can be used.
@@ -621,6 +623,7 @@ Plots of the bootstrap estimates for individual parameters are obtained by extra
For example,
```{julia}
+tbl = DataFrame(dsm01samp.tbl)
βdf = @subset(dsm01pars, :type == "β")
```
@@ -633,10 +636,7 @@ You can check the details by clicking on the "Code" button in the HTML version o
#| fig-cap: Kernel density plot of bootstrap fixed-effects parameter estimates from dsm01
#| label: fig-dsm01_bs_beta_density
#| warning: false
-ggplot(
-  filter(==("β") ∘ getproperty(:type), dsm01pars),
-  aes(; x=:value),
-) +
+ggplot(tbl, aes(; x=:β1)) +
geom_density() +
labs(; x="Bootstrap samples of β₁")
```
@@ -655,15 +655,18 @@ The situation is different for the estimates of the standard deviation parameter
#| fig-cap: Kernel density plot of bootstrap variance-component parameter estimates from model dsm01
#| label: fig-dsm01_bs_sigma_density
#| warning: false
-draw(
-  data(@subset(dsm01pars, :type == "σ")) *
-  mapping(
-    :value => "Bootstrap samples of σ";
-    color=(:group => "Group"),
-  ) *
-  AlgebraOfGraphics.density();
-  figure=(; size=(600, 340)),
-)
+ggplot(stack(DataFrame(tbl), 3:4), aes(; x=:value, color=:variable)) +
+  geom_density(alpha=0.6) +
+  labs(; x="Bootstrap samples of σ")
+# draw(
+#   data(@subset(dsm01pars, :type == "σ")) *
+#   mapping(
+#     :value => "Bootstrap samples of σ";
+#     color=(:group => "Group"),
+#   ) *
+#   AlgebraOfGraphics.density();
+#   figure=(; size=(600, 340)),
+# )
```
The estimator for the residual standard deviation, $\sigma$, is approximately normally distributed but the estimator for $\sigma_1$, the standard deviation of the `batch` random effects is bimodal (i.e. has two "modes" or local maxima).
2 changes: 2 additions & 0 deletions largescaledesigned.qmd
@@ -39,6 +39,8 @@ and define some constants, if not already defined.
#| label: constants04
@isdefined(contrasts) || const contrasts = Dict{Symbol,Any}()
@isdefined(progress) || const progress = false
+TidierPlots_set("plot_show", false)
+TidierPlots_set("plot_log", false)
```

As with many techniques in data science, the place where "the rubber meets the road", as they say in the automotive industry, for mixed-effects models is when working on large-scale studies.
25 changes: 15 additions & 10 deletions longitudinal.qmd
@@ -57,6 +57,8 @@ and declare some constants, if not already defined.
#| label: constants03
@isdefined(contrasts) || const contrasts = Dict{Symbol,Any}()
@isdefined(progress) || const progress = false
+TidierPlots_set("plot_show", false)
+TidierPlots_set("plot_log", false)
```

Longitudinal data consist of repeated measurements on the same subject, or some other observational unit, taken over time.
@@ -589,6 +591,7 @@ bxm03samp = parametricbootstrap(
progress=false,
)
bxm03pars = DataFrame(bxm03samp.allpars)
+tbl = DataFrame(bxm03samp.tbl)
DataFrame(shortestcovint(bxm03samp))
```

@@ -600,16 +603,18 @@ A kernel density plot, @fig-bxm03rhodens, of the parametric bootstrap estimates
#| code-fold: true
#| fig-cap: Kernel density plots of parametric bootstrap estimates of correlation estimates from model bxm03
#| label: fig-bxm03rhodens
-#| warning: false
-draw(
-  data(@subset(bxm03pars, :type == "ρ")) *
-  mapping(
-    :value => "Bootstrap replicates of correlation estimates";
-    color=(:names => "Variables"),
-  ) *
-  AlgebraOfGraphics.density();
-  figure=(; size=(600, 400)),
-)
+ggplot(stack(tbl, [:ρ1, :ρ2, :ρ3]), aes(; x=:value, color=:variable)) +
+  geom_density() +
+  labs(; x="Bootstrap replicates of correlation estimates")
+# draw(
+#   data(@subset(bxm03pars, :type == "ρ")) *
+#   mapping(
+#     :value => "Bootstrap replicates of correlation estimates";
+#     color=(:names => "Variables"),
+#   ) *
+#   AlgebraOfGraphics.density();
+#   figure=(; size=(600, 400)),
+# )
```

Even on the scale of [Fisher's z transformation](https://en.wikipedia.org/wiki/Fisher_transformation), @fig-bxm03rhodensatanh, these estimates are highly skewed.
50 changes: 29 additions & 21 deletions multiple.qmd
@@ -58,6 +58,8 @@ and define some constants, if not already defined,
#| label: constants02
@isdefined(contrasts) || const contrasts = Dict{Symbol,Any}()
@isdefined(progress) || const progress = false
+TidierPlots_set("plot_show", false)
+TidierPlots_set("plot_log", false)
```

The mixed models considered in the previous chapter had only one random-effects term, which was a simple, scalar random-effects term, and a single fixed-effects coefficient.
@@ -239,6 +241,7 @@ A parametric bootstrap sample of the parameter estimates
#| code-fold: true
bsrng = Random.seed!(9876789)
pnm01samp = parametricbootstrap(bsrng, 10_000, pnm01; progress)
+tbl = DataFrame(pnm01samp.tbl)
pnm01pars = DataFrame(pnm01samp.allpars);
```

@@ -255,7 +258,7 @@ As for model dsm01 the bootstrap parameter estimates of the fixed-effects para
#| fig-cap: "Parametric bootstrap estimates of fixed-effects parameters in model pnm01"
#| label: fig-pnm01bsbeta
#| warning: false
-ggplot(DataFrame(pnm01samp.tbl), aes(x=:β1)) +
+ggplot(tbl, aes(x=:β1)) +
geom_density() +
labs(; x="Bootstrap samples of β₁")
```
@@ -273,16 +276,18 @@ The densities of the variance-components, on the scale of the standard deviation
#| fig-cap: "Parametric bootstrap estimates of variance components in model pnm01"
#| label: fig-pnm01bssigma
#| code-fold: true
-#| warning: false
-draw(
-  data(@subset(pnm01pars, :type == "σ")) *
-  mapping(
-    :value => "Bootstrap samples of σ";
-    color=(:group => "Group"),
-  ) *
-  AlgebraOfGraphics.density();
-  figure=(; size=(600, 340)),
-)
+ggplot(stack(tbl, [:σ, :σ1, :σ2]), aes(; x=:value, color=:variable)) +
+  geom_density() +
+  labs(; x="Bootstrap samples of σ")
+# draw(
+#   data(@subset(pnm01pars, :type == "σ")) *
+#   mapping(
+#     :value => "Bootstrap samples of σ";
+#     color=(:group => "Group"),
+#   ) *
+#   AlgebraOfGraphics.density();
+#   figure=(; size=(600, 340)),
+# )
```

The lack of precision in the estimate of $\sigma_2$, the standard deviation of the random effects for `sample`, is a consequence of only having 6 distinct levels of the `sample` factor.
@@ -437,23 +442,26 @@ Furthermore, kernel density estimates from a parametric bootstrap sample of the
```{julia}
Random.seed!(4567654)
psm01samp = parametricbootstrap(10_000, psm01; progress)
+tbl = DataFrame(psm01samp.tbl)
psm01pars = DataFrame(psm01samp.allpars);
```

```{julia}
#| fig-cap: "Kernel density plots of bootstrap estimates of σ for model psm01"
#| label: fig-psm01bssampdens
#| code-fold: true
-#| warning: false
-draw(
-  data(@subset(psm01pars, :type == "σ")) *
-  mapping(
-    :value => "Bootstrap samples of σ";
-    color=(:group => "Group"),
-  ) *
-  AlgebraOfGraphics.density();
-  figure=(; size=(600, 340)),
-)
+ggplot(stack(tbl, [:σ, :σ1, :σ2]), aes(; x=:value, color=:variable)) +
+  geom_density() +
+  labs(; x="Bootstrap samples of σ")
+# draw(
+#   data(@subset(psm01pars, :type == "σ")) *
+#   mapping(
+#     :value => "Bootstrap samples of σ";
+#     color=(:group => "Group"),
+#   ) *
+#   AlgebraOfGraphics.density();
+#   figure=(; size=(600, 340)),
+# )
```

Because there are several indications that $\sigma_2$ could reasonably be zero, resulting in a simpler model incorporating random effects for only `sample`, we perform a statistical test of this hypothesis.

2 comments on commit 5968b3c

@kliegl kliegl commented on 5968b3c May 21, 2024


@dmbates

#| fig-cap: "Conditional means of scalar random effects for item in model elm01, fit to the pruned data, versus those for model elm02, fit to the pruned data with inaccurate subjects removed."
#| label: fig-itemreelm01vselm02
# FIXME: figure out how to change the alpha on the points
ggplot( # this throws an error on the use of color=:isword - can't work out why

ggplot() appears to want string variables with an explicitly specified length (e.g., InlineStrings.String15). I converted isword from a Boolean to a String15 variable named target, with levels "nonword" and "word".
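A minimal sketch of that conversion (the toy DataFrame here is a hypothetical stand-in for condmeans; only the Boolean isword column matters):

```julia
using DataFrames, InlineStrings

# hypothetical stand-in for `condmeans`
condmeans = DataFrame(isword=[true, false, true])

# derive a fixed-width String15 column `target` with levels "nonword"/"word"
condmeans.target = String15.(ifelse.(condmeans.isword, "word", "nonword"))
```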

This chunk works for me:

typeof(condmeans.target)
# Vector{String15} (alias for Array{String15, 1})

fig1 =
  ggplot(condmeans, aes(x = :elm01, y = :elm02, color = :target)) + 
    geom_point(alpha=0.10) +
    scale_color_discrete(palette = "julia") +
    labs(x="Conditional means of item random effects for model elm01",
         y="Conditional means of item random effects for model elm02"
         )

@kliegl kliegl commented on 5968b3c May 21, 2024


@dmbates
In the ELP chapter 4, I propose to generate target very early in preprocessing and use it in place of isword. Ok?
