aov package:stats R Documentation _F_i_t _a_n _A_n_a_l_y_s_i_s _o_f _V_a_r_i_a_n_c_e _M_o_d_e_l _D_e_s_c_r_i_p_t_i_o_n: Fit an analysis of variance model by a call to 'lm' for each stratum. _U_s_a_g_e: aov(formula, data = NULL, projections = FALSE, qr = TRUE, contrasts = NULL, ...) _A_r_g_u_m_e_n_t_s: formula: A formula specifying the model. data: A data frame in which the variables specified in the formula will be found. If missing, the variables are searched for in the standard way. projections: Logical flag: should the projections be returned? qr: Logical flag: should the QR decomposition be returned? contrasts: A list of contrasts to be used for some of the factors in the formula. These are not used for any 'Error' term, and supplying contrasts for factors only in the 'Error' term will give a warning. ...: Arguments to be passed to 'lm', such as 'subset' or 'na.action'. See 'Details' about 'weights'. _D_e_t_a_i_l_s: This provides a wrapper to 'lm' for fitting linear models to balanced or unbalanced experimental designs. The main difference from 'lm' is in the way 'print', 'summary' and so on handle the fit: this is expressed in the traditional language of the analysis of variance rather than that of linear models. If the formula contains a single 'Error' term, this is used to specify error strata, and appropriate models are fitted within each error stratum. The formula can specify multiple responses. Weights can be specified by a 'weights' argument, but should not be used with an 'Error' term, and are incompletely supported (e.g., not by 'model.tables'). _V_a_l_u_e: An object of class 'c("aov", "lm")' or for multiple responses of class 'c("maov", "aov", "mlm", "lm")' or for multiple error strata of class '"aovlist"'. There are 'print' and 'summary' methods available for these. _N_o_t_e: 'aov' is designed for balanced designs, and the results can be hard to interpret without balance: beware that missing values in the response(s) will likely lose the balance. If there are two or more error strata, the methods used are statistically inefficient without balance, and it may be better to use 'lme'. Balance can be checked with the 'replications' function. The default 'contrasts' in R are not orthogonal contrasts, and 'aov' and its helper functions will work better with such contrasts: see the examples for how to select these. _A_u_t_h_o_r(_s): The design was inspired by the S function of the same name described in Chambers _et al._ (1992). _R_e_f_e_r_e_n_c_e_s: Chambers, J. M., Freeny, A and Heiberger, R. M. (1992) _Analysis of variance; designed experiments._ Chapter 5 of _Statistical Models in S_ eds J. M. Chambers and T. J. Hastie, Wadsworth & Brooks/Cole. _S_e_e _A_l_s_o: 'lm', 'summary.aov', 'replications', 'alias', 'proj', 'model.tables', 'TukeyHSD' _E_x_a_m_p_l_e_s: ## From Venables and Ripley (2002) p.165. utils::data(npk, package="MASS") ## Set orthogonal contrasts. op <- options(contrasts=c("contr.helmert", "contr.poly")) ( npk.aov <- aov(yield ~ block + N*P*K, npk) ) summary(npk.aov) coefficients(npk.aov) ## to show the effects of re-ordering terms contrast the two fits aov(yield ~ block + N * P + K, npk) aov(terms(yield ~ block + N * P + K, keep.order=TRUE), npk) ## as a test, not particularly sensible statistically npk.aovE <- aov(yield ~ N*P*K + Error(block), npk) npk.aovE summary(npk.aovE) options(op)# reset to previous