csemGT: An R Package for Estimating Raw-Score Conditional Standard Errors of Measurement in Generalizability Theory

Abstract

Keywords

Generalizability Theory measurement error conditional sem reliability R package absolute error variance relative error variance

Description

More than a quarter of a century ago, Brennan (1998) presented a general framework for estimating conditional standard errors of measurement (CSEMs) within Generalizability Theory (GT). For the univariate, single-facet, persons-by-items (p × i) crossed design, this approach yields person-specific estimates of absolute error variance (suited to criterion-referenced interpretations) and three estimates of relative error variance (relevant to norm-referenced testing).

Compared with alternative methods (see Lee & Harris, 2025, for a recent review), GT-based CSEM estimation offers several advantages: it requires only a single test administration, uses information from all items, accommodates dichotomous, ordinal, and continuous item responses, does not require grouping examinees by observed score, and assumes neither a specific latent variable model nor a particular distributional form for item responses. For dichotomous items, two of Brennan’s estimators also coincide with the well-known Lord and Keats-Lord binomial CSEMs. Despite these strengths, GT-based CSEM procedures have seen limited use in applied measurement.

This limited adoption stems in part from the scarcity of accessible software. Among traditional GT programs, only mGENOVA (Brennan, 2001) computes CSEMs, yet it implements only the absolute estimator and one of the three relative estimators. A more complete implementation has recently become available as gtcsem (Gempp, 2026), a user-written Stata command (StataCorp, 2025). However, no current R package provides the full set of GT-based raw-score CSEM estimators. The package gtheoryr (Tyagi, 2026) fits basic generalizability designs but does not provide CSEMs, while emreliability (Liu et al., 2025) computes some per-person CSEMs outside the GT framework. The latest version of JASP (JASP Team, 2026) includes CSEM estimators that are not GT-based.

The csemGT package fills this gap. Its core function, csem_gt(), accepts a balanced persons-by-items data matrix and computes, for each person, the absolute error variance in closed form (Brennan, 1998, Equation 20) and the relative error variance using any of the full, large_a, or uncorrelated estimators. These computations follow the corresponding formulations presented in Brennan (1998), specifically Equation 20 for absolute error variance and Equations 35–36, 40, and 41 for the relative-error variance estimators. Standard errors of CSEMs estimates can be obtained via closed-form analytical variances or item-resampling bootstrap.

Additional features include quadratic smoothing of CSEMs across the observed-score distribution, D-study extrapolation of CSEMs to alternate test lengths, and basic plotting capabilities. Resulting csem objects support print(), summary(), plot(), and coef() methods. Beyond CSEMs, the package also reports standard G-study and D-study results under the p × i design, including estimated variance components, generalizability coefficients, phi and phi(λ) coefficients, and overall absolute and relative error variances. Two synthetic datasets are included: iowa_like, based on Brennan (1998) ITED example, and ipip_like, a Likert-type conscientiousness scale.

Footnotes

ORCID iD

René Gempp

Funding

The author received no financial support for the research, authorship, and/or publication of this article.

Declaration of Conflicting Interests

The author declared no potential conflicts of interest with respect to the research, authorship, and/or publication of this article.

Data Availability Statement

csemGT (version 1.0.0), its manual, and vignettes are freely available on the Comprehensive R Archive Network (CRAN) at https://CRAN.R-project.org/package=csemGT and can be installed using install.packages(“csemGT”). It is released under the GPL (≥3) license. The development version, source code, and package website are hosted on GitHub (https://github.com/rgempp/csemGT; ).

References

Brennan

R. L.

(1998). Raw-score conditional standard errors of measurement in generalizability theory. Applied Psychological Measurement, 22(4), 307–331. https://doi.org/10.1177/014662169802200401

Brennan

R. L.

(2001). Manual for mGENOVA. Iowa testing programs. University of Iowa.

Gempp

(2026). gtcsem: Stata module to compute conditional standard errors of measurement in generalizability theory (Version 1.0.0) [Stata package]. Boston College Department of Economics. https://github.com/rgempp/gtcsem

JASP Team . (2026). JASP (Version 0.97.0) [Computer software]. https://jasp-stats.org/

Lee

Harris

D. J.

(2025). Reliability in educational measurement. In Cook

L. L.

Pitoniak

M. J.

(Eds.), Educational measurement (5th ed., pp. 277–381). Oxford University Press. https://doi.org/10.1093/oso/9780197654965.003.0005

Liu

Lee

Liang

(2025). Emreliability: Test reliability and CSEM in educational measurement (Version 1.0.0) [R package]. Comprehensive R Archive Network. https://doi.org/10.32614/CRAN.package.emreliability

StataCorp . (2025). Stata 19 base reference manual. Stata Press.

Tyagi

(2026). Gtheoryr: Simple generalizability theory for crossed and nested designs (Version 0.1.0) [R package]. Comprehensive R Archive Network. https://doi.org/10.32614/CRAN.package.gtheoryr