Andrich, D. (1978). A rating formulation for ordered response categoriesPsychometrika, 43, 561-573.
2.
Bock, R.D. (1972). Estimating item parameters and latent ability when responses are scored in two or more nominal categoriesPsychometrika, 37, 29-51.
3.
Bock, R.D., & Mislevy, R.J. (1982). Adaptive EAP estimation of ability in a microcomputer environmentApplied Psychological Measurement , 6, 431-444.
4.
Brown, J.S., & Burton, R.R. (1978). Diagnostic models for procedural bugs in basic mathematical skillsCognitive Science , 2, 155-192.
5.
Brown, J.S., & VanLehn, K. (1980). Repair theory: A generative theory of bugs in procedural skillsCognitive Science, 4, 379-426.
6.
Coombs, C.H., Milholland, J.E., & Womer, F.B. (1956). The assessment of partial knowledgeEducational and Psychological Measurement, 16, 13-37.
7.
De Ayala, R.J., Dodd, B.G., & Koch, W.R. (1990). A computerized simulation of a flexilevel test and its comparison with a Bayesian computerized adaptive testJournal of Educational Measurement, 27, 227-239.
8.
De Ayala, R.J., Dodd, B.G., & Koch, W.R. (1992). A comparison of the partial credit and graded response models in computerized adaptive testingApplied Measurement in Education, 5 , 17-34.
9.
Dodd, B.G. (1984). Attitude scaling: A comparison of the graded response and partial credit latent trait models Doctoral Dissertation, The University of Texas at Austin.
10.
Dodd, B.G., Koch, W.R., & De Ayala, R.J. (1989). Operational characteristics of adaptive testing procedures using the graded response modelApplied Psychological Measurement, 13, 129-144.
11.
Frary, R.B. (1989). Partial-credit scoring methods for multiple-choice tests. Applied Measurement in Education , 2, 79-96.
12.
Haladyna, T., & Sympson, J.B. (1988, April). Empirically based polychotomous scoring of multiple-choice items: Historical overviewPaper presented at the annual meeting of the American Educational Research Association, New Orleans.
13.
Hambleton, R.K., & Swaminathan, H. (1985). Item response theory: Principles and applicationsBoston: Kluwer-Nijhoff.
14.
Hays, W.L. (1988). Statistics. New York : Holt, Rinehart, & Winston.
15.
Kingsbury, G.G., & Houser, R.L. (1988, April). A comparison of achievement level estimates from computerized adaptive testing and paper-and-pencil testingPaper presented at the annual meeting of the American Educational Research Association, New Orleans.
16.
Koch, W.R. (1981). Attitude scaling using latent trait theory Doctoral Dissertation, The University of Missouri at Columbia.
17.
Koch, W.R., & Dodd, B.G. (1989). An investigation of procedures for computerized adaptive testing using partial credit scoringApplied Measurement in Education, 2, 335-357.
18.
Lane, S., Stone, C.A., & Hsu, H. (1990, April). Diagnosing students' errors in solving algebra word problemsPaper presented at the annual meeting of the National Council on Measurement in Education, Boston.
19.
Levine, M., & Drasgow, F. (1983). The relation between incorrect option choice and estimated abilityEducational and Psychological Measurement , 43, 675-685.
20.
Lord, F.M. (1980). Applications of item response theory to practical testing problemsHillsdale NJ: Erlbaum.
21.
Masters, G.N. (1982). A Rasch model for partial credit scoring. Psychometrika, 47 , 149-174.
22.
McBride, J.R., & Martin, J.T. (1983). Reliability and validity of adaptive ability tests in a military setting. In D. J. Weiss (Ed.), New horizons in testing (pp. 223-237). New York: Academic Press.
23.
Nedelsky, L. (1954). Ability to avoid gross error as a measure of achievement. Educational and Psychological Measurement , 14, 459-472.
24.
Patience, W.M., & Reckase, M.D. (1980, April). Effects of program parameters and item pool characteristics on the bias of a three-parameter tailored testing procedurePaper presented at the annual meeting of the National Council on Measurement in Education, Boston.
25.
Reckase, M.D. (1979). Unifactor latent trait models applied to multifactor tests: Results and implicationsJournal of Educational Statistics, 4, 207-230.
26.
Samejima, F. (1969). Estimation of latent ability using a response pattern of graded scoresPsychometrika Monograph Supplement , No. 17.
27.
Sympson, J.B. (1986, August). Extracting information from wrong answers in computerized adaptive testingPaper presented at the annual meeting of the American Psychological Association, Washington D.C.
28.
Tatsuoka, K.K. (1983). Rule space: An approach for dealing with misconceptions based on item response theoryJournal of Educational Measurement , 20, 345-354.
29.
Thissen, D.J. (1976). Information in wrong responses to Raven's Progressive MatricesJournal of Educational Measurement , 13, 201-214.
30.
Thissen, D.J. (1988). MULTILOG User's Guide (Version 5.1) [Computer program and manual]Mooresville IN: Scientific Software, Inc.
31.
Thissen, D.J., Steinberg, L., & Mooney, J. (1989). Trace lines for testlets: A use of multiple-categorical response modelsJournal of Educational Measurement , 26, 247-260.
32.
Urry, V.W. (1977). Tailored testing: A successful application of latent trait theoryJournal of Educational Measurement , 14, 181-196.
33.
Vale, C.D., & Weiss, D.J. (1977). A comparison of information functions of multiple-choice and free-response vocabulary items (Research Report 77-2)Minneapolis: University of Minnesota, Department of Psychology, Computerized Adaptive Testing Laboratory.
34.
Wainer, H. (1990). Computerized adaptive testing: A primerHillsdale NJ: Erlbaum.
35.
Wainer, H., & Kiely, G.L. (1987). Item clusters and computerized adaptive testing: A case for testletsJournal of Educational Measurement , 24, 185-201.
36.
Wang, M.W., & Stanley, J.C. (1970). Differential weighting: A review of methods and empirical studies. Review of Educational Research , 40, 663-706.
37.
Weiss, D.J. (1982). Improving measurement quality and efficiency with adaptive testingApplied Psychological Measurement , 6, 473-492.
38.
Weiss, D.J. (1983). New horizons in testing: Latent trait test theory and computerized adaptive testingNew York: Academic Press.
39.
Wherry, R.J., Sr., Naylor, J.C., Wherry, R.J., Jr., & Fallis, R.F. (1965). Generating multiple samples of multivariate data with arbitrary population parameters. Psychometrika , 30, 303-314.