Norm-Referenced Standards

Norm-referenced standards are developed by testing a large number of people from a defined group and then summarizing their scores with descriptive statistics. A common norming method is the use of percentile ranks. A percentile rank norm reflects the percentage of the group that can be expected to score below a given value. For example, a 1-mile run time of 11:31 for an 11-year-old boy is at the 25th percentile: only $25\%$ of 11-year-old boys could be expected to run slower, while $75\%$ could be expected to run faster. Because a lower time is a better performance in a timed event, scoring "below" here means running slower.
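The percentile-rank idea can be sketched in a few lines of Python. This is a minimal illustration, not a published norming procedure: the function names `percentile_rank` and `to_seconds` are hypothetical, and the list of mile-run times is invented so that 11:31 happens to fall at the 25th percentile, as in the example above. Ties are simply not counted as poorer performances, which is one of several conventions in use.

```python
def to_seconds(mmss):
    """Convert a 'minutes:seconds' string such as '11:31' to total seconds."""
    minutes, seconds = mmss.split(":")
    return int(minutes) * 60 + int(seconds)


def percentile_rank(score, norm_scores, lower_is_better=False):
    """Return the percentage of the norm group that performed worse than `score`.

    For timed events (lower_is_better=True) "worse" means a larger value;
    otherwise it means a smaller value. Tied scores are not counted as worse.
    """
    if not norm_scores:
        raise ValueError("norm_scores must not be empty")
    if lower_is_better:
        worse = sum(1 for s in norm_scores if s > score)
    else:
        worse = sum(1 for s in norm_scores if s < score)
    return 100.0 * worse / len(norm_scores)


# Hypothetical norm group: mile-run times invented purely for illustration.
norm_times = [
    to_seconds(t)
    for t in ["8:10", "9:05", "9:40", "10:15", "10:50", "11:31", "12:02", "12:45"]
]

rank = percentile_rank(to_seconds("11:31"), norm_times, lower_is_better=True)
print(f"11:31 is at percentile rank {rank:.0f}")
```

A real norm table would be built from a far larger sample; the eight values here only show how the direction of "better" has to be handled for timed events.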

The characteristics of the group on which the standards were developed are important to consider whenever norm-referenced standards are used, because the norm does not always correspond to a desirable level. For example, if the average cholesterol level of men aged 40 to 49 years is $214\ \mathrm{mg/dL}$, this average does not represent a desirable level; a cholesterol level of less than $200\ \mathrm{mg/dL}$ is considered desirable for health. In this instance, being average is not desirable because high cholesterol levels have been shown to be related to the risk of coronary heart disease mortality (Anderson, Castelli, \& Levy, 1987; Castelli et al., 1977; Wood et al., 1988).

Criterion-Referenced Standards

A criterion-referenced standard is a predetermined standard that can be used to determine whether an individual has achieved a desired level of performance. It differs from a norm-referenced standard in that the individual's performance is not compared with that of other individuals; instead, it is compared against the standard. A norm-referenced evaluation can be considered a relative evaluation: an evaluation relative to norms developed on other people. A criterion-referenced evaluation can be considered an absolute evaluation: an evaluation by comparison to an absolute criterion.
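The contrast between the two kinds of evaluation can be sketched with the cholesterol figures cited earlier (a group average of 214 mg/dL and a desirable level of less than 200 mg/dL). This is only an illustrative sketch: the function names and the individual value of 210 mg/dL are hypothetical, and lower values are assumed to be better.

```python
NORM_MEAN_MG_DL = 214         # group average quoted in the text
CRITERION_CUTOFF_MG_DL = 200  # "less than 200 mg/dL" is the desirable level


def relative_to_norm(value, norm_mean=NORM_MEAN_MG_DL):
    """Norm-referenced (relative) evaluation: compare with the group average."""
    if value < norm_mean:
        return "better than the group average"
    if value > norm_mean:
        return "worse than the group average"
    return "at the group average"


def meets_criterion(value, cutoff=CRITERION_CUTOFF_MG_DL):
    """Criterion-referenced (absolute) evaluation: strictly below the cutoff."""
    return value < cutoff


# Hypothetical individual, chosen to show how the two evaluations can disagree.
cholesterol = 210
print(relative_to_norm(cholesterol))  # relative judgment against the norm
print(meets_criterion(cholesterol))   # absolute judgment against the criterion
```

In this sketch the same value looks favorable against the norm but does not meet the criterion, which is the situation the cholesterol example describes.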

Many authors use the term criterion-referenced test, suggesting that the difference lies not just in the standard but also in the method used to develop the test (Glaser \& Nitko, 1971; Safrit, 1989). Glaser and Nitko (1971) define a criterion-referenced test as one developed to provide measurements that are directly interpretable in terms of explicit performance standards. Although some tests used in education were constructed as criterion-referenced tests, the more common practice is to apply a criterion-referenced standard to a test that was originally developed as a norm-referenced test. For example, the mile run is a youth fitness test item designed to assess aerobic fitness. The mile run was previously used in a norm-referenced fashion, by interpreting how well the performer compared with others of the same age and sex. Currently, the mile run is used in FITNESSGRAM in a criterion-referenced fashion to determine whether or not the performance met the standard. In this instance, the test itself has not changed, but the type of standard used to evaluate aerobic fitness has.

