Evaluation of Effectiveness: Colorado

Identifying Effective Teachers Policy

Goal

The state should require instructional effectiveness to be the preponderant criterion of any teacher evaluation.

Meets goal
Suggested Citation:
National Council on Teacher Quality. (2013). Evaluation of Effectiveness: Colorado results. State Teacher Policy Database. [Data set].
Retrieved from: https://www.nctq.org/yearbook/state/CO-Evaluation-of-Effectiveness-22

Analysis of Colorado's policies

Commendably, Colorado requires that objective evidence of student learning be the preponderant criterion of its teacher evaluations. The state is in the process of developing the Colorado Model Evaluation System. Districts may adopt this system or develop their own as long as it meets or exceeds the state's rules. 
Beginning school year 2013-2014, 50 percent of the overall performance evaluation rating must be determined by multiple measures of student academic growth. Measures of student growth must include the following: a measure of individually attributed student academic growth; a measure of collectively attributed student academic growth; statewide summative assessment results, when available, and for subjects with annual statewide summative assessment results in two consecutive grades, results from the Colorado Growth Model. Additional measures may also be used. 
The remaining 50 percent are measures of professional practice. The method for evaluating teachers' professional practice must include data collection for multiple measures on multiple occasions. Data must include observations and at least one of the following measures: student perception measures (surveys); peer feedback; feedback from parents or guardians; or review of lesson plans or student work samples. 
Evaluations must use the following four rating categories: highly effective, effective, partially effective and ineffective. 

**
POLICY UPDATE
For SY 2014-15, districts have flexibility to determine how much weight the measures of student learning/outcomes standard counts in the teacher's final evaluation rating. It may be 0-50 percent. 

District assigning no weight to student learning must still submit ratings in the data submission process. 
A teacher's final evaluation rating during the 2014-15 school year WILL count toward the earning/loss of nonprobationary status. 
The state says it expects full 50 percent weight for student learning/outcomes in 2015-16 school year. 

Citation

Recommendations for Colorado



State response to our analysis

Colorado recognized the factual accuracy of this analysis.

Research rationale

Teachers should be judged primarily by their impact on students.

While many factors should be considered in formally evaluating a teacher, nothing is more important than effectiveness in the classroom. Unfortunately, districts have used many evaluation instruments, including some mandated by states that are structured, so that teachers can earn a satisfactory rating without any evidence that they are sufficiently advancing student learning in the classroom. It is often enough that teachers appear to be trying, not that they are necessarily succeeding.

Many evaluation instruments give as much weight, or more, to factors that lack any direct correlation with student performance—for example, taking professional development courses, assuming extra duties such as sponsoring a club or mentoring and getting along well with colleagues. Some instruments hesitate to hold teachers accountable for student progress. Teacher evaluation instruments should include factors that combine both human judgment and objective measures of student learning.

Evaluation of Effectiveness: Supporting Research

Reports strongly suggest that most current teacher evaluations are largely a meaningless process, failing to identify the strongest and weakest teachers. The New Teacher Project's report, "Hiring, Assignment, and Transfer in Chicago Public Schools", July 2007 at: http://www.tntp.org/files/TNTPAnalysis-Chicago.pdf, found that the CPS teacher performance evaluation system at that time did not distinguish strong performers and was ineffective at identifying poor performers and dismissing them from Chicago schools. See also Lars Lefgren and Brian Jacobs, "When Principals Rate Teachers," Education Next, Volume 6, No. 2, Spring 2006, pp.59-69. Similar findings were reported for a larger sample in The New Teacher Project's The Widget Effect (2009) at: http://widgeteffect.org/.  See also MET Project (2010). Learning about teaching: Initial findings from the measures of effective teaching project. Seattle, WA: Bill & Melinda Gates Foundation.

A Pacific Research Institute study found that in California, between 1990 and 1999, only 227 teacher dismissal cases reached the final phase of termination hearings. The authors write: "If all these cases occurred in one year, it would represent one-tenth of 1 percent of tenured teachers in the state. Yet, this number was spread out over an entire decade." In Los Angeles alone, over the same time period, only one teacher went through the dismissal process from start to finish. See Pamela A. Riley, et al., "Contract for Failure," Pacific Research Institute (2002).

That the vast majority of districts have no teachers deserving of an unsatisfactory rating does not seem to correlate with our knowledge of most professions that routinely have individuals in them who are not well suited to the job. Nor do these teacher ratings seem to correlate with school performance, suggesting teacher evaluations are not a meaningful measure of teacher effectiveness. For more information on the reliability of many evaluation systems, particularly the binary systems used by the vast majority of school districts, see S. Glazerman, D. Goldhaber, S. Loeb, S. Raudenbush, D. Staiger, and G. Whitehurst, "Evaluating Teachers: The Important Role of Value-Added." The Brookings Brown Center Task Group on Teacher Quality, 2010. 

There is growing evidence suggesting that standards-based teacher evaluations that include multiple measures of teacher effectiveness—both objective and subjective measures—correlate with teacher improvement and student achievement. For example see T. Kane, E. Taylor, J. Tyler, and A. Wooten, "Evaluating Teacher Effectiveness." Education Next, Volume 11, No. 3, Summer 2011, pp.55-60; E. Taylor and J. Tyler, "The Effect of Evaluation on Performance: Evidence from Longitudinal Student Achievement Data of Mid-Career Teachers." NBER Working Paper No. 16877, March 2011; as well as H. Heneman III, A. Milanowski, S. Kimball, and A. Odden, "CPRE Policy Brief: Standards-based Teacher Evaluation as a Foundation for Knowledge- and Skill-based Pay," Consortium for Policy Research, March 2006.