Archives for category: Teacher Evaluations

A teacher faces the state’s new evaluation with anxiety, uncertainty, fear, and confusion. She has a great idea about how to game this invalid, meaningless, absurd and punitive system:

It’s not just in LA that teachers are afraid. VA is implementing new eval requirements that must include student progress worth 40% of a teacher’s overal eval. The plan at my former school (might be at the new school I will be working for in the fall as well) was to develop pre and post tests to show growth over the year for non-tested subjects (and first year of a subject wouldn’t count, since obviously students would be starting at zero. I wasn’t really sure about the logic there). Yet, I was unable to ever get clear answers on how they would be implented for teachers of multiple levels/preps. One idea was that only one level would count. After I pointed out that this would incentivize focusing on only the level that counted to the detriment of the non-counted levels, I was met with silence. When I asked who was picking, I was also met with confusion. Moreover, it also occurred to me how easy it would be for some teachers to game the system. I teach Latin, which is something that most people have little familiarity with. So, I could create an insanely difficult pre-test and a cake post test so that I showed awesome growth, and really who would know, except me, and maybe the kids (certainly those of my bosses who have no familiarity with the subject would not be able to figure it out). Moreover, they wouldn’t be able to break down how students do on partiular sections and what that means, nor figure out what sections represented higher level thinking and which represented rote memorization. Finally, I began to question how one evaluated the ability to translate Latin to English by only using a multiple choice test. I wondered why the ability to translate Latin to English wasn’t measured by actually having students translate Latin to English. For the first time, I felt the fear and the pressure that teachers of the core had been feeling for years. I dread these new eval requirements with a sick feeling in the pit of my stomach that never quite goes away.

I try to be careful in terms of what I write and what I repost from readers. If readers express their personal experiences and strong views, I have no problem with that, and I never repost anything that uses epithets or goes beyond the bounds of civility or fair disagreement. If I err, it is not knowingly.

I just received a comment about Tennessee’s evaluation system and its provenance. This was in response to my post today about “legal fraud.” I would appreciate readers’ responses, not opinions, but citations and facts:

TN bought TEAM/TAP, a teacher evaluation system from a Milken owned company called NIET. http://www.tapsystem.org/about/about.taf?page=nietbio_lmilken
The 1-5 scoring rubric met the requirement in Race to the Top to evaluate teachers with “objective” measures.The Milkens have been marketing TEAM/TAP since the mid 1990’s. If it is so effective, shouldn’t there be dozens of studies replicating its success? There are none. Peer-review constrains bad science and practices, and protects those from harm who are subjected to its application. It’s not an overstatement to say that, absent critical review, the TEAM evaluation can identify teaching quality about as well as rolling dice.

That the Milken family foundation has bypassed peer review and critical analysis and sold its “product” to the taxpayers is very curious. This use of our public funds deserves further scrutiny. Recall that co-founder Michael Milken is a convicted felon. Rudy Giuliani successfully prosecuted him for a massive fraud in 1989 that destroyed his company and cost the taxpayers millions in the ensuing cascade of savings and loan failures. Milken’s crimes were so egregious, president G.W. Bush refused to give him a presidential pardon. Caution and study here would seem wise. http://articles.latimes.com/2009/feb/03/business/fi-milken3

Outsourcing to contractors with a history of fraud has the potential to be a spectacular boondoggle. Our students will be the victims. Who will be held accountable?

After my last book was published, I did some radio interviews and got some interesting feedback.

One of the most informative responses came from a distinguished professor emeritus at the University of Michigan, Harry Frank, who has written textbooks about measurement and evaluation.

His observations about testing and evaluation were brilliant. What he wrote helped me understand why NCLB had failed. As I re-read this letter, I understood better why Race to the Top will fail. For one thing, it assumes that the same tests may be used both to evaluate the teacher and to counsel the teacher. What this does, he says, is to promote cheating and teaching to the test.

And Professor Frank explains why student evaluations distort the educational process.

Professor Frank gave me his permission to reprint his letter.

.
I am by training a social psychologist, with a subspecialty and one-time consulting practice in testing and measurement.  When the Flint campus sought its first accreditation independent of the main (Ann Arbor) campus, the provost established an ad-hoc committee to develop assessment procedures.   I spent nine years on the committee, my last couple as its chair.  The procedures we developed became something of a model for the North Central Association of Schools and Colleges.  It has worked extremely well precisely because it conformed to some very fundamental principles of validation, which No Child Left Behind blatantly (if not intentionally) ignored. 

The first principle is that no assessment can be used at the same time for both counseling and for administrative decisions (retention, increment, tenure, promotion).   As you emphasized (and as every organizational psychologist with an ounce of brains wailed when No Child was first described), all this does is promote cheating and teaching to the exam (much as does the staatsexamen in Germany).   This principle is so basic that it’s often covered in the very first chapter of introductory texts on workplace performance evaluation.

Accordingly, in the very first meeting of the committee, we established an absolute firewall.  Department chairs, deans, and executive committees would never be permitted to see individual raw data; they would see only departmental pooled data.  This action did not immediately eliminate faculty resistance, but it went further in that regard than even you might imagine.  The same should apply to K-12 teachers’ unions.

Like you, I don’t think the problem is testing–any more than the problem with a badly built house is with the hammers and saws.  The problem in both cases is how potentially useful tools should be used.   Many of the current difficulties would be reduced or eliminated if it were clear that

(1)  K-12 education is a developmental process, so assessment in schools is a developmental measure not a terminal measure. The concern should be with change not simply “score.”

(2)  Assessment should be a counseling resource, not a source of extrinsic motivation, i.e., rewards and punishments for teachers, administrators, and school districts.

(3)  Student evaluations are worse than useless; they are egregiously misleading.  A 10-year study by the American Psychological Association indicated that student evaluations are correlated with only two factors:

   i.  Students’ expected course grades compared with their expected grades in other courses. 

   ii  workload (negative correlation).

For untenured faculty, course evaluations–if used for administrative decisions–therefore have the effect of motivating both grade inflation and the dumbing down of course content.

 (4)  Instruments and procedures must be national in scope and standardized in their administration and reportage (cf. your interview comments concerning the superior validity of the national examination vs. state examinations).

(5)  Data should be clustered rather than pooled.  That is, performance of mainstream students, students whose first language is not English, and developmentally disabled students should be examined separately.  It is clearly inappropriate to compare overall scores for students in, say, Birmingham, Michigan, where an overwhelming majority are native speakers of English, with students in Taos, New Mexico, where English as a first language falls behind both Spanish and Tiwa.

 (6)  Teachers should never have access in advance to test questions or even precise content.  They should be given global guidelines–general areas in which student competence is expected.

(7)  Ideally, the procedures should make no attempt to be exhaustive.  They should represent a random sampling of content, and the sample should change annually so that past tests cannot be used to prep students but can and should be used to familiarize students with the form of the questions, the level of detail expected, and so on.

Since Arne Duncan became Secretary of Education and unleashed the Race to the Top, almost every state has adopted laws to evaluate teachers by the test scores of their students. Most teachers know that this is unfair because the factors that have the greatest influence on students’ test scores are not within the control of teachers. Reformers tell us that teachers are the most important influence within the school on student scores, and that is right. But the teacher contribution to scores is dwarfed by the influence of family and other out of school factors.

It is also obvious to everyone but the U.S. Department of Education that when testing becomes the determinant of teachers’ evaluation, their reputation, and their careers, the results are predictable: narrowing of the curriculum, teaching to the test, gaming the system, and cheating. None of this improves education. Why would any responsible public official want to promote such behavior?

The eminent mathematician John Ewing, who is the president of Math for America, wrote a concise and slashing attack on the misuse of mathematics in value-added methodology. He writes about how teachers in Los Angeles were bullied by journalists who ranked them and then confronted them with their low scores. The journalists warned that value-added should not rely on a single measure, but they themselves relied on a single measure to create their rankings.

Ewing says that the public is being subjected to “mathematical intimidation” by policymakers and education “experts,” and that mathematicians have a duty to speak out and tell them to stop misusing their field for political ends.

Indiana is one of the states where the governor and the state commissioner of education seem determined to put public education out of business. They are implementing vouchers, expanding charters, and given the green light to for-profit online charter schools. They do not have a shred of evidence that any of this will improve the education of children in Indiana, but that doesn’t slow them down. They are in love with the ideology of choice and competition and the glories of the marketplace, and that’s the end of the discussion. Plenty of entrepreneurs will get rich off taxpayers’ dollars in Indiana.

Fortunately, there is strong resistance from parents and educators in Northeast Indiana. When I spoke in Indiana last fall, I met some of the parent leaders. They were in despair about the destructive policies being pushed through the legislature. I am glad to say that they organized and are speaking out. They can serve as a model for other concerned citizens.

They have drafted a statement in opposition to what Governor Mitch Daniels and State Superintendent Tony Bennett are doing. They not only oppose these harmful policies, but they offer a platform describing the positive steps that must be taken to save public education in the state of Indiana.

Congratulations to these courageous, thoughtful, and concerned citizens of Indiana!

I hope that others will take this statement of principles and adapt it to their own community and state. Help it go viral, as the Texas anti-high-stakes testing resolution has gone viral. Join with your friends and neighbors to awaken the American public to support good education policies that strengthen our public schools and our democracy.

A reader writes in response to the post about New Jersey Governor Christie:

When you solve this mystery please come and help us in Ohio to uncover why Gov. Kascich has made teachers public enemy number one. We may have defeated his infamous House Bill 5, but he and his cronies are managing to slip in most of the laws and regulations in the back door .

Are there any Republican governors who are not at war with the teachers in their state? If so, please let me know. Maybe something happened at the Republican Governors’ conference in 2011.

A reader from Pennsylvania asks whether charter schools are public schools if they seek to avoid transparency and if their teachers are not subject to the same evaluation scheme as public school teachers:

Charters insist on being called “public” schools.

Yet in Pennsylvania charters are in court trying to prevent laws requiring them to be transparent about their operations, as public schools are required to do.

The state legislature just passed a law requiring 50% of teacher evaluations to be based test scores. The law EXEMPTS charter teachers from this new evaluation system.

In the ALEC rush of legislation at the close of its session last week, a bill was introduced in the PA legislature to EXEMPT charters from the state’s Sunshine Law which requires public institutions receiving state money to be transparent about their contracts. It received 120 favorable votes in the House and failed by a few votes in the Senate.

In Philadelphia we have a charter operator, Universal, which was given Audenreid High School, which was made a charter as soon as a new facility was built at tax payer expense, operating for the past year rent and maintenance cost free. Next year they will have to pay $500,000 which just a quarter of the expense for rent and maintanence.  The SRC will cover the rest. This is in a School District which has a $265 million deficit, plans to close 65 public schools over the next few years, and is threatening to unilaterally cut the wages and benefits of public school employees.

So I take back what I said at the beginning of this thread. Charter schools are not open to public scrutiny.

In a close vote, teachers at the Green Dot charter school chain endorsed a merit pay plan tied to test scores.

Although test score-based evaluation is highly unstable, the teachers decided to go along in hopes of qualifying for a bonus.

A teacher rated effective one year may be rated ineffective the next year, because there are so many factors beyond the teacher’s control that affect student scores.

The National Council on Teacher Quality thought this was a good move. So did Green Dot CEO, Marco Petruzzi, who previously worked as a management consultant at McKinsey and Bain Capital.

Some teachers were not happy with the decision. Some were suspended or fired for fighting it. Students joined with teachers to protest, and the administration said the whole thing was blown out of proportion.

Scholars have warned that this method of evaluating teachers encourages teaching to the test, narrowing the curriculum, and other negative behaviors. Teachers who teach special education or English language learners will see the smallest gains. If these groups are underrepresented at Green Dot, as they are in many charter schools, that won’t be a problem.

I have published several posts (see here, here, and here) about Memphis, where a “Transition Planning Committee” devised a plan to merge the Memphis public schools and the Shelby County Schools. The planning was based on work by the Boston Consulting Group; the director of the TPC is the executive director of Stand for Children in Memphis. The plan proposes to shift many children out of the Memphis public schools and into new charter schools, so that charter enrollment will increase from 4% of Memphis students to 19% by 2016. The plan also involves a transfer of $212 million from the public schools to charter schools.

I have received letters from Stand for Children and from both supporters and opponents of the plan. Today I heard from Memphis teachers:

I teach in one of the grades K-3 in Memphis. In addition to the injustice of using test scores at all in making personnel decisions, K-3 teachers are evaluated based on the test scores of students they have never taught.Every teacher in Tennessee who teaches K-3 and every art, music, P.E. teacher, and librarian, instead of using their students’ value-added scores for half of their evaluation (because there aren’t any), is assigned their school’s value-added score for half of their evaluation.This is clearly designed to make the bad schools worse. Already, nearly all of the K-3 teachers at my failing school have transferred to other schools with better school-wide value-added scores. I don’t yet know who the principal has hired to replace them, but I’m guessing many will be TFA types (we also have a TFA-style program here called Memphis Teaching Fellows, run by The New Teacher Project), most of whom will be ineffective their first year.This legislation is designed to make the bad schools worse, so that they can be closed and turned into charters.The same teacher wrote this comment:In the meeting the Transition Planning Commission (TPC) had with teachers, the district strongly encouraged all teachers to go in place of faculty meeting. I didn’t go because I knew it would be a waste of time, but my colleagues went. According to them, it was a waste of time. They had a thousand teachers in the auditorium of a high school and no organization for the meeting. Teachers were not given an opportunity to speak the the group as a whole. Instead, they broke off into discussion groups comprised of a large number of teachers and one staff member. Teachers’ suggestions in these groups were written down and supposedly submitted to the TPC. The teachers I spoke to doubted anyone would read their suggestions. Why couldn’t they just ask teachers to email these suggestions, instead of wasting enormous amounts of time at the end of a long school day to organize a thousand teachers into discussion groups?

This comes from another Memphis teacher:

Let’s clear up the confusion around teacher input and the transition plan. There are NO current teachers on the Transition Planning Commission. The TPC appointed NO current teachers to the work groups who prepared the plan. The only input the TPC got from actual teachers was what they allowed them to say at community “listening tours”. These tours were usually about two hours long and held in various parts of the community. I believe there were six of these events. They were open to the public and teachers could attend and speak. The TPC members present answered virtually no questions as these were listening events. I do not call this teacher input. Additionally the teacher unions were intentionally left out of the discussions and were told their input was not needed even though the two unions involved represent over 7000 teachers. Teachers have no idea what is really in this plan.

 

At a recent meeting in New York City, Secretary of Education Arne Duncan said that “we as a country don’t know” how much student test scores should count as part of teachers’ evaluation. He said it shouldn’t be zero, and it shouldn’t be 100%. But it should be somewhere in between. As to what the number should be, the secretary said, “we don’t know.”

Here’s a thought: What if the current methods of calculating value-added are inaccurate? What if they are fundamentally flawed? What if they say nothing about teacher quality? What if they reflect who is in the class rather than teacher quality?

What if, say, a few years from now, we look back and realize they are junk science?

How much should they count then?

And if we don’t know whether they are accurate, and we don’t know if they are a reasonable measure of teacher quality, and if we have no evidence that their use in evaluation helps teachers improve or students achieve, why are we counting them at all? Shouldn’t we wait until we have clear evidence that the methods we use to evaluate teachers and principals are accurate, fair, reliable and valid, before putting them into practice?

I know that “we can’t wait,” but shouldn’t we wait long enough to know that what we are doing will help and not harm?

Or, are we still building a plane in mid-air?