Merit pay and the candle problem

April 09, 2012 in Article

Let us pretend for one moment that many of the corporate education reforms being proposed offered more than just a metaphorical big stick with which to fire teachers more easily, but also a few carrots too in the form of extra money toward paying high performers as determined by their students test scores. Yes, yes, we know.

Let's go even further, and pretend that student test scores were the perfect means with which to judge the effectiveness of any teacher. What do we know of financial incentives? From Nature magazine.

here's a simple fact we've known since 1962: using money as a motivator makes us less capable at problem-solving. It actually makes us dumber.

In the 1940, an experiment was carried out, now referred to as the "Candle Problem". The experiment has the participant try to solve the problem of how to fix a lit candle on a wall in a way so the candle wax won't drip to the floor. The participant can only use (along with the candle) a book of matches and a box of thumbtacks.

Let's go back to that Nature article to explain the rest of the experiment, and it' counterintuitive results

The only answer that really works is this: 1.Dump the tacks out of the box, 2.Tack the box to the wall, 3.Light the candle and affix it atop the box as if it were a candle-holder. Incidentally, the problem was much easier to solve if the tacks weren't in the box at the beginning. When the tacks were in the box the participant saw it only as a tack-box, not something they could use to solve the problem. This phenomenon is called "Functional fixedness."

Sam Glucksberg added a fascinating twist to this finding in his 1962 paper, "Influence of strength of drive on functional fixedness and perceptual recognition." (Journal of Experimental Psychology 1962. Vol. 63, No. 1, 36-41). He studied the effect of financial incentives on solving the candle problem. To one group he offered no money. To the other group he offered an amount of money for solving the problem fast.

Remember, there are two candle problems. Let the "Simple Candle Problem" be the one where the tacks are outside the box -- no functional fixedness. The solution is straightforward. Here are the results for those who solved it:

Simple Candle Problem Mean Times :
WITHOUT a financial incentive : 4.99 min
WITH a financial incentive : 3.67 min
Nothing unexpected here. This is a classical incentivization effect anybody would intuitively expect.

Now, let "In-Box Candle Problem" refer to the original description where the tacks start off in the box.

In-Box Candle Problem Mean Times :
WITHOUT a financial incentive : 7:41 min
WITH a financial incentive : 11:08 min
How could this be? The financial incentive made people slower? It gets worse -- the slowness increases with the incentive. The higher the monetary reward, the worse the performance! This result has been repeated many times since the original experiment.

We've published a video on this phenomenon before, titled "As Teacher Merit Pay Spreads, One Noted Voice Cries, ‘It Doesn’t Work’", and an article from the Harvard Business Review, titled "Stop Tying Pay To Performance".

Here's another video - The surprising truth about what motivates us

Knowing all this begs the question, why are we going down the path of some of these corporate education reforms, when we have known for over half a century many of them are flawed concepts that have been demonstrated to fail time and time again?

Trust the Evidence, Not Your Instincts

September 07, 2011 in Article

Consider this scenario. You have a serious illness. Your doctor prescribes an intrusive, painful, and expensive treatment— and you have to pay for it. What she doesn’t tell you—because she has not consulted the research – is that most studies show the treatment is ineffective and fraught with negative side-effects. You go through the procedure, suffer severe pain, and spend a lot of money. Unfortunately, as with most patients, the procedure proves ineffective. You later uncover the research your doctor failed to consult. When you ask why she didn’t use this evidence, she answers, “Who pays attention to studies? I have years of clinical experience. Besides, the protocol seemed like it ought to work.”

Does that sound like malpractice? It does to us. Fortunately, pressures to practice evidence-based medicine are reducing preventable errors. Not so in most of our workplaces, where failure to consider sound evidence repeatedly inflicts unnecessary damage on employee well-being and organizational performance. But it doesn’t have to be this way.

No workplace practice is as important—and apparently vexing—as pay. Many people believe that pay for-performance will work in virtually any organization, so it is implemented again and again to solve performance problems -- even in settings where evidence shows it is ineffective. Consider the recent decision to end New York City’s teacher bonus program after wasting three years and 56 million dollars. As this newspaper reported in July, a Rand Corporation study found this effort to link incentive pay to student performance “had no effect on students’ test scores, on grades on the city’s controversial A to F school report cards, or on the way teachers did their jobs.” This bad news could have been predicted before squandering all that time and money.The failure of such programs to boost student performance has been documented for decades. A careful review of pay for performance in schools in the 1980s showed these programs rarely lasted more than five years and consistently failed to improve student performance. The 300 page Rand report emphasizes that (although exceptions exist) evidence against the efficacy of teacher incentive pay in U.S. schools continues to grow stronger and is especially evident in the most rigorous studies.

This practice doesn’t just waste money. As Chicago economist Steve Levitt and others show, strong incentive programs can entice – or scare -- teachers and administrators to “cheat” on the tests, either by providing students with questions and answers in advance or changing student’s answer sheets to increase apparent performance. Recent well-publicized cheating scandals in Atlanta, Baltimore, Washington D.C., and elsewhere could have been foreseen by anyone who read and heeded this research. Building a culture of cheating in schools corrupts both students and teachers for no good purpose.

[readon2 url="http://bobsutton.typepad.com/my_weblog/2011/09/our-new-york-times-piece-on-evidence-based-management-the-uncut-version.html"]Continue reading...[/readon2]

Test-Based Incentive Programs Have Not Consistently Raised Student Achievement

May 27, 2011 in Article

The National Research Council has produce a paper that demonstrates that high stakes testing does not improve student achievement

Despite being used for several decades, test-based incentives have not consistently generated positive effects on student achievement, says a new report from the National Research Council. The report examines evidence on incentive programs, which impose sanctions or offer rewards for students, teachers, or schools on the basis of students’ test performance. Federal and state governments have increasingly relied on incentives in recent decades as a way to raise accountability in public education and in the hope of driving improvements in achievement.

School-level incentives -- like those of No Child Left Behind -- produce some of the larger effects among the programs studied, but the gains are concentrated in elementary grade mathematics and are small in comparison with the improvements the nation hopes to achieve, the report says. Evidence also suggests that high school exit exam programs, as currently implemented in many states, decrease the rate of high school graduation without increasing student achievement.

Policymakers should support the development and evaluation of promising new models that use test-based incentives in more sophisticated ways as one aspect of a richer accountability and improvement process, said the committee that wrote the report.

Incentives’ Effects on Student Achievement

Attaching incentives to test scores can encourage teachers to focus narrowly on the material tested -- in other words, to “teach to the test” -- the report says. As a result, students’ knowledge of the part of the subject matter that appears on the test may increase while their understanding of the untested portion may stay the same or even decrease, and the test scores may give an inflated picture of what students actually know with respect to the full range of content standards.

To control for any score inflation caused by teaching to the test, it is important to evaluate the effects of incentive programs not by looking at changes in the test scores tied to the incentives, but by looking at students’ scores on “low stakes” tests -- such as the National Assessment of Educational Progress -- that are not linked to incentives and are therefore less likely to be inflated, the report says.

When evaluated using low-stakes tests, the overall effects on achievement tend to be small and are effectively zero for a number of incentives programs, the committee concluded. Even when evaluated using the tests attached to the incentives, a number of programs show only small effects.

Some incentives hold teachers or students accountable, while others affect whole schools. School-level incentives like those used in No Child Left Behind produce some of the larger achievement gains, the report says, but even these have an effect size of only around .08 standard deviations – the equivalent of moving a student currently performing at the 50th percentile to the 53rd percentile. For comparison, raising student performance in the U.S. to the level of the highest-performing nations would require a gain equivalent to a student climbing from the 50th to the 84th percentile. The committee noted, however, that although a .08 effect size is small, few other education interventions have shown greater gains.

Effects of High School Exit Exams

The study also examined evidence on the effects of high school exit exams, which are currently used by 25 states and typically involve tests in multiple subjects, all of which students must pass in order to graduate. This research suggests that such exams decrease the rate of high school graduation without improvements in student achievement as measured by low-stakes tests.

Broader Measures of Performance Needed

It is unreasonable to implement incentives tied to tests on a narrow range of content and then criticize teachers for narrowing their instruction to match the tests, said the committee. When incentives are used, the performance measures need to be broad enough to align with desired student outcomes. This means not only expanding the range of content covered by tests but also considering other student outcomes beyond a single test.

Policymakers and researchers should design and evaluate alternate approaches using test-based incentives, the committee said. Among the approaches proposed during current policy debates are those that would deny tenure to teachers whose students fail to meet a minimal level of test performance. Another proposal is to use the narrow information from tests to trigger a more intensive school evaluation that would consider a broader range of information and then provide support to help schools improve. The modest and variable benefits shown by incentive programs so far, however, means that all use of incentives should be rigorously evaluated to determine what works and what does not, said the committee.

In addition, it is important that research on and development of new incentive-based approaches does not displace investment in the development of other aspects of the education system – such as improvements in curricula and instructional methods -- that are important complements to the incentives themselves, the report cautions.

The study was sponsored by Carnegie Corporation of New York and the William and Flora Hewlett Foundation. The National Academy of Sciences, National Academy of Engineering, Institute of Medicine, and National Research Council make up the National Academies. They are private, nonprofit institutions that provide science, technology, and health policy advice under a congressional charter. The Research Council is the principal operating agency of the National Academy of Sciences and the National Academy of Engineering. For more information, visit http://national-academies.org.