The New York Times published an article yesterday about the fall and stagnation of scores in reading and math in the U.S. it was written by Claire Cain Miller, Francesca Paris and Sarah Mervosh. The declines are no longer the fault of the pandemic. They cut across income, racial, and geographic divides.
The link above goes to s gift article, so please open and review the graphs and finish the article.
The authors attribute the stagnation to two likely phenomena: 1) easing the testing-and-accountability pressure of the NCLB-Race to the Top era; and 2) the ubiquity of Ed-tech in the schools.
I reject the claim that scores have stagnated because of the easing of NCLB-RTTT pressures. Sure, they increased pressure on students, teachers, and principals, but their negative effects undermined the quality of education. Picking the right bubble on a standardized test became the goal of education.
Campbell’s Law says that when a measure becomes the goal, it loses its value as a measure.
Social scientist Donald Campbell wrote that “the more any quantitative social indicator is used for social decision-making, the more subject it will be to corruption pressures and the more apt it will be to distort and corrupt the social processes it is intended to monitor.”
Another way of putting it: “the more important a metric is in social decision making, the more likely it is to be manipulated.”
Lest we forget: NCLB brought us cheating on an industrial scale. Rigging the system to improve scores. Narrowing the curriculum, with schools making time for test prep by dropping the arts, recess, physical education, and allotting less time for subjects that were not tested, such as civics, history, foreign languages, and science. Fewer teachers assigned whole books, but instead focused on short passages, the kind that appear on standardized tests.
The tests themselves are flawed. The scoring is flawed. The underlying assumption that every question has a right answer and only one right answer is bad teaching.
I have written long essays and chapters in books about how standardized testing is toxic to the principles of good education. Guessing “the right answer” does not promote critical thinking, which might lead a student to pick a different answer or two right answers. As I have written elsewhere, asking the right question matters more than guessing the right answer.
Testing experts like Daniel Koretz have demonstrated their limitations. Todd Farley, in “Making the Grades,” showed how shabbily the tests are scored.
As the Times‘ article points out, other countries have experienced the same score decline and stagnation, even without NCLB and RTTT.
For God’s sake and for our children’s sake, let us not return to the horrid era of test and punish. Let it go. Students may get bigger test scores under pressure, but they may be less interested in learning.
Many European nations have concluded, as I showed in several articles posted here last week, that Ed-tech in the classroom has dampened students’ attention, persistence, and interest in learning. Sweden and Norway are pulling the plugs. Norway never fell for the tech revolution. See the Sweden article here. See the Norway post here.
If the testing industry and the heroes of yesteryear’s failed reforms want another go at killing love of learning, the parents of America will have to organize and stop them with massive opt outs. Again.
***********************
The Times’ story begins like this;
Something troubling is happening in U.S. education.
Almost everywhere in America, students are performing worse than their peers were 10 years ago, according to new, district-level test score data released Wednesday by the Educational Opportunity Project at Stanford.
Compared with a decade earlier, reading scores were down last year in 83 percent of school districts where data was available. Math scores were down in 70 percent. The declines have affected both rich and poor districts, and crossed racial and geographic divides….
The new data provides the first national comparison of school districts through 2025, and offers a detailed picture of how individual school districts have performed over time. It underscores that many districts have experienced a long-term slump in student achievement, not just a blip during the pandemic.
From 2017 to 2019, students lost as much ground in reading as they did during the pandemic, and reading scores continued to fall at a similar rate through 2024.
Immediately after the pandemic, there was hope that students would recover quickly. The new data shows that scores inched upward in reading last year, and have climbed more steadily in math since 2022. But it has been nowhere near enough to make up for lost ground, researchers said….
The biggest losses have been among the lowest-achieving students….
Education experts say there is no single reason for the declines. But the timing provides some clues.
Students’ test scores had been increasing since 1990 — then abruptly stopped in the mid-2010s. That coincided with two events: an easing of federal school accountability under No Child Left Behind, which was replaced in 2015, and the rise of smartphones, social media and personalized school laptops.
The pandemic then accelerated learning declines, especially for the poorest students. Some pandemic effects have lingered. Student absenteeism, for example, remains higher than prepandemic.
Nationwide declines
In one in three school districts in the United States, students are reading a full grade level lower than they were in 2015…
Some experts believe that the end of No Child Left Behind, the contentious school accountability law signed by President George W. Bush in 2002, explains some of the recent test score declines.
The law set a goal that all students would be proficient in reading and math, and schools that did not show progress could face penalties. It coincided with a period of rising test scores, especially in math, though reading scores improved more modestly. Low-performing students saw the biggest gains.
The law, though, was deeply unpopular with many educators and parents. Critics said it put an outsize focus on testing, pushing schools to teach to the test and spend less time on other important subjects, like the arts or social studies. In 2015, Congress replaced it, and many states dialed back on requirements.
Like many who have studied the law, Brian A. Jacob, professor of education policy at the University of Michigan, showed that it increased test scores but had problematic elements.
“It was not a cure-all, but I think it really did improve student achievement,” he said. “There’s evidence that school accountability does change behaviors of teachers and administrators and probably parents and students.”
Beyond the policy specifics, its passage reflected a nationwide, bipartisan push to improve education, some experts said, that the country seems to have lost in its absence.
Yet some other countries have seen similar declines in scores, suggesting additional factors may be at play.
Screens, screens everywhere
Something happened globally around the same time: the proliferation of devices, at home and in school.
Nearly half of American teenagers now say they are online “almost constantly,” compared with just under a quarter who said that a decade ago, according to Pew Research Center. Virtually all schools give children laptops or tablets in class, as early as kindergarten.
Few rigorous studies have teased out the role of devices in academic outcomes. Yet educators say there’s no question that swiping has decreased students’ focus and persistence, and time on devices has displaced time spent reading or studying. Far more teenagers — nearly one in three — now say they “never or hardly ever” read for fun.
In turn, schools expect less from students, assigning fewer whole books and simplifying the curriculum, said Carol Jago, associate director of the California Reading and Literature Project at the University of California, Los Angeles.
“There’s no other way, except volume, in order to become a really proficient, fluent, avid reader,” she said.
Radnor Township, an affluent district outside Philadelphia, is one of the highest scoring in Pennsylvania. Teachers still expect students to read full books, including novels like “To Kill a Mockingbird.” The vast majority of students are proficient readers. Still, fewer score at an advanced level on state tests — under 40 percent last year, down from 51 percent in 2015.

Are we even sure the test scores are accurate? ACT just had a nationwide glitch that forced them to revoke reported scores due to some kind of error. “Corrected” scores will be available June 2. If they can mess up that badly, I have no doubt the other standardized testers can do it too.
LikeLiked by 3 people
It is clear that test scores have stagnated in the absence of a serious effort to improve the process. Why on Earth would we expect this year’s crop of fourth graders to be any better than last year’s crop. That’s like expecting this years corn crop to be better than last years, even though everything is done the same.
In fact an expectation that scores would increase without a process in place to do so is a delusion. “Emphasizing” the scores is not an improvement process. At best it is a pressure tactic to get teachers to figure out by themselves “how to do it better.”
LikeLiked by 1 person
Steve,
You hit on the crazy idea at the heart of NAEP commentary: the expectation that this year’s 4th and 8th graders will have higher scores than those who precede them.
It’s counted a “failure” by the commentariat is scores are the same. Why shouldn’t they be?
LikeLike
A prediction of whether scores should go up pr down should be based upon many factors: Is there a drug gang war going on in the neighborhood? Is their widespread hunger in the district? Has the quality and/or training of a district’s teachers gone up or down, and so on. Has there been a serious pandemic raging? (I know I am preaching to the choir, but others do read these comments.)
LikeLike
As a teacher I would like to add that the testing culture has influenced teaching and perhaps degraded it. Instruction is often now implementd without the time or possibility of reflection, and without the idea of tweaking, altering, individualizing the ideas and the organization of the curriculum. The feeling of teaching has been negatively affected.
LikeLike
It is a naive assumption to believe that standardized scores are dropping due to the end of the punitive aspect of NCLB. This is a simplistic assumption. There is a consensus that online delivery of academics is failing our young people, and the absence of reading, writing and thinking in daily instruction has taken a toll. Some schools do not even have a functioning library or a school librarian. Some of the decline may also be due to funding cuts that many districts have faced from ever expanding privatization. It is difficult to support comprehensive school programming on smaller and smaller budgets.
Another consideration may be to look at the poverty levels in public schools which may be increasing due to loss of middle class students to charters and now vouchers and the fact that many employees are victims of wage stagnation while costs rise. The media often looks to assign blame teachers and make them the scapegoat for society’s problems as teachers are easy targets for lazy journalists that fail to research their assertions.
LikeLike
Thanks for this response. I didn’t recognize any of the piece’s authors, when I read it yesterday, but I recognized their thought process: how much students have learned is entirely determined by standardized tests, which are our educational product (and that’s not only OK, but “scientific”).
As for their two possible rationales, easing up on the accountability pressure has never actually happened, but we’re hearing that constantly in MI, over the Governor’s support for ending mandatory retention. Now, all the Republicans want to be just like Mississippi, flunking poor kids (and costing their districts another year’s worth of per-pupil funding) to juke the stats.
Nobody wants to think about the pandemic, but my personal theory is that we have underestimated its impact for that very reason: avoidance and a deep desire to be ‘normal’ again. I work as a volunteer in two local districts, and teachers talk all the time about how different middle schoolers (who were in kindergarten and first grade during the worst of the pandemic) are from previous classes. Less organized, less concerned about achievement, lower literacies (both verbal and mathematical), more free-floating parent anger. There was never a bounce-back from the trauma.
LikeLike
Well said about lingering effects of pandemic.
LikeLike
There’s been no “Mississippi Miracle.” Mississippi holding back third graders vis-à-vis the state’s Literacy-Based Promotion Act of 2013 did not shift the state to a higher trajectory. Rather, the Act turned out to be an addition to the many “common cause” factors keeping Mississippi on its continually improving trajectory that began in 2002 or earlier. The Act hasn’t been a “special cause” factor.
Mississippi Miracle? Rather, Mississippi’s journey of continual improvement.
LikeLike
Yes, it’s funny that Mississippi should be held up as a model when it is one of the poorest, most segregated, most racist states in the nation. But the test scores/–/have gone up but are still below the national average.
LikeLike
There’s been no “Mississippi Miracle.” Mississippi holding back third graders vis-à-vis the state’s Literacy-Based Promotion Act of 2013 did not shift the state to a higher trajectory. Rather, the Act turned out to be an addition to the many “common cause” factors keeping Mississippi on its continually improving trajectory that began in 2002 or earlier. The Act hasn’t been a “special cause” factor.
Mississippi Miracle? Rather, Mississippi’s journey of continual improvement.
LikeLike
I do not think I can pinpoint a real recovery from the trauma of the 2008 recession. Even the children I taught who came from a stable background seemed to mirror the social problems that ultimately gave us the 2016 election of Trump and the subsequent problems. Imagine what is going to follow the present debacle.
LikeLike
Boy, isn’t school choice working so well.
Sooo, lemme get this straight, New York Times. The event during the Obama administration that negatively affected the (less than meaningful) test scores was easing the NCLB requirement that every student would score “proficient” by 2014? Hmmm… I seem to recall a different Obama era event, a certain basketball playing secretary of education saying in 2009 that it would take ten years to realize the miracle-to-be “achievement” gains of the extremist Race to the Top grant competition with its hasty, sloppy Common Core obfuscation of sense tests. He said that in 2009, and the scores stagnated by 2019. What a coinkydink!
Charters, vouchers, deprofessionalization, and Common Core — those were the policies that dominated the period of stagnation noted by the Times. When are we going to look the Common Core standards in the face? They stink. We’re still dealing with them.
You know, it actually is possible to boil down the “achievement” stagnation problem to one single cause: the Billionaire Boys and Girls Club. Heck, Bill Gates all by himself…
LikeLike
Yes, yes, YES!!
LikeLiked by 1 person
Did I say Duncan predicted “miracle-to-be achievement gains” in ten years from Race to the Top? I meant he predicted Scotts Miracle-Gro-to-be achievement gains. (That’s a Michelle Rhee pawning off bull droppings joke, folks.) Common Core by another name still smells as poop.
LikeLike
Duncan was wrong about so many things.
LikeLike
My comment is awaiting moderation as usual. I don’t think the moderator bot likes my writing style very much. Probably not tinny enough.
LikeLike
Consider the data this report draws from: SEDA [Stanford Education Data Archive], which = district-level data on the annual state-standardized assessments for 3rd-8th grade + 1 hisch yr. I.e., NCLB test scores. There are many issues with using these test scores to compare student progress/ achievement from one year to another. An obvious one: the vast majority of states have changed their annual state-standardized testing vendors at least once, with many doing so multiple times. This report is handy for summarizing other basic problems: https://fas.org/publication/improving-standardized-test-scoring/ [11/5/25 “Improving Standardized Test Score Reporting and Administration for Students, Caregivers, and Educators” from Federation of American Scientists]
Consider the source. Stanford has long been in business to cast shade on public school ed achievement. Home of Hoover Institution’s Eric Hanushek (and, from 2015-2018, Raj Chetty). Founder of Stanford-Binet IQ testing a century ago. K12 tester par excellence: Stanford Achievement Test Series from 1922 to present (currently in its 10th version). Motto: Weigh the Pig as Often as Possible. With a cite from an AEI stink-tanker thrown in for good measure. These people have a product to sell, and a political agenda.
Consider the immediate source: NYT, whose educational reporting over the years has been lazy and uninformed, often sounding like a cut&paste of a lobbyist’s press release. Let’s take this article as an example. “Students’ test scores had been increasing since 1990 (etc exaggeration e.g. fell off a cliff in 2012). Here, they switch from Stanford to NAEP long-term trend assessment. Take a look at the charts at the link. Did reporters? 8th gr reading scores were static throughout the ‘90s, with no more than a 1% variation. A tiny dip in ’04; 1.9% improvement thro ’12, ending back at ’04 level by 2023. [Went up 1% by 2024]. These variations are not newsworthy. The reporters stretch to “explain”—including the dubious claim that there was “an easing of federal school accountability under No Child Left Behind” via ESSA*. And hey if they’re looking to “explain” what appears in their microscope like a decline, why not consider the 2009-2010 imposition of Common Core-based assessments in 46 states?
I would be very interested in a comprehensive retrospective on 4th/ 8th/ 12th grade NAEP results. This one– hard pass.
* The USDE must review and approve state ed-accountability systems, which must include academic indicators (test scores, graduation rates, at least one non-academic indicator of school quality. These systems must still identify and address measures to be taken for the lowest-performing 5% of schools.
LikeLiked by 1 person
Calling Duane Swacker!
Also, Diane, thanks for citing Todd Farley’s 2009 (yes, 2009, & NOTHING has changed for the better) book, Making the Grades: My Misadventures in the Standardized Testing Industry. The tests were, have been & will continue to be $–T, plain & simple. They are not/have never been “standardized,” best scores are determined by zipcodes & those grading the tests have been mostly unqualified to do so. For those of you who haven’t read this book, it’s a must-read. (& if you can’t find the book or want to read more, Diane had posted “An Interview w/Todd Farley” in December 2012; also, you can find his earlier writings (rantings!) on Huffington Post.
It’s sad to have to bring this up, but I’ve always found it interesting (& the absolute right thing to do) that, after the Sandy Hook school shooting, the governor of CT decreed that the surviving students would not have to take state tests that year.
And yet our students who live in neighborhoods where violence (& poverty)is a daily part of life are forced to take these tests EVERY year.
LikeLike
Dr. Ravitch – I have written many times and will write here again – there are many things that I dislike about the testing process. My students just underwent the standardized testing over the past two weeks. There is no incentive for them to do well (or not), which I think is part of the reason why you don’t see an increase.
What I have yet to see from you (or from others that are so against standardized testing) is what measure do we use to make sure that our students are a) improving in their learning and b) how do we help insure that our teachers are doing the jobs they should be doing in sharing the material.
I know you may respond to me (as you have in the past) that it becomes the job of the principal, or others, to make sure these things are happening. But you also have written that principals are overburdened. Your answer may be to take the burden off by adding admin staff – but well, in my county right now there are concerns the local county council will not fully fund the school system (the gap in the budget is between 18 million and up to 90 million). Some of the gap is certainly due to the lack of support from the federal govt and it’s Dept of Ed, and then states not getting the support in their funding. But the gap is also due to other things – 1) Schools misusing the funds they received from COVID era ESSA funding and then realizing they had to make cuts because they overspent, 2) States creating sweeping education plans – see Maryland’s Blueprint for Education Reform – but then passing the costs onto the districts (aka unfunded mandates).
I have taught in too many schools where I have seen my fellow teachers dumb down the curriculum so that students get A’s. We have also seen a decline in the expectations for all students – where in English classes students are not expected to read full novels, but rather snippets of books. Some will point to testing as an issue for this, but there are other issues as well – a fear to push students because of the possible mental health issues. Students will say “I’m anxious or stressed” at the drop of a hat, and we say “Oh, I’m so sorry” Let me clear, I understand the issues related to mental health.
Here is my point – education is very complex. To say we should not have testing does not address the problems that George W. Bush attempted to solve. I know, you were there, and you have done a complete 180 on this topic. I get all of this…but then, I have yet to hear from you (or many others) real solutions to how we fix things.
LikeLike
We should have testing. It should be used to inform teachers and students, not to punish them.
LikeLike
Yes–tests where teachers & parents receive results that they can actually read, use, interpret to find areas of strength (okay, Johnny has 98% in multiplication, so we don’t have to teach to that) & weakness (his Reading Comprehension is 60%; teacher should find strategies for teaching skillset & helping parents follow-up at home).
Let’s return to administering the VERY easily reported & interpreted Iowa Tests of Basic Skills!!!
LikeLike