Making the grade

New data show more than half of NYC teachers judged, in part, by test scores they don’t directly affect

PHOTO: Christina Veiga

Just over half of New York City teachers were evaluated in the 2015–16 school year, in part, by tests in subjects or of students they didn’t teach, according to data obtained by Chalkbeat through a public records request.

At 53 percent of city teachers, it’s significant number, but substantially lower than in previous years, possibly thanks to a moratorium placed on using state tests, instituted mid-year.

That figure also highlights a key tension in evaluating all teachers by student achievement, even teachers who work with young students or in subjects like physical education. Being judged by other teachers’ students or subjects has long annoyed some educators and relieved others, who otherwise might have had to administer additional tests.

Supporters say evaluating teachers by group measures — often school-wide scores on standardized tests — helps create a sense of shared mission in a school. But the approach could also push teachers away from working in struggling schools.

“The key point around school-wide measures is that this could serve as a strong disincentive for these teachers in non-tested grades and subjects to stay in lower-performing schools,” said Matthew Steinberg at the University of Pennsylvania, who has studied teacher evaluation systems.

Will Mantell, a spokesperson for the New York City Department of Education, defended the district’s approach.

“Selecting school-wide [or] grade-wide … measures may better measure educators’ practice and support professional development,” he said. “For example, it makes sense for a social studies teacher who emphasizes writing in her classroom to be evaluated partially on an assessment of students’ ELA skills.”

New York’s evaluation system has gone through a number of substantial changes since it was first codified in state law in 2012, part of a nationwide push to connect teacher performance to student test scores, spurred by federal incentives.

Student assessments have comprised anywhere from 40 percent of the evaluation to essentially 50 percent, under a matrix system pushed by Governor Andrew Cuomo in 2015. Most recently, New York stopped using grades 3-8 English and math state tests as part of the system, but teachers must continue to be judged based on some assessment.

States across the country have struggled to evaluate teachers in traditionally non-tested grades and subjects. New York City has created a number of exams — known as performance assessments — in non-tested areas and given schools significant flexibility in which measures are used to judge their teachers.

In the 2015-16 school year, 53 percent of teachers were evaluated by a group metric, meaning one not focused on their subject or students. In the two previous years, the number was much higher — around 85 percent. It’s not clear why there was a substantial drop, but a spokesperson for the city’s education department notes that 2015-16 was an “outlier” due to the moratorium on state tests, instituted mid-year.

In all three years, most teachers were also evaluated by at least one individualized measure targeted to teachers’ grade, subject and students.

Data for the most recent school year are not yet available.

It’s also not clear what percentage of a teacher’s rating was based on group measures, and Mantell said this “varies from teacher to teacher.”

The United Federation of Teachers has pushed to give schools more individual options, including the use of more “authentic” assessments, not based on multiple choice questions.

“Right now, we don’t have enough options, which is why our most recent agreement with the DOE seeks to build more authentic assessments for additional grades and subjects,” said Michael Mulgrew, president of the UFT in a statement.

Group measures offer an alternative to creating exams for each teacher in every grade and subject, which can lead to a proliferation of new tests, though in New York City teachers have often been judged by both group and individual metrics.

The challenge of evaluating teachers in traditionally untested areas is not unique to New York, and a number of states have embraced group or school-wide approaches. An analysis of 32 states, conducted by Steinberg, found that the average teacher in a non-tested grade or subject had about 7 percent of his or her evaluation based on school-wide achievement measures, though this averaged together substantial variation from place to place. Teachers in Tennessee and Florida have sued (unsuccessfully), arguing that it is unfair to evaluate them based on students they didn’t teach.

A more popular option, used in some districts in New York, has been student-learning objectives, in which teachers set goals for students often based on classroom exams. This approach has been praised for helping teachers set specific goals, but criticized as burdensome and easy to manipulate.

Research has found that using school-wide measures of performance tends to bring teachers closer to average performance. An analysis by the Brookings Institution showed that these group measures pulled down ratings of teachers with higher individual ratings at low-performing schools.

good news bad news

Most Tennessee districts are showing academic growth, but districts with the farthest to go improved the least

PHOTO: Alan Petersime

It’s not just Memphis: Across Tennessee, districts with many struggling schools posted lower-than-expected growth scores on this year’s state exams, according to data released Tuesday.

The majority of Tennessee’s 147 districts did post scores that suggest students are making or exceeding expected progress, with over a third earning the top growth score.

But most students in three of the state’s four largest districts — in Memphis, Nashville and Chattanooga — aren’t growing academically as they should, and neither are those in most of their “priority schools” in the state’s bottom 5 percent.

The divide prompted Education Commissioner Candice McQueen to send a “good news, bad news” email to superintendents.

“These results point to the ability for all students to grow,” she wrote of the top-performing districts, many of which have a wide range of academic achievement and student demographics.

Of those in the bottom, she said the state would analyze the latest data to determine “critical next steps,” especially for priority schools, which also are located in high-poverty communities.

“My message to the leaders of Priority schools … is that this level of growth will never get kids back on track, so we have to double-down on what works – strong instruction and engagement, every day, with no excuses,” McQueen said.

Growth scores are supposed to take poverty into account, so the divide suggests that either the algorithm didn’t work as it’s supposed to or, in fact, little has happened to change conditions at the state’s lowest-performing schools, despite years of aggressive efforts in many places.

The results are bittersweet for Tennessee, which has pioneered growth measures for student learning and judging the effectiveness of its teachers and schools under its Tennessee Value-Added Assessment System, known as TVAAS.

On the one hand, the latest TVAAS data shows mostly stable growth through the transition to TNReady, the state’s new test aligned to new academic standards, in the first year of full testing for grades 3-11. On the other hand, Tennessee has invested tens of millions of dollars and years of reforms toward improving struggling schools — all part of its massive overhaul of K-12 education fueled by its 2009 federal Race to the Top award.

The state-run Achievement School District, which launched in the Race to the Top era to turn around the lowest-performing schools, saw a few bright spots, but almost two-thirds of schools in its charter-reliant portfolio scored in the bottom levels of student growth.

Shelby County’s own turnaround program, the Innovation Zone, fared poorly too, with a large percentage of its Memphis schools scoring 1 on a scale of 1 to 5, after years of scoring 4s and 5s.


District profile: Most Memphis schools score low on student growth


Superintendent Dorsey Hopson called the results a “wakeup call” for the state’s biggest district in Memphis.

“When you have a population of kids in high poverty that were already lagging behind on the old, much easier test, it’s not surprising that we’ve got a lot of work to do here,” he said, citing the need to support teachers in mastering the state’s new standards.

“The good part is that we’ve seen the test now and we know what’s expected. The bad part is we’ve seen the test … and it’s a different monster,” he told Chalkbeat.

You can find district composite scores below. (A TVAAS score of 3 represents average growth for a student in one school year.) For a school-by-school list, visit the state’s website.

exclusive

Most Memphis schools score low on student growth under new state test

PHOTO: Stephanie Snyder

More than half of Memphis schools received the lowest possible score for student growth on Tennessee’s new test last school year, according to data obtained by Chalkbeat for Shelby County Schools.

On a scale of 1 to 5, with 1 being the lowest measure, about 54 percent of the district’s 187 schools scored in the bottom rung of the Tennessee Value-Added Assessment System, known as TVAAS.

That includes most schools in the Innovation Zone, a reversal after years of showing high growth in the district’s prized turnaround program.

Charter schools fared poorly as well, as did schools that were deemed among the state’s fastest-improving in 2015.

Superintendent Dorsey Hopson called the scores a “huge wakeup call.”

“It shows that we’ve got a tremendous amount of work to do,” Hopson told Chalkbeat on Monday. “It’s going to be hard and it’s going to be frustrating. … It starts with making sure we’re supporting teachers around mastering the new standards.”

District leaders across Tennessee have been trying to wrap their heads around the latest growth scores since receiving the data in late August from the State Department of Education. Only two years earlier, the Memphis district garnered the highest possible overall growth score. But since then, the state has switched to a harder test called TNReady that is aligned for the first time to more rigorous academic standards.

TVAAS results are scheduled to be released publicly this week, but Chalkbeat obtained a copy being circulated within Shelby County Schools, Tennessee’s largest district.

The data is prompting questions from some Memphis educators — and assurances from state officials — over the validity of TVAAS, the state’s system for measuring learning and judging the effectiveness of its teachers and schools.

This is the first year of issuing district-wide TVAAS scores since 2015. That’s because of the state’s cancellation of 2016 testing for grades 3-8 due mostly to failures in the switch to online testing.

Some educators wonder whether the bumpy switch to TNReady is a factor in this year’s nosedive, along with changes in how the scores are calculated.

For example, data for fourth-graders is missing since there is no prior state testing in third grade for comparison. Elementary and middle schools also don’t have growth scores for social studies, since the 2017 questions were a trial run and the results don’t count toward a school’s score.

Hopson acknowledged concerns over how the state compares results from “two very different tests which clearly are apples and oranges,” but he added that the district won’t use that as an excuse.

“Notwithstanding those questions, it’s the system upon which we’re evaluated on and judged,” he said.

State officials stand by TVAAS. They say drops in proficiency rates resulting from a harder test have no impact on the ability of teachers, schools and districts to earn strong TVAAS scores, since all students are experiencing the same change.

“Because TVAAS always looks at relative growth from year to year, not absolute test scores, it can be stable through transitions,” said Sara Gast, a spokeswoman for the State Department of Education.

Shelby County Schools is not the only district with disappointing TVAAS results. In Chattanooga, Hamilton County Schools logged low growth scores. But Gast said that more districts earned average or high growth scores of 3, 4 or 5 last school year than happened in 2015.

Want to help us understand this issue? Send your observations to [email protected]

Below is a breakdown of Shelby County’s TVAAS scores. A link to a school-by-school list of scores is at the bottom of this story.

Districtwide

School-wide scores are a combination of growth in each tested subject: literacy, math, science and social studies.

Fifty three schools saw high growth in literacy, an area where Shelby County Schools has doubled down, especially in early grades. And 51 schools saw high growth in math.

Note: A TVAAS score of 3 represents average growth for a student in one school year. A score of 1 represents significantly lower academic growth compared to peers across the state.

2017

School-wide composite Number of schools Percent of schools
1 101 54%
2 19 10%
3 20 11%
4 10 5%
5 37 20%

2015

School-wide composite Number of schools Percent of schools
1 58 28%
2 16 8%
3 38 19%
4 18 9%
5 75 37%

Innovation Zone

Out of the 23 schools in the district’s program to turn around low-performing schools, most received a growth score of 1 in 2017. That stands in stark contrast to prior years since the program opened in 2012, when most schools were on a fast growth track.

School-wide composite Number of iZone schools
1 14
2 2
3 2
4 0
5 5

Reward schools

Nearly half of 32 schools deemed 2015 Tennessee reward schools for high growth saw a major drop in TVAAS scores in 2017:

  • Central High
  • Cherokee Elementary
  • Germanshire Elementary
  • KIPP Memphis Middle Academy
  • Kirby High
  • Memphis Business Academy Elementary
  • Power Center Academy High
  • Power Center Academy Middle
  • Ross Elementary
  • Sheffield High
  • South Park Elementary
  • Southwind High
  • Treadwell Middle
  • Westside Elementary

Charter schools

Charter schools authorized by Shelby County Schools fared similarly to district-run schools in growth scores, with nearly half receiving a TVAAS of 1 compared to 26 percent of charter schools receiving the same score in 2015.

2017

School-wide composite Number of iZone schools
1 18
2 6
3 7
4 2
5 7

2015

School-wide composite Number of iZone schools
1 10
2 2
3 7
4 3
5 16

Optional schools

Half of the the district’s optional schools, which are special studies schools that require students to test into its programs, received a 1 on TVAAS. That’s compared to just 19 percent in 2015.

2017

School-wide composite Number of iZone schools
1 23
2 6
3 5
4 2
5 10

2015

School-wide composite Number of iZone schools
2 5
3 6
4 5
5 14

You can sort through a full list of TVAAS scores for Shelby County Schools here.