« NCAALBAASA to After-School Teachers: "You're Fired" »

    Are We There Yet?

    03/11/09

    Permalink 07:19:40 am, Categories: Announcements [A]

    Education Sector has just published a paper that I authored on the Tennessee Growth Model being used to measure school and district AYP.

    Great pains were made to make the paper as fair as possible. All growth models have strengths relative to the status and safe harbor models laid out in the ESEA statute, and Tennessee’s model is no exception. But, as the paper points out, the Tennessee model (variations of which are also being employed in Ohio and Pennsylvania and are under debate in Texas) also has some serious shortcomings.

    The Tennessee model is highly technical, using multivariate, inferential statistics. And therein lies a good deal of the problem. More statistical sophistication means less transparency. And it’s not clear to me that all that sophistication actually buys you much relative to other, simpler methods of measuring growth.

    Some of the methodology and most of the data are proprietary, meaning they are privately owned, i.e., no public access. This all makes it very difficult for even Ph.D. and J.D.-level policy experts to get a handle on what is going on (which I found as a peer-reviewer last year), let alone teachers, parents, and the general public. 

    Tennessee officials communicating with me privately acknowledge the points raised in the paper. They also point out, in all fairness, that they are in the midst of raising Tennessee standards, in response to the second key point in the paper: slow growth toward a low standard, especially for grade school children who risk falling too far behind and never catching up, may be a risky and inefficient accountability strategy.

    The third key point is that the Tennessee model has an inherent paradox: while the Tennessee model implies a 3-year trajectory to proficiency, even if a student meets his or her interim goals each year, they likely will not reach proficiency in three years. This is a well-known statistical phenomenon called "Zeno’s Paradox."

    What drives the paradox is that Tennessee recalculates a a child’s "projected" 3-year path to proficiency each year.

    Think of a child’s academic progress as covering the "distance" from where they are academically this year to "proficiency" three years from now. Here’s a simple example of the problem:

    Let’s say a frog very much wants to get to a lily pad that is 100 meters away. He knows he can’t make it all the way, but judges he can make it a third of the way (he’s a championship frog). So on his first try, he jumps 33 1/3rd meters. He decides to jump a third of the way to his goal from where he stands each time, until he gets there.

    When does he get there? On first glance, most of us would say in three jumps. In fact, in strict mathematical terms, he never gets there. After three tries, he is actually only 19/27ths (71 percent) of the way there. The Table below shows the calculation. 

    Zeno’s Paradox

    on the Way to Proficiency

    Jump Number

    Distance of the Jump = 1/3 of distance remaining

    Total Distance

    1

    1/3rd

                                1/3rd

    2

    1/3rd of 2/3rd = 2/9

    1/3 + 2/9 =       5/9ths

    3

    1/3rd of 4/9th = 4/27

    5/9 + 4/27 =  19/27ths

     

    When we say that the frog (or student) who meets the interim goal will never meet, in a strict mathematical sense, the end goal, we do not mean that at some point they will not get “close enough” to be considered “as if.” Where that point is, is a “judgment call” that, right now, only the state of Tennessee is in a position to make. And perhaps when ESEA is reauthorized, the President and Congress will decide that this is just fine. But they, and the state of Tennessee, should do so with as much understanding as possible of what is really going on underneath all the fancy statistics.

    Dr. William Sanders, who created the model and now works for SAS (a large software company; think of SAS as the Microsoft of the statistical world) has taken issue with the paper, and I have had a chance to preview his comments (which will be posted at Ed Sector later). In contrast with Tennessee officials, he (with the help, I imagine, of the statisticians employed by SAS) is taking a much more confrontational stance, which is not surprising. But I see nothing in his comments so far that refutes the conclusions drawn in the paper. They seem to be an issue of emphasis more than difference.
     
    I look forward to ongoing discussion, in part because I know all of us interested in this will learn something, but more broadly because it is my belief that if you put something like this in the public policy sphere, and ask the public to subsidize it, then public scrutiny comes with the territory.

    Trackback address for this post

    Trackback URL (right click and copy shortcut/link location)

    6 comments

    Comment from: Sherman Dorn [Visitor] · http://www.shermandorn.com
    The mid-90s state audit reports on TVAAS made similar points about the proprietary basis of Sander's system (one of which was written by Tom Fisher). I suspect a number of true quantoids have fiddled enough with PROC MIXED (the commercial version in SAS) to know some of its quirks, but that doesn't necessarily hold for the calculations Sanders does in private for Tennessee and other clients.

    In addition, one of the concerns I have with an iterative procedure such as Sanders uses is its quirky nature. Sometimes you have to reset initial values to get things to converge, and the more complex the system, the more you have to fiddle. That's fine in research but lousy in a public accountability system. Even if the procedures are technical, there should be a good several dozen assessment people in each state who can replicate the calculations to keep everyone honest.
    03/11/09 @ 07:37
    Comment from: Marktropolis [Visitor]
    I posted a comment at QuickandtheEd before I saw this. So, forgive the cross-posting, but what struck me about the paper was that it basically ignored the research out there that raises a plethora of issues around value-added models. I'm not surprised that you're getting the reaction from Sanders that you are - he has steadfastly refused to let anyone peek inside of the "black box" wherein he does all the calculations to deal with issues like student mobility, missing records, teacher mobility, race/ethnicity, etc. And when it comes down to it, how you fix the errors in the data can make a tremendous difference in your results.

    I'll pass on the same link I did at Q&E: http://www.wcer.wisc.edu/news/events/natConf_papers.php That WCER conference was not only a who's who in value-added, but also provided a stack of papers dissecting this issue. And Barone's piece didn't cite any of them.
    03/11/09 @ 08:43
    Comment from: Charlie [Member] Email · http://www.swiftandchangeable.org
    That research might have been a good appendix, but the value-added issues - outside of their proprietary nature - are actually outside the scope of what is already a long paper. As far as I know, no one until now has critiqued the TN growth model, and our paper focuses on its unique aspects. That was the goal.
    03/11/09 @ 09:06
    Comment from: Marktropolis [Visitor]
    Not sure what you mean by "no one until now" in terms of critiquing the TN growth matter. The VAM in TN has been critiqued plenty of times. As far back as 2003. http://www.rand.org/pubs/monographs/2004/RAND_MG158.pdf

    And I'm also not clear on how the value-added issues are outside the scope of the paper. The TN growth model was built on their VAM work. And if the VAM work is up for discussion (which it is), I think that's a pretty important issue.

    This does remind me somewhat of the debate last year when NYC wanted to use a growth model to determine teacher quality - and the UFT (rightly in my opinion) - challenged that initiative on technical grounds. I think EduWonkette did a few pieces on that but I can't find them right now.

    Yes, the resource might have been a good appendix, and I'm not even asking that the paper get "longer," just that it pay attention to existing research. Isn't a lit review still part of the process these days?
    03/11/09 @ 09:17
    Comment from: Charlie [Member] Email · http://www.swiftandchangeable.org
    I may not be getting your point. What we key in on are the things that make the TN growth model distinct for the accountability purposes of of NCLB. The expected score, in relation to TN's low standards, and how that affects AYP under ESEA. The VAM model is very different than the growth model in these respects. The growth model only came on board in 2005. Comparing them - VAM and growth - would have been a whole other paper. I think our (my) goals were more modest than you would like them to be.
    03/11/09 @ 09:46
    Comment from: Marktropolis [Visitor]
    OK. My mistake. I just spent waaaay too much time tracking down TN's original application to USED (p.s., the link in your paper is broken). Came across this nugget: "Given Tennessee’s historical use of value-added scores for district and school accountability and teacher evaluation, it is important to clarify that neither the proposed projection model for AYP nor its underlying projection methodology relies on a value-added model."

    I'm going to crawl back under that rock I came out of...
    03/11/09 @ 10:33

    This post has 600 feedbacks awaiting moderation...

    Leave a comment


    Your email address will not be revealed on this site.

    Your URL will be displayed.
    (Line breaks become <br />)
    (Name, email & website)
    (Allow users to contact you through a message form (your email will not be revealed.)
    September 2010
    Sun Mon Tue Wed Thu Fri Sat
     << <   > >>
          1 2 3 4
    5 6 7 8 9 10 11
    12 13 14 15 16 17 18
    19 20 21 22 23 24 25
    26 27 28 29 30    

    Swift & Change Able will attempt to cover all aspects of education policy, including:
    * Legislation
    * Policy
    * Politics

    Swift & Change Able will try to look at events in the states and local school districts, as well as the ongoing ESEA reauthorization process in Washington, D.C.

    Content on Swift & Change Able will include:
    * Analysis
    * Commentary
    * Research
    * Data
    * Public Forums

    and, sometimes, some surprises just to liven things up.

    Search

    The requested Blog doesn't exist any more!

    XML Feeds

    powered by b2evolution free blog software