Bad Data, Good Data, Red Data, Blue Data

by Mike Ziegler Notes from the Classroom

shutterstock_148016636 Back in part one of this post, I explored a problem that my PLC had while attempting to gather accurate data from student assessments.

This post, while still recognizing some problems with data, is more upbeat, and provides some reassurance that you can (and should!) continue to gather and use data.

In my last post, I described how the word primitive was a foreign term even to some “A” students, and how this proved to be a problem for an assessment on the use of textual evidence. Beyond such content-specific vocabulary, there’s a secondary issue of assessment lingo: we ask kids to examine, analyze, compare, and evaluate, but few teachers directly instruct exact meanings for these terms.

The solution here is simple, but often bothers English teachers: define terms for the kids. When students ask you what you mean by contrast, you should be willing to explain that for them every time.

Why? Because the term itself isn’t the skill you want data about. If diction is the learning target, independence is obviously an expectation. For all the other assessments, though, you’re damaging your own data if you don’t make sure the students understand every word.

Aim Small, Miss Small

Here’s one of the most regular, self-inflicted data failures we bring upon ourselves: writing questions that attempt to assess too many things at once.

If I’m writing a short-answer question for an assessment about a passage’s tone, my expectation is for:

complete sentences;
a clear response to the question;
a quote (embedded and cited) to help prove the answer is correct;
and an analysis of the quote to tie it all together.

Even without getting into partially correct responses, you can see where my expectations have created six (!) potential point reductions.

But what have I done to my data if I take off one of two possible points for, say, not including a quote? If students paraphrased the text effectively and were right about the tone of the passage, then they’ve actually provided me two separate pieces of data about two different learning goals; they have mastered tone analysis, but they are deficient in using textual evidence to prove their arguments.

When we conflate the two and give them a ½ on the question, we have provided ourselves a sloppy data point. And by the time we’ve graded a set of 120 of that assessment, we might come to a wrong-minded, broad conclusion that sets the class back needlessly. Do they even know what tone is? Or are they just averse to quotes?

Consider that tone example once more. Does the question need to be rewritten? Maybe not.

As long as you’re willing to grade the assessment question for only the core skill (tone or textual evidence, but not both at once), then it can provide you some excellent data.

Writing questions that address one clear skill is ideal. But sometimes a question that entails multiple skills can be highly useful—as long as you aren’t attempting to score it for every skill at once.

Post-Assessment Interventions

Logic suggests a problem with this, though. Even if I narrow the learning target I’m assessing, it doesn’t clarify the problem’s source. Did a student choose her quote poorly because she doesn’t know tone, or because she lacks the ability to choose textual evidence well?

The solution to this, I think, is the post-assessment tool box most teachers already put to use. Conference with students for a couple minutes. They can speak effectively to where things went wrong, and data then becomes highly reliable.

When you don’t have time for one-on-one conferencing, having students self-reflect while you go over the assessment as a class can be just as useful. Ask students to make follow-up marks that you can look over later (“T” meaning “I didn’t understand the tone that well,” or “Q” for “I didn’t know what quote to select.”). This might seem like an inelegant solution, but think about what you’ve created: a robust data set that includes your initial impressions of their skills, alongside a self-evaluation where students have provided input on exactly what skill failed them.

There are obviously dozens of other solutions to the problem of inexact data, but I think the simplest takeaway is to be vigilant about communicating what you want your students to know, and explicit about how your grading rubric measures each learning goal in isolation.

It takes time to fix these sorts of systemic problems. But I’d argue that it amounts to less time than we spend reviewing concepts in class that we’ve misidentified as problematic, having listened to the lies of Bad Data.

Michael Ziegler

Michael Ziegler (@ZigThinks) is a Content Area Leader and teacher at Novi High School. This is his 15th year in the classroom. He teaches 11th Grade English and IB Theory of Knowledge. He also coaches JV Girls Soccer and has spent time as a Creative Writing Club sponsor, Poetry Slam team coach, AdvancEd Chair, and Boys JV Soccer Coach. He did his undergraduate work at the University of Michigan, majoring in English, and earned his Masters in Administration from Michigan State University.

Good Teacher, Bad Data

by Mike Ziegler Literacy & Technology Notes from the Classroom Professional Learning

shutterstock_268996268 I think it’s safe to say that there’s a bit more mathematical calculation in your normal English classroom pedagogy than there was, say, five years ago.

And you know what? That’s a good thing—a great thing if you’ve found meaningful ways to use the data gathered from formative and summative assessments.

But data can also be pretty misleading.

The idea of using data to improve instruction has always been presented as a simplistic and elegant solution: gather data that shows which students miss which questions and, voila!, you know where to direct differentiated instruction, to help every student reach mastery of the learning goals.

To wit: An easy question about the tone of an author yields 90% of your students who correctly identify and explain the tone, but the second tone question on the same assessment—testing the same learning goal but providing a much more challenging passage—reveals that only 50% of your class can really decipher tone when the going gets tough (or the tone gets subtle).

This is really fantastic information to have! Ten percent of your kids need to go back and review their notes and probably do some formative practice. But there’s another 40% who need to work on applying their newfound skill. They clearly know what tone is, but at some point when the tone isn’t smacking them in the face, they actually aren’t that great at recognizing the trait in writing. The needs of these two groups are different, but now you know whom to direct to which formative task!

The Signal, The Noise, The Headache

51Ui-zv3m7L._SX324_BO1,204,203,200_ The funny thing about data, though, is that numbers aren’t as clear and objective as all those charts and bar graphs would have us believe. If you don’t want to take an English teacher’s word for that, get ahold of Nate Silver’s excellent book The Signal and the Noise, which reveals just how difficult it can be to get data to tell you the truth.

Or, for that matter, believe your own experience, since I’m fairly certain you’ve also experienced the sort of data debacle I’m about to describe.

A few years ago, my professional learning community rewrote all of our assessment questions so that they were clearly labeled by learning goal. When we tested a student’s ability to support an argument using textual evidence, the question might look like this:

Using Evidence: Using at least one quote, explain how Jon Krakauer establishes Chris McCandless’s desire to live a more primitive lifestyle in Into the Wild.

Now everything should be clean and easy to parse—if kids get the question right, they have mastered the use of textual evidence. If they get it wrong, they have not. And if they can explain Krakauer’s methods but fail to use a quote, we can presume they’re halfway there.

So would it surprise you to learn that my PLC ended up getting incredibly muddled data from this question? And that we eventually had to rethink how we were interpreting much of the data? Here are some of the issues that we encountered:

How can you tell when a student lacks a skill versus when they lack vocabulary? Three of my stronger students asked me what primitive meant—in my first period alone!
Did all the students recognize the implicit meaning of the verb explain? Have you been clear about what various verbs (contrast, analyze, challenge) demand of them in an assessment?
How do you decide whether a student just hasn’t written enough? And what should the takeaway be when students can vocalize an answer that is thorough and accurate?
How much should you be concerned when a student’s example is the one you’ve already used in a class discussion? What if that brand of example shows up on every single assessment a student takes?
If you give the students one passage to focus on, is a correct answer an indication of mastery of this skill or only partial mastery (since on their own they might not have been able to select the relevant part of the text from, say, an entire chapter)?

Any of these are good reasons to have a careful data discussion in your PLC. But let’s just take that first one—lacking a skill versus lacking vocabulary—as an example.

I couldn’t write off as a trivial minority the students who asked the question (what primitive meant)—these were the grade-concerned kids who were good about asking questions. If they didn’t know the term and said so, then there was a good chance that A LOT of the other kids also didn’t know the meaning of primitive. They just didn’t bother to ask.

Is Data Doomed?

All of a sudden, our data about this fundamental writing skill seemed really murky. And this was a learning goal we thought was pretty transparent and objective! There was a sudden temptation to go back to the more instinctive, less numbers-driven approach to gathering feedback about students.

Even though gathering good data in English is tougher than it seems, it is both possible and essential for effective instruction. I’ll revisit my own case study in my next blog post, in order to elucidate a few of the counter-measures my PLC took to help avoid “fuzzy” data points.

In the meantime, think about the next assessment you give to students. Whatever data you take from it, ask yourself whether more than one “theory” about the kids’ performances on it would fit the data you’re staring at.

Michael Ziegler

Podcast #15: Student Data, Mining of This Data, and Implications

by Les Howard Podcasts

The ability to collect and store vast amounts of information on students has increasingly become easier and cheaper. At its best, this information can be used to support students. At its worst, the information can be used against students, often without their knowledge. This information can be stored and manipulated forever.

In this podcast, Chris Gilliard, Hugh Culik, Daniel Hoops and Jason Almerigi provide an insightful and interesting discussion on this issue.

Links to sites mentioned in the podcast:

This podcast is also on iTunes.

Moving from the ACT to the SAT in 2016

by Delia DeCourcy News

The Michigan Department of Education announced this shift yesterday. For more information, see the press release below.

Visit this website to learn about the redesigned SAT.
Read “The Story Behind the SAT Overhaul” from the New York Times about the new SAT.

***

MDE News Release

Contact: Martin Ackley, Director of Public and Governmental Affairs, (517) 241-4395

Caleb Buhs, Michigan DTMB, (517) 241-7422

State Awards Future College Assessment to College Board’s SAT for Michigan Students

January 7, 2015

LANSING –- All Michigan high school juniors will begin taking the SAT as the state-administered college assessment exam beginning in 2016 after the College Board won the three-year competitively-bid contract, the Michigan Department of Education and Department of Technology, Management and Budget jointly announced today.

The College Board administers the SAT, a globally-recognized college admission test that lets students show colleges what they know and how well they can apply that knowledge. It tests students’ knowledge of reading, writing and math — subjects that are taught every day in high school classrooms in Michigan.

ACT, Inc. will continue to provide its WorkKeys assessment for all high school students. Both the college entrance assessment and work skills tests are required in state law to be provided free to all high school students, and each is periodically competitively bid through the state’s structured procurement process, as directed by the Department of Technology, Management and Budget (DTMB).

“The College Board’s SAT test is respected and used around the country,” said State Superintendent Mike Flanagan, “and Michigan high schools work with them now through their Advanced Placement program that helps students earn college credits while in high school.

“Their bid was rated the highest; provides valuable assistance to Michigan educators, students, and parents; is more aligned to Michigan’s content standards; and saves the state millions of dollars over the course of the three-year contract,” Flanagan said.

The College Board’s bid was $15.4 million less over the three-year contract than the next bidder and scored 10 percentage points higher by the Joint Evaluation Committee (JEC). In addition to staff from MDE and DTMB, the evaluation committee also included members representing the education community, including a high school principal; local school superintendent; a testing and assessment consultant from an intermediate school district; and a vice president from a Michigan community college.

Bill Barnes, principal at Charlotte High School and member of the JEC said: “The attention to detail with which the College Board created its proposal and the extensive resources that it will provide to schools and students to help them prepare for the test, make its college readiness assessment the best choice for Michigan.”

Another member of the Joint Evaluation Committee, Jim Gullen, a data and evaluation consultant for the Macomb Intermediate School District, said: “After two days of review and discussion, there was no question that College Board put forth the best proposal. Considering the quality of College Board’s proposal, the value presented in the pricing, and our current legislation, it is a good time to transition to the SAT to assess Michigan’s high school students’ mastery of the Michigan curriculum.”

Each year, the College Board helps more than seven million students prepare for a successful transition to college through programs and services in college readiness and college success — including the SAT and the Advanced Placement program. The organization also serves the education community through research and advocacy on behalf of students, educators and schools.

The Michigan Department of Education (MDE) is forming a team that will include the local, regional, and community college members of the Joint Evaluation Committee to assist in the transition to the SAT. In addition, the department will hold an onsite meeting with the College Board to discuss how it intends to positively affect the transition for Michigan schools, educators, parents, and students.

In its successful bid, the College Board included the following value-added components that will benefit Michigan schools and families:

Beginning in Spring 2015, the College Board will provide all schools and students with free test prep materials and online practice tests to help students prepare for the redesigned SAT in 2016.
Professional Development
- In-person and technology-based training for local test administrators, proctors, and technology coordinators
- Professional development for teachers, students, and parents in understanding the new SAT and analyzing test results
- Professional development for post-secondary enrollment professionals in using the data/resources for admissions and financial aid decisions
An updated and relevant assessment
- Redesigned SAT beginning in 2016
- Aligned to Michigan content standards, evidence-based design
- Additional item types beyond multiple choice
- New forms developed each year
- Reports available online
Simplification and reduction of school staff effort to request testing accommodations
- No need to reapply for testing accommodations if already approved for the Advanced Placement Program, or the PSAT testing for National Merit Scholarship Qualification Test

The college entrance exam and work skills assessment are given free to approximately 115,000 Michigan high school students each year.

ACT WorkKeys is a job skills assessment system that helps employers select, hire, train, develop, and retain a high-performance workforce. This series of tests measures foundational and soft skills and offers specialized assessments to target institutional needs.

As part of ACT’s Work Readiness System, ACT WorkKeys has helped millions of people in high schools, colleges, professional associations, businesses, and government agencies build their skills to increase global competitiveness and develop successful career pathways.

Successful completion of ACT WorkKeys assessments in Applied Mathematics, Locating Information, and Reading for Information can lead to earning ACT’s National Career Readiness Certificate (ACT NCRC), a portable credential earned by more than 2.3 million people across the United States.

Michigan high school students have taken the WorkKeys assessment since 2007. Over 413,000 Michigan students have received an NCRC credential.

Although the contracts await final completion and approval of the State Administrative Board, the three-year contract for the college entrance assessment will cost approximately $17.1 million, and the three-year work skills assessment will cost approximately $12.2 million.

Menu