WHY USE THE RUBRICS?
Has a student ever said to you regarding an assignment, "But, I didn't know what you wanted!" or "Why did her paper get an 'A' and mine a 'C?'" Students must understand the goals we expect them to achieve in course assignments, and importantly, the criteria we use to determine how well they have achieved those goals. Rubrics provide a readily accessible way of communicating and developing our goals with students and the criteria we use to discern how well students have reached them.
WHAT IS A RUBRIC?
Rubrics (or "scoring tools") are a way of describing evaluation criteria (or "grading standards") based on the expected outcomes and performances of students. Typically, rubrics are used in scoring or grading written assignments or oral presentations; however, they may be used to score any form of student performance. Each rubric consists of a set of scoring criteria and point values associated with these criteria. In most rubrics the criteria are grouped into categories so the instructor and the student can discriminate among the categories by level of performance. In classroom use, the rubric provides an "objective" external standard against which student performance may be compared.
WHAT IS INVOLVED?
| Instructor Preparation Time: | Medium to High |
| Preparing Your Students: | Continuous; but students catch on fairly quickly |
| Class Time: | Variable. As students use rubrics, they become better writers and oral presenters; hence the time instructors spend evaluating students' work is reduced. |
| Disciplines: | All. |
| Class Size: | All. Rubrics are easy to use in small classes, and are particularly useful in large classes to facilitate scoring large numbers of written or oral assignments. |
| Special Classroom/Technical Requirements: | None. |
| Individual or Group Involvement: | Both. |
| Analyzing Results: | The level of analysis depends on the instructor's intended goal of the assessment task and the type of data desired about students' performance. For detailed analysis of students' responses, each section of the rubric can be scored independently then totaled. For a holistic analysis of students' responses, all sections of the rubric can be blended and an overall score assigned. |
| Other Things to Consider: | Rubrics must be readily available to students before they begin an assignment or written test. Posting rubrics on the web and including them in the course pack for in-class writing promotes their usefulness. |
Description
1. I developed the goals for my course and daily class meetings.
Based on the goals for my courses, I selected different forms of extended responses, both written and oral, and concept maps to gather the data that would convince me that my students achieved the goals. The kinds of questions I asked students and the types of projects I assigned, were designed to promote students' reasoning. For example, for the first three goals I have listed, various types of assessment that could be used to gather the type of data desired.
Notice, when it is possible to quantify the categories, I did so. So, for example, the criteria for acceptable style and grammar in an exemplary answer is based on no errors.
Our ability to differentiate among criteria is critical to the effectiveness of the scoring rubric. So words like "good" are too subjective. The criteria must be designed so that you and your students can discriminate among the qualities you consider important.
When we evaluate students' extended responses, we tend not to score them point by point, however, by elaborating on the criteria that comprise the different levels of performance, we provide the students substantive guidance about what should be included in their extended responses.
Point values: Do you assign points on a 5, 3, 1 scale? or a 5, 4, 3 scale? I have tried both. I chose 3 as the middle or as an adequate score. Most student responses in this category can readily be improved through group work, practice, effort and instruction. Therefore, in an effort to develop students' self-efficacy and to promote their achievement of higher standards, I chose the 5,4,3 point scheme.
On a five-point scale, the data do not enable me to discriminate between two consecutive points, such as 3 and 4, in terms of evaluating the response. Rather, three categories were readily distinguishable by my students and me, therefore, little if any time was spent "arguing" for points. The criteria for evaluation were clear and understood.
Faculty Roles
Share with students samples of "exemplary", "adequate", "needs improvement" responses. Ask them to work in cooperative groups to analyze the strengths and weakness of the written responses, using the rubric as a guide. With practice, students learn to recognize and ultimately develop their own exemplary responses.
The formative assessment I gather by using rubrics to evaluate students' responses during the course is valuable. In-class writing assignments give me feedback about the nature of the task and questions I ask students. The components of a question or task that provide meaningful responses are readily identifiable from the rubric and provide us insight into my students' strengths and weaknesses. I use these data to modify, change directions, or add components to our instructional design and strategies.
Four functions of assessment data are described by Hodson (1992):
Assessment is learning. We and our students both benefit from meaningful assessment information about the achievement of the broader course goals. Multiple assessment strategies can be implemented to provide evidence that students have or have not learned, have or have not accomplished the goals of the course. Rubrics help us set well-defined standards for our students, provide students guidelines for achieving those standards and facilitate grading extended written and oral responses. This feedback provides us data to interpret and make informed decisions about our students' learning and our own teaching practice, similar to the process of data evaluation that we use daily in our scientific research.
Freeman, RHL. 1994. Open-ended questioning: a handbook for educators. Menlo Park, California; Reading, Massachusetts; New York; Don Mills, Ontario; Wokingham, England; Amsterdam; Bonn; Sydney; Singapore; Tokyo; Madrid; San Juan; Paris; Seoul; Milan; Mexico City; Taipei. Addison-Wesley Publishing Company (The book is published by Innovative Learningª, an imprint of the Addison-Wesley Alternative Publishing Group.)
King PM, Kitchener KS. 1994. Developing reflective judgement: understanding and promoting intellectual growth and critical thinking in adolescents and adults. San Francisco (CA): Jossey-Bass Publishers.
MacGregor J. 1993. Student self-evaluation: fostering reflective learning. San Francisco (CA): Jossey-Bass Publishers.
Magolda, MBB. 1992. Knowing and reasoning in college: gender-related students' intellectual development. San Francisco (CA): Jossey-Bass Publishers.
National Research Council. 1996. National science education standards. Washington (DC): National Academy Press.
Novak JD, Gowin DB. 1984. Learning how to learn. New York: Cambridge University Press.
Anyhow, when I began teaching a large introductory biology course (600 students) I knew that my multiple choice tests were not providing me the kinds of data I wanted about my students' thinking, because I also knew that freshman can/do think!! Second, I believed that my students needed to learn how to write and speak to explain themselves in the sciences as well as every other facet of their education, and it was my responsibility to assist all of them in this process. On the other hand, I needed a reality check. How would I find time to evaluate 600 writing samples, especially if I asked students to practice writing/speaking more than once throughout the semester?
So I stumbled upon the term "rubric," I learned what it meant, and I learned how to design rubrics from various sources in the literature - a special acknowledgement to BSCS (Biological Sciences Curriculum Studies) whose rubrics influenced my original thinking. As I developed rubrics for each of my assessments, I forced myself to think more explicitly about the goals I wanted my students to achieve and the criteria I would use to monitor their progress. Then I worked with my students to understand and practice achieving the goals and criteria with rubrics as a guide for communication. I now can manage reading and evaluating large numbers of well-written and reasoned responses. We all won. On another note, because my students understand and use rubrics, I seldom, if ever, have individuals who "argue for points" on any given assignment.
Last year, I was recruited by Michigan State University to become the director of a residential science school within the College of Natural Science, the Lyman Briggs School. Our faculty includes scientists, mathematicians and humanists who value their teaching as much as their research. So I left the sunshine of northern Arizona to engage in this
great opportunity to continue my research in undergraduate science education, this time with science majors in a small college within a large university -- stay tuned.
Exemplary
(5 pts quizzes)
Adequate
(4 pts quizzes)
Needs Improvement
(3 pts quizzes)
No Answer
(0 pts)
Assessment Purposes
Limitations
Level of Achievement
General Presentation
(10 points possible) Conceptual Understanding
(10 points possible)Argument Structure
(10 points possible)Use of literature and pertinent resources
(5 points possible)
Exemplary
Adequate
Needs Improvement
Teaching and Learning Goals
Students learn to communicate about science in a variety of ways and especially improve their writing skills. The quality of students reasoning and logic increases. Instructors gather a variety of data about students' understanding and performance.
Suggestions for Use
I design rubrics for the multiple forms of assessment I use in my courses: short writing samples, essays, poster displays, research papers, public hearing papers, oral presentations, weekly homework assignments, and concept maps. Each rubric stands on its own, but the general criteria in many rubrics are similar. For example, rubrics for written assignments have the same criteria for acceptable style and grammar; responses must address the question, and arguments must be presented in a logical order [compare Figures 1 and 3]. Alternatively, a rubric designed to evaluate the mechanics of a poster display may include a checklist to guide the student in developing all of the sections of the poster. Each component of the poster should then have additional criteria for evaluation. For example: What are the criteria for the title of a poster? Is the title informative? Are specific key words used?
Exemplary
(10 pts)
Quality
(8 pts)
Adequate
(6 pts)
Needs Improvement
(4 pts)
No Answer
(0 pts)
Step-by-Step Instructions
There are many routes to developing a useful scoring rubric, however, all of them involve the following five steps:
To provide a useful example of how these steps "play out" in a real world context, I will describe how I developed rubrics for my own introductory biology and ecology courses.
Keep in mind the assessment tasks must be linked to student learning goals and outcomes. So writing goals is the first step. These are examples of stems and sample goals from introductory ecology or biology courses:
Students will be able to demonstrate their ability to:
2. I selected the assessment tasks:
What type of assessment will provide me data about students' achievement of each of these goals?
3. I developed a set of performance standards:
The performance standards I used in my introductory biology course on "logical reasoning" and "critically evaluating information" were different than the performance standards I developed for my upper division biology majors. The difference was based on the developmental stages of the students and their experience in college-level science courses (Magolda 1992, King and Kitchener 1994).
4. I differentiated performances based on criteria:
Examine the rubric for Quizzes and Homework. The criteria for responses fall into two major categories: general approach and comprehension. Although these two categories are not discrete as indicated by the dotted line between them, students can see all of the itemized components of an exemplary answer. These categories can be divided further. For example, comprehension could be divided into content knowledge, conceptual understanding, and reasoning and critical thinking skills (Freeman 1994). Freeman (1994) includes communication skills as a category in rubrics. Essentially, my rubrics cover the same categories; the difference is in the number of columns used.
5. I assigned ratings (or weights) to the categories.
Variations
Student Roles
Analysis
Rubrics are scoring tools that enable me to assign points to students' assignments and tests. Students' accumulation of points determines their grade in the course. Each assignment, quiz, or test is weighted in terms of value in the overall course evaluation. For example, daily writing samples (quizzes) are worth 5 points, twice weekly, 15 weeks per semester; hence a student can earn a maximum of 75 points for daily performance. The pattern of students' performance is consistent from semester to semester. At the beginning of each semester, many students' responses are below college-level. As students begin to understand the criteria and practice writing, they attain college-level work or exemplary performance on short, five-point assignments or quizzes. A key strategy in promoting improvement by all students is peer review within their cooperative groups.
Pros and Challenges
However:
Theory and Research
What is assessment? Simply, assessment is data collection with a purpose. In each of our courses, we engage in the process of gathering data about our students' learning. The type of data we gather depends on the evidence we will accept that students have learned what we want them to learn. Generally, the data we collect are intended to be measures of students' knowledge, attitudes and performance. Ideally, these data are also matched or "aligned" with the goals of the course and our daily or weekly course activities.
We need confidence in the quality of the data we gather about our students if we want to justify our subsequent decisions about teaching. Many of us who teach introductory science courses are dissatisfied with the type of evidence we are collecting about our students' learning. We admit that data from multiple choice tests measure inert bits of knowledge and some comprehension but provide us incomplete and inadequate feedback about our students' learning. We would like to use alternative forms of assessment to gather multiple, substantive forms of data about active student learning, such as understanding, analysis, reasoning, and synthesis (Ebert-May et al 1997). These kinds of assessments include short answer items, essays, minute papers, oral communication, poster presentations, laboratory projects and research papers, but because of large class sizes and individual research priorities we have limited time to evaluate extended responses from students.
Links
email: ebertmay@pilot.msu.edu
Sources
Ebert-May D, Brewer C, Allred S. 1997. Innovation in large lectures-teachings for active learning. Bioscience 47: 601-607.
Diane Ebert-May
Freshman in college - my favorites. Why? Freshman are excited, energetic, and a bit wary about the challenge before them. Perhaps the subliminal reason is that every year my freshmen are 18 years old, so that must mean I am staying the same age too, right??
Tell me more about this technique:
Introduction
Description, Purpose, and Limits
Goals, Use, and Instructions
Variations, Analysis, and Pro/Challenges
Theory, Links, and Sources
Diane Ebert-May
View Entire Technique
Download Technique
Tools
![]()