Many courses in the biological sciences include short tests and quizzes as part of their overall assessment strategy and curriculum design. Such tests are generally described as being shorter than examinations*, take place during the course of study, and if formally assessed the grades generally make a smaller contribution to the overall grade. Tests vary in terms of their principal purpose, question design and format. A unifying characteristic, however, is that tests are opportunities for students to assess their knowledge, and they provide timely and diagnostic feedback on students' learning.
Written or online tests associated with practical classes are included in this section. A distinction is made, however, with the demonstration and evaluation of practical skills. Practical assessment of this type is quite distinctive and is described elsewhere [see also: Practical assessment].
* Tests, quizzes and examinations share many common features. For the purposes of this site, we have distinguished 'examinations' on the basis that they tend to be end-of-unit tests which contribute significantly to the overall grade [see also: Examinations]. We make no distinction between a test and a quiz, as these terms are used variously and interchangeably in the biological sciences.
As the examples described to us illustrate, tests are diverse in design and in the role that they play in learning and teaching in the biological sciences. Fundamentally, however, they are used to provide feedback and incentives.
While it is true that all forms of assessment have the potential to provide students with feedback on their learning, this is particularly true of tests. A well-designed test helps students to check their knowledge and identify gaps and areas of misunderstanding. This is true of tests designed as self-assessment tasks [Examples: Peat 3; Peat 5; Plastow; Rogers 2; Shapter], as well more formal and mandatory tests [Example: Mrongovius 2].
As many tests are 'progressive', they enable students to act on the feedback in time to improve their performance in subsequent assessment tasks (including, often, the final examination). It is common for first year courses to include early, low-stakes tests as a means of providing timely feedback on progress as they adjust to the expectations of university studies.
Tests and quizzes conducted during a unit of study enable staff to identify, and respond to, issues with the learning and teaching. This feedback is important as two levels:
Results for individual students: Test results provide valuable information about individual students' learning and progress. For example, poor test results contribute to identifying students at risk, particularly important in early-year levels [Example: Shapter].
Classwide results: Areas of general misunderstanding can be identified and addressed. In-class quizzes, including the use of Audience Response Systems*, can be used to good effect [Example: Wood 1], as can more formal tests such as practical tests [Example: Dow; Mrongovius 2] and in-semester tests [Example: Peat 3; Plastow].
... If it's a test I always looking at the different groups and their average mark and standard deviation to make sure that there's equity there. We also put up the marking schemes or the test answers or what we were looking for in the dissection so a student can always come back and ask for an explanation if they're not happy. ... we tend to modify marking schemes from year to year depending on our experience from a previous year. We try to improve... you know, if there's been something that the students have raised with us on a regular basis then we'll take that into account and modify from year to year. [academic]
[*see also: Engaging large classes through assessment; Online assessment]
Tests that contribute toward the grade for a course are often designed to encourage students to revise and study throughout the course - not simply in the lead up to a final examination. The use of tests for this purpose varies from regular small tests such as weekly practical tests each contributing just a few percentage points, to single, 'mid-semester' tests which are much 'higher-stakes'.
Tests tend to involve questions which are answered quickly - extended answers and essays questions, common in examinations, feature less often in tests and quizzes. The typical question types are described below.
As diagnostic tools, well-designed MCQs are an effective way for students to identify misconceptions and gaps in their knowledge. In particular, where explanation of both 'correct' and 'incorrect' options is provided, MCQs are a powerful device for students' self-assessment [Example: Plastow]. Similarly, classwide response patterns to MCQs provide staff with important feedback on teaching and student learning.
As noted under Examinations, however, the challenge lies in designing questions which are appropriate to the desired learning outcomes. The simplest MCQs to prepare are those which test students ability to recall specific information - for example: the correct name of an organelle; the most numerous molecule in plant cell walls. If, however, the intent is to encourage critical analysis, synthesis or problem-solving, effective MCQs are more difficult to construct.
When used for more formal testing - such as mid-semester tests - there are issues related to reliability and ambiguity. As discussed under Examinations, with just two options, guesswork and chance are even more an issue than is the case with MCQs. In addition, the risk of ambiguity in the typically-complex statements used in T/F questions is high. We were told of two different approaches to overcoming these limitations. The first approach was to require a written justification for the choice of true or false. In this case, the questions were not dissimilar to short-answer questions. A second approach described to us involved penalty for incorrect answers. In this case, there was no deduction if fewer than 30 per cent of the responses to a particular question were correct - the reasoning being that this would help control for questions which proved confusing or ambiguous.
The most obvious distinction in format is between paper-based and computer-based tests. The use of online tests and quizzes is increasingly rapidly, in part due to the uptake of learning management systems in higher education, and the fact that such systems usually have inbuilt tools for test creation and management.
Another variation on format, and one which is growing in use, is the incorporation of informal quizzes into lecture classes. While this has always been an option, the advent of Audience Response Systems* has provided impetus to this form of assessment [Example: Wood 1]. Typically there is no grade involved, and the test is used for the multiple purposes of engaging students and providing feedback to both students and staff.
[See also: Online assessment]
[*Audience Response Systems are described further under Engaging large classes through assessment]
The conditions under which tests and quizzes are completed relates directly to their principal purpose.
In practical tests of this type, it is common for students to have access to additional resources such as laboratory notes.
As described above, many tests are designed with feedback as the principal purpose. Therefore, the type of feedback provided by tests is an important consideration. The type of feedback provided is often directly influenced by the question type. Feedback on a multiple-choice quiz is typically quite different to that provided on a test involving short-answer questions and diagrams. In addition, providing the best possible feedback given limited time and resources inevitably results in the need to make strategic decisions.
The types of feedback provided by test results may be categorised as follows:
Due to the way in which information about test results is often provided to students, however, the score may be the only feedback many students 'register'. For example, if scores are made available in advance and/or independent of other feedback, many students pay attention to the score alone. Therefore, consideration of how feedback is released can be as important as the actual type of feedback provided [Example: Mulder 4].
As discussed above, test feedback will be most effective if it is available at a time when students are engaged in thinking about the questions. Approaches include feedback for each question as students progress through an online quiz or immediately after completing such a quiz. This can also be achieved in class, if students correct their own tests as the questions are discussed in turn.
[see also: Online assessment]
A variation on this approach is self-assessment - students checking their own responses against the model answers provided. This is an effective approach for optional, purely 'formative' tests [Example: Rogers 1].
The preparation of effective explanations of this kind requires that the author have both disciplinary expertise and insight into student learning. It is both a powerful and potentially efficient approach - just as with model answers, such explanations need only be prepared once for a particular question and can provide a valuable study resource for students.
[see also: Providing feedback; Coping with resource constraints]
When tests contribute in even a small way to the overall grade for a course, the issue arises of ensuring that each students' work is their own. Staff differ in their philosophies in this regard and test conditions vary accordingly - from timed, invigilated, and with questions unavailable beforehand, to 'take-home' or online tests which are untimed and rely on students' honesty in terms of authentication. Typically, however, where there is a significant contribution to course grades, tests are supervised and structured to minimise the risk of dishonest behaviour.
A simple and long-standing approach is to manage students seating position within a shared test venue. This is no longer as simple as it once was, however, with increasing class sizes. Also, if computers are used, students' work may be in clear view of one another. A strategy commonly used to manage this is to vary the order of questions and/or the order of options in MCQs. Another approach, and one that is commonly used if tests are taken by groups of students at different times, is to have versions of the test with different but 'equivalent' questions. Moderation becomes important in such situations to ensure that the versions are indeed equivalent.
[see also: Addressing plagiarism issues]
Tests and quizzes are also used for purely formative purposes. In such situations, collaboration between students is typically encouraged - test questions can provide an effective trigger for productive discussion and debate.
It is a reality of university teaching that resources are finite and, therefore, that the strategic use of resources is a critical aspect of assessment design. Feedback for students is most effective when it is both timely and meaningful [see also: Providing feedback]. Identifying ways in which meaningful feedback can be provided quickly and efficiently is therefore critical in the design and implementation of tests and quizzes.
Considerations regarding feedback are described in detail in the section above entitled 'Feedback'.
The design of effective test questions requires disciplinary-expertise, insight into student learning ... and time [see also 'Question types', above]. This is particularly true for MCQs, which are challenging to prepare and are typically needed in large numbers. For this reason, the questions used in graded tests are often 'protected' and reused in later years. This poses a challenge with regard to providing feedback. Displaying the questions, with feedback, in a secure location for a fixed period of time is one solution. While perhaps not ideal, this approach can be effective - particularly if students have both opportunity and incentive to consider the feedback. Discussion of the feedback, in class or online, is one way to maximise the feedback value from 'protected' test questions.
The focus of tests is typically upon the development and assessment of disciplinary knowledge. Well-designed tests can, however, also be used to foster the development of generic skills and, to some extent, assess these skills. The most obvious are critical thinking, analysis and problem-solving
[see also: Assessing generic skills; Outcomes and assessment]
A number of the biologists we interviewed said thay developed tests and quizzes in realistic contexts. For instance, one academic uses a quiz to ensure students navigate through the National Aquatic Animal Disease Awareness web site [Example: Benkendorff 1]. Another creates problem sheets based on authentic scenarios [Example: Sanderson].
To reference material from this site, please use:
Harris, K-L., Krause, K., Gleeson, D., Peat, M., Taylor, C. & Garnett, R. (2007). Enhancing Assessment in the Biological Sciences: Ideas and resources for university educators. Available at: www.bioassess.edu.au