Effect Size 0.4–0.7 — 250+ Studies

Formative Assessment:
The Most Powerful Teaching Strategy You Can Use

Formative assessment produces effect sizes of 0.4–0.7 — among the highest of any educational intervention. Black & Wiliam's landmark review of 250+ studies showed that when teachers use assessment for learning rather than of learning, student achievement gains are substantial and consistent across all subjects.

Based on Black & Wiliam (1998), “Assessment and Classroom Learning” — the most-cited assessment research paper in history.

Low-Stakes Feedback
Grade in 90 Seconds
FERPA Compliant
Understanding the Method

What Is Formative Assessment?

Formative assessment is ongoing, low-stakes assessment used to monitor student learning and provide feedback during instruction. Unlike summative assessment (which evaluates what students learned), formative assessment helps teachers adjust instruction in real-time. The goal is not to assign a grade but to improve the quality of learning while it is still happening.

The key principle: formative assessment is a process, not a test. It is the ongoing conversation between teaching and learning — any activity that gives teachers evidence of student understanding qualifies. An exit ticket, a think-pair-share, a draft essay, a classroom discussion, a quick write — all of these are formative assessment when they are used to adjust instruction.

Modern formative assessment has evolved from informal teacher observations to AI-powered real-time feedback on student writing. But the principle remains unchanged: use evidence of learning to improve teaching and learning. What AI changes is the scale — teachers can now give detailed, personalized formative feedback to every student, every week, without spending 40 hours grading.

Teacher providing one-on-one formative assessment feedback to student
Teacher providing formative feedback to students in a classroom setting
EasyClass AI providing formative feedback on student writing with specific strengths and areas for improvement
Comparison

Formative vs. Summative Assessment

The distinction between formative and summative assessment is one of the most important concepts in education. Paul Black and Dylan Wiliam put it simply: formative assessment is assessment for learning; summative is assessment of learning.

Purpose

Formative

Improve learning during instruction

Summative

Evaluate learning after instruction

Timing

Formative

Ongoing, during instruction

Summative

End of unit, semester, or course

Stakes

Formative

Low-stakes — not graded or graded lightly

Summative

High-stakes — counts toward final grade

Feedback

Formative

Immediate, specific, and actionable

Summative

Delayed, evaluative, and final

The analogy that sticks: Formative assessment is the practice during the season. Summative assessment is the championship game. You can't win the game without practice — and the research shows that the quality of practice feedback is the single biggest driver of performance.

The Framework

Dylan Wiliam's 5 Key Strategies

In his landmark 2011 book Embedded Formative Assessment, Dylan Wiliam distilled decades of research into five core strategies that define effective formative assessment practice. These are not techniques — they are principles that can be implemented in any classroom, at any grade level, in any subject.

1

Clarifying, Sharing, and Understanding Learning Intentions

Students must know what they are trying to learn, not just what they are doing.

When students understand the learning goal and the criteria for success, they can self-assess their progress. This is not about posting the standard on the board — it is about making the target transparent enough that students can identify where they are and where they need to go. Effective teachers co-create success criteria with students, making the destination clear before the journey begins.

Example Techniques

Learning intentions posted in student-friendly languageCo-created success criteriaRubrics shared before the assignmentAnchor papers showing strong and weak examples
2

Engineering Effective Classroom Discussions and Tasks That Elicit Evidence

Not all questions and tasks reveal student thinking. Design them to.

Most classroom questions tell teachers very little about student understanding. Wiliam advocates for 'hinge-point questions' — questions designed so that wrong answers reveal specific misconceptions, and right answers confirm mastery. This includes rich tasks where the process of solving reveals student thinking, not just the final answer.

Example Techniques

Hinge-point questionsThink-pair-shareMini whiteboardsClassroom polls with follow-upExplain-your-reasoning tasks
3

Providing Feedback That Moves Learners Forward

Feedback should tell students where they are and how to get to where they need to be.

Research by Kluger & DeNisi (1996) found that feedback can actually decrease performance when it focuses on the person (grades, praise) rather than the task. Effective formative feedback is specific, actionable, and focused on the gap between current performance and the learning goal. It answers: What is good? What needs to improve? How do I improve it?

Example Techniques

Specific written commentsCoded feedback (symbols that require student lookup)Audio feedbackAI-generated strengths and next stepsDraft feedback before final submission
4

Activating Students as Instructional Resources for One Another

Peer assessment is one of the most powerful formative tools available.

When students assess each other's work using clear criteria, they deepen their own understanding of quality and internalize success criteria. Graham, Hebert & Harris (2015) found peer feedback on writing has an effect size of 0.58. The key is structured peer review with specific criteria — not vague "what do you think?" conversations.

Example Techniques

Structured peer review protocolsTwo stars and a wishGallery walks with sticky note feedbackPeer editing with specific criteriaCollaborative error analysis
5

Activating Students as Owners of Their Own Learning

The ultimate goal: students who monitor and adjust their own learning.

Self-assessment is not students grading themselves. It is students developing metacognitive awareness of their own learning — understanding what they know, what they do not know, and what they need to do to close the gap. This strategy builds the foundation for lifelong learning and is the highest level of the formative assessment hierarchy.

Example Techniques

Self-assessment rubricsLearning journals and reflectionsTraffic light self-assessmentGoal settingPortfolio curation with rationale
Research

Research & Evidence

Formative assessment is one of the most evidence-supported practices in education. The research base spans more than four decades and hundreds of studies across every subject and grade level.

Headline Study

Black & Wiliam (1998) — “Assessment and Classroom Learning”

Review of Research in Education. The most-cited assessment research paper in history.

Reviewed 250+ studies on formative assessment. Found that improving formative assessment practices raises achievement for all students, with the largest gains for low-achieving students. Effect sizes of 0.4–0.7 — among the highest of any educational intervention ever studied.

250+

Studies reviewed

(comprehensive meta-review)

0.4–0.7

Effect size range

(large effect)

#1

Most-cited paper

(in assessment research)

Kingston & Nash (2011) — Meta-Analysis of Formative Assessment

Reviewed 42 studies. Median effect size = 0.25, weighted mean = 0.20. ELA showed stronger effects (ES = 0.32) than math (ES = 0.17). Confirmed that formative assessment is particularly powerful for language arts and writing instruction.

Wiliam (2011) — "Embedded Formative Assessment"

Synthesized the research into five actionable strategies. Demonstrated that the key is not assessment itself, but using assessment evidence to adjust instruction in real-time. Teachers who implement all five strategies see the largest gains.

Shute (2008) — "Focus on Formative Feedback"

Meta-analysis of feedback types. Found that elaborated feedback (specific, task-focused, actionable) significantly outperforms simple grades. Feedback that focuses on process and next steps outperforms feedback that focuses on the person.

Graham, Hebert & Harris (2015) — Writing Feedback

Meta-analysis of writing feedback. Adult (teacher) feedback on writing: ES = 0.87. Peer feedback on writing: ES = 0.58. Both are among the largest effect sizes in writing research — confirming that formative feedback on drafts dramatically improves final writing quality.

How 0.4–0.7 Compares (Hattie's Rankings)

Hattie considers 0.40 the “hinge point” for meaningful impact. Formative feedback consistently exceeds this threshold.

Teacher Feedback (writing)
0.87
Formative Assessment (max)
0.7
Peer Feedback (writing)
0.58
Formative Assessment (min)
0.4
Direct Instruction
0.59
Class Size Reduction
0.21
By Subject

Formative Assessment Across Every Subject

Formative assessment is not a writing-class-only strategy. Kingston & Nash (2011) found strong effects in both ELA and math. Here's how it adapts to every content area.

ELA

ES = 0.32 (Kingston & Nash)

  • Draft feedback on essays and narratives
  • Reading response journals
  • Writing conferences
  • Peer review with rubrics

Math

ES = 0.17 (Kingston & Nash)

  • Exit tickets with worked problems
  • Error analysis tasks
  • Think-alouds while solving
  • Warm-up hinge questions

Science

Lab-integrated checks

  • Lab check-ins before and after
  • Prediction journals
  • Concept maps
  • Pre-lab vocabulary checks

Social Studies

Document-based feedback

  • Draft thesis feedback
  • Document analysis annotations
  • Discussion observation notes
  • Map and timeline checks

World Languages

Low-stakes practice

  • Speaking practice feedback
  • Written draft corrections
  • Self-assessment checklists
  • Partner conversation reviews

CTE / Technical

Process-based checks

  • Process portfolio check-ins
  • Skills observation checklists
  • Project milestone reviews
  • Safety and procedure checks
Comparison

Formative vs. Diagnostic Assessment

These terms are often confused. Both are low-stakes and focused on learning, but they serve different purposes and are used at different times. Understanding the distinction helps teachers use both more effectively.

Aspect
Formative
Diagnostic
Purpose
Monitor and improve learning during instruction
Identify prior knowledge and gaps before instruction
Timing
Ongoing — throughout the unit or course
Before instruction begins — start of unit, year, or topic
Frequency
High — daily or weekly during active instruction
Low — once per unit or term, at the start
Who uses the results
Teacher adjusts instruction; students self-regulate
Teacher plans instruction; identifies starting points
Examples
Exit tickets, draft feedback, think-pair-share, quick writes
Pre-tests, KWL charts, interest surveys, placement assessments
AI enhancement
Real-time feedback on drafts, class analytics for reteaching
Pre-assessment scoring, gap identification, grouping recommendations

In practice: Use diagnostic assessment to find out where students are starting. Use formative assessment to guide them to where they need to go. Together, they form a complete picture of student learning — before, during, and after instruction.

Solutions

Common Challenges & AI Solutions

Formative assessment is research-proven — but implementation is hard. Here's where teachers struggle and how EasyClass helps.

Feedback Takes Too Long

The Problem

Writing personalized feedback on 150 essays is physically impossible. Most teachers grade one set of essays every two weeks — too slow for effective formative use.

AI Solution

EasyClass generates draft-quality feedback in 90 seconds per essay — specific strengths, next steps, and criterion-by-criterion comments. Teachers review and customize, turning a 10-hour task into a 30-minute review.

Students Don't Read Feedback

The Problem

Students skip written feedback and check the grade. If there's no grade, they assume the work doesn't matter and still don't read it.

AI Solution

Color-coded text annotations and specific, actionable next steps make feedback impossible to ignore. When feedback tells students exactly what to fix and how, engagement with comments improves dramatically.

No Time for Low-Stakes Grading

The Problem

Every assignment cannot be graded if it all takes manual effort. Teachers are forced to choose between feedback quality and feedback frequency.

AI Solution

AI handles the first pass — grade exit tickets, drafts, and quick writes in bulk. Teachers set the criteria once and can give formative feedback on every draft, every week.

Tracking Growth Is Hard

The Problem

Seeing patterns across 150 students over time requires spreadsheets and manual data entry. Most teachers don't have time to track who improved and who is still stuck.

AI Solution

Class analytics dashboard shows per-student and per-criteria growth trends across all submissions. Instantly identify which students need reteaching and which are ready to move on.

Step by Step

How to Use Formative Assessment with AI

From blank page to personalized feedback for every student in under 2 minutes.

1

Set Up a Low-Stakes Session

Choose the "encouraging" grading style in EasyClass, select a formative rubric or create one, and specify this is a draft — not a final submission. The AI will calibrate feedback to be growth-focused, not evaluative.

Open the AI Grader
2

Upload Student Work

Paste student writing directly, upload PDFs, or connect Google Classroom. Works for exit tickets, drafts, quick writes, reading responses, or full essays — any written student work.

3

Review Feedback & Track Growth

Review AI-generated feedback with specific strengths and next steps for each student. Share feedback immediately or customize first. Use the class analytics dashboard to identify which students need reteaching.

EasyClass AI grader showing formative feedback with strengths and next steps for student writing
FAQ

Frequently Asked Questions

What is formative assessment?

Formative assessment is ongoing, low-stakes assessment used to monitor student learning and provide feedback during instruction. Unlike summative assessment, formative assessment is not about assigning a final grade — it is about improving learning while it is still happening. Any activity that gives teachers evidence of student understanding can be formative assessment.

What's the difference between formative and summative assessment?

Formative assessment is ongoing, low-stakes, and used to adjust instruction during learning. Summative assessment is end-of-unit, high-stakes, and used to evaluate mastery after instruction. Think of it this way: formative = practice, summative = the game. Formative gives immediate, actionable feedback; summative gives a final evaluative judgment.

What are examples of formative assessment?

Common formative assessment examples include exit tickets, draft feedback on writing, think-pair-share activities, concept maps, journal entries, quick writes, classroom polls, observation checklists, peer review, self-assessment checklists, and AI-generated writing feedback. Any activity that gives teachers real-time evidence of student understanding qualifies.

What does the research say about formative assessment?

Black & Wiliam's landmark 1998 review of 250+ studies found effect sizes of 0.4–0.7 for formative assessment — among the highest of any educational intervention ever studied. A subsequent meta-analysis by Kingston & Nash (2011) found a median effect size of 0.25, with stronger effects in ELA (0.32) than math (0.17). Graham, Hebert & Harris (2015) found that writing feedback has an effect size of 0.87 (adult feedback) and 0.58 (peer feedback).

Can AI help with formative assessment?

Yes. AI tools like EasyClass provide instant, detailed feedback on student writing — enabling teachers to give formative feedback on every draft without spending hours grading. The AI handles the first pass, generating specific strengths and next steps for each student. Teachers review and personalize. This makes low-stakes formative grading feasible at scale for any class size.

How do I start using formative assessment with AI?

Start small: use AI to grade one set of exit tickets or writing drafts. In EasyClass, choose the "encouraging" grading style for low-stakes feedback, select a formative rubric, and specify this is a draft. The AI generates feedback in 90 seconds per student. Review, adjust, and share with students. Use the class analytics dashboard to identify who needs reteaching.

Start Giving Better Feedback Tomorrow

Low-stakes. High-impact. Grade exit tickets, drafts, and quick writes with AI-powered formative feedback your students will actually use.

Free forever plan. No credit card required. FERPA compliant.

Free forever plan|No credit card required|FERPA compliant

What Is Formative Assessment — And Why Is It the Highest-Leverage Teaching Skill?

Formative assessment is the ongoing process of gathering evidence of student learning during instruction — and using that evidence to adjust teaching. It's the check-in, not the final exam. The sticky note, not the standardized test. The thumbs up/thumbs down, not the end-of-unit project.

The term gets its power from the word formative: this type of assessment is meant to form and shape instruction. It answers the question teachers need answered every day: Do my students understand this, or am I teaching into a void?

Contrast this with summative assessment — tests, projects, and grades that evaluate learning after a unit is complete. Summative assessment is important for reporting and accountability, but by the time you know a student missed a concept on the final exam, it's too late to re-teach it. Formative assessment catches misunderstanding while there's still time to fix it.

The research base is extraordinary. Dylan Wiliam and Paul Black's 1998 meta-analysis Inside the Black Box — still one of the most-cited papers in education — found that frequent, low-stakes formative assessment improved student achievement more than almost any other intervention, with effect sizes equivalent to raising achievement by 0.4 to 0.7 standard deviations. In plain English: used consistently, formative assessment can mean the difference between a C and an A for a struggling student.

Key Distinction

Formative vs. Summative Assessment: A Clear Comparison

FeatureFormative AssessmentSummative Assessment
WhenDuring learningAfter learning
PurposeGuide instructionEvaluate learning
FrequencyDaily or weeklyEnd of unit/semester
Grade weightUsually ungraded or low-stakesHigh-stakes, counts significantly
Student roleActive feedback receiverPerformance demonstrator
Teacher actionAdjust, reteach, accelerateRecord, report
ExamplesExit tickets, polls, observationsUnit tests, essays, final projects

How EasyClass Makes Formative Assessment Faster

The barrier to consistent formative assessment is time — specifically, the time it takes to create good check-in activities. EasyClass removes that barrier with free AI tools that generate formative materials in seconds:

Exit Ticket Generator

Type your lesson topic and learning objective. Get a 3–5 question exit ticket with multiple choice, short answer, and reflection prompts — ready to print or share via QR code.

Quiz Generator

Build a low-stakes formative quiz aligned to your standard. Choose number of questions, difficulty, and question types. Outputs a Google-Form-ready or printable quiz.

Rubric Generator

For project-based or performance-based formative checks. Enter the task and learning standards; get a clear rubric students can self-assess with.

Discussion Prompt Generator

Generates Socratic questioning prompts or turn-and-talk stems for verbal formative checks during class.

Strategy Reference

20 Formative Assessment Strategies — When and Why to Use Each

#StrategyBest ForTimingWhat You Learn
1Exit TicketAll grades, all subjectsEnd of classDid students grasp today's core concept?
2Thumbs Up / Sideways / DownK-8, quick comprehension checkMid-lessonWhole-class confidence level
3Think-Pair-ShareAll grades, especially ELL supportMid-lessonDepth of understanding; peer explanation ability
4Mini Whiteboard ResponseMath, science, vocabularyAny point in lessonSimultaneous whole-class answer reveal
5Cold Call with Wait TimeSecondary, discussion classesDuring instructionIndividual understanding, not just eager students
6Muddiest PointSecondary, complex conceptsEnd of classWhat students found most confusing
73-2-1 ReflectionAll grades, any content areaEnd of lesson or unit segmentComprehension + lingering questions
8Traffic Light Self-AssessmentGrades 3–12After practice or readingStudent metacognition and self-awareness
9Kahoot / Poll / QuizizzGrades 3–12, reviewMid or end of lessonMultiple choice accuracy across class
10Observation ClipboardK-5, lab work, group tasksDuring work timeProcess skills, misconceptions, collaboration
11Gallery Walk with Sticky NotesAll grades, project or reading workAfter activityPeer feedback quality and content understanding
12Two Stars and a WishPeer review, writingAfter draftingStudent ability to give and receive feedback
13Fist-to-FiveAll gradesAfter instructionWhole-class confidence gradient
14Error AnalysisMath, secondary scienceAfter practiceSpecific misconceptions in the class
15Learning Journals / NotebooksAll grades, ongoingWeekly reviewLongitudinal understanding growth
16Student-Generated QuestionsGrades 5–12Mid or end of unitDepth of inquiry and conceptual gaps
17Quick Write / One-Sentence SummaryHumanities, scienceEnd of lessonSynthesis and communication of ideas
18Socratic Seminar TrackingGrades 6–12During discussionParticipation, reasoning quality, text use
19Self-Assessment ChecklistProject-based, all gradesDuring or after taskStudent self-monitoring and goal-setting
20Digital Check-In (Google Form)All grades with tech accessAsynchronous or in-classDetailed written evidence of understanding

Deep Dive: The 5 Most Impactful Strategies

Exit Tickets

Exit tickets are the highest-leverage single habit a teacher can build. A three-question exit ticket at the end of every lesson — one recall question, one application question, one reflection question — gives you enough data in 5 minutes to plan the next day's lesson with precision.

Generate a differentiated exit ticket

Error Analysis

Error analysis is underused in most classrooms. Instead of correcting and returning a quiz, present a set of common errors (anonymized from student work) and ask the class to identify and fix the mistake. Students learn more from analyzing errors than from seeing the right answer.

Think-Pair-Share

Think-Pair-Share is deceptively powerful because it forces every student to formulate an answer — not just the fastest hand-raisers. The pair conversation surfaces misconceptions before they solidify, and the share-out gives you a sample of where the class is. It costs zero prep time.

Traffic Light Self-Assessment

Traffic Light Self-Assessment builds metacognition — one of the highest-impact skills for long-term academic success (Hattie effect size: 0.69). Students who accurately know what they know are better learners. Routine self-assessment trains this skill.

Muddiest Point

Muddiest Point is perfect for secondary teachers covering dense content. Students write one sentence: "The muddiest point for me today was ___." Sort the responses in under 3 minutes; the most common muddy points become your next day's warm-up.

FAQ

Formative Assessment — Frequently Asked Questions

What is the difference between formative and summative assessment?

Formative assessment happens during learning — its purpose is to give teachers and students real-time feedback so instruction can adapt. Summative assessment happens after learning — it evaluates how much students learned over a unit or course. A daily exit ticket is formative. The end-of-unit test is summative. Both are necessary, but formative assessment is what actually drives learning improvement.

Does formative assessment have to be graded?

No — and in most cases, it shouldn't be. The power of formative assessment comes from the feedback loop, not the grade. When students know an exit ticket won't be graded, they're more likely to be honest about what they don't understand. Teachers get better data. Students get better support. Many educators mark formative work as 'completion' only, or don't record it at all — the information is used for instructional planning, not report cards.

How often should I use formative assessment?

Daily, if possible. The research (Wiliam, 2011; Hattie, 2009) consistently shows that frequency matters more than length. A 3-minute exit ticket every day is more powerful than a 30-minute quiz every two weeks. Even a quick thumbs up/down before you move to the next topic counts — the habit of checking for understanding is what drives results.

What are the best free formative assessment tools for teachers?

EasyClass.ai offers free exit ticket, quiz, and discussion prompt generators with no account required. Other widely-used free tools include Google Forms (for digital check-ins), Padlet (for reflection walls), and Quizlet (for vocabulary checks). The advantage of EasyClass is that all tools are built for teachers, not general audiences, and integrate with lesson planning in the same platform.

How do I use formative assessment data to change my teaching?

After collecting a formative check (exit ticket, quiz, observation), sort responses into three buckets: Got It, Almost There, and Needs Reteach. Use this to inform the next lesson: "Got It" students can extend or peer-tutor; "Almost There" students need one more targeted example; "Needs Reteach" students need a small-group pull or a different explanation approach.

What is the research basis for formative assessment?

Formative assessment has one of the strongest evidence bases in all of educational research. The landmark 1998 meta-analysis by Black and Wiliam ('Inside the Black Box') found that improving formative assessment in classrooms yields effect sizes of +0.4 to +0.7 — among the largest of any instructional intervention. John Hattie's meta-meta-analysis (Visible Learning, 2009) placed formative evaluation at an effect size of +0.90 — nearly double the average instructional effect. The research consistently shows three key mechanisms: (1) students knowing their current level relative to the goal; (2) teachers receiving information to adjust instruction; (3) students developing self-regulation through feedback and reflection. Formative assessment works when it's ongoing and responsive — not just as an add-on, but as the central loop of instruction.

What is the difference between low-stakes and high-stakes assessment?

Low-stakes assessments are formative by nature: no or minimal grade weight, used to gather information for instruction and learning, not for reporting or accountability. Examples: exit tickets, thumbs up/middle/down checks, oral questioning, whiteboard responses, and observation notes. High-stakes assessments are summative: significant grade weight, used for reporting, decisions, or accountability. Examples: unit tests, state standardized tests, final exams, AP assessments. The key principle is that assessment purpose should drive design: if you want information to improve learning, design a low-stakes formative check. If you want to evaluate final achievement, design a high-stakes summative. Problems arise when teachers use high-stakes formats (graded tests) for formative purposes — the grade pressure changes how students engage with feedback.

Build formative assessments now with the free exit ticket generator or the AI quiz generator.

Formative Assessment Strategies and Free AI Tools — EasyClass