Intertextuality - AI Developer Tools for Education

Evaluator last updated June 15, 2026.

At a glance


Input type	Informational text
Passage length	200 words or more
Supported grades	3–12

Intertextuality is what a text assumes students have already encountered (another text, a cultural reference, genre conventions, etc.) and then builds on to create meaning. It is one of the least visible qualitative dimensions of text complexity – unlike Vocabulary or Sentence Structure, there is no word to flag or clause to count. For example, a passage can otherwise read as appropriate for a grade level, but incorrectly assume that students will recognize all its cultural references and genre conventions. The Intertextuality Evaluator assesses how much a text’s meaning depends on this kind of relational knowledge, helping developers optimize prompts, match texts to instructional goals, and maintain quality at scale.

Intertextuality is distinct from Subject Matter Knowledge:

Subject Matter Knowledge – Content that a text assumes (facts, concepts, domain knowledge)
Intertextuality - Relational knowledge that a text assumes (prior encounters with texts, references and conventions)

Model and prompt

For instructions on running the evaluator, see Quickstart.


Model used	gemini-3-flash-preview
Temperature	0
Prompts	View prompts ↗
Python notebook	View notebook ↗

The prompt is optimized for the model mentioned above. If you use other models and parameters, the output accuracy and result will vary.

Inputs

Requirement	Supported	Required
Text passage	Informational text	Yes
Target grade level	Enables grade-specific guidance	No

Output

Field	Description
`complexity_score`	Intertextual complexity level: `slightly_complex`, `moderately_complex`, `very_complex`, or `exceedingly_complex`.
`reasoning`	Synthesized decision that explains the rating and relevant context, cites specific references and their impact on comprehension, etc.
`detailed_summary`	Individual factors that drive the rating (specific references, allusions, genre conventions, and shared discourse detected, with descriptions and their effect on intertextual demand)
`assumptions_and_scaffolding`	Analysis of what the author assumes the reader already knows versus what the text explains, with suggestions for making the demands accessible
`recommended_use_cases`	Instructional opportunities for using the text and its intertextual demands strategically.

Interpreting results

The evaluator returns one of the following ratings, along with reasoning, to help developers determine the best course of action. Ratings are based on how many references or allusions are present, how central they are to meaning, and how widely accessible they are likely to be for students at the target grade.

Rating	Meaning
Slightly complex	Few or no references; any allusions are explained in the text or widely familiar. Meaning does not depend on outside textual or cultural encounters.
Moderately complex	Some references, allusions, or genre conventions are present and accessible to grade-level readers. They enrich meaning but rarely block it.
Very complex	Multiple references are central to meaning; the text assumes familiarity with other texts, cultural touchstones, or genre conventions it does not introduce.
Exceedingly complex	Dense, layered references are essential to meaning; the text assumes deep familiarity with a body of texts, traditions, or discourse, and is largely inaccessible without prior encounters.

More complex ratings indicate a text whose meaning depends more substantially on prior cultural or textual encounters. Intertextual complexity isn’t inherently good or bad; it depends on instructional goals. A text rated “Very complex” or “Exceedingly complex” may be ideal for teaching students to analyze how authors use allusion and convention to build arguments, while the same rating may signal a mismatch — and a need for scaffolding — if the goal is content comprehension.

See Grade Level Appropriateness for an evaluator that synthesizes all text complexity dimensions and makes an overall grade-level judgment.

Rubric

Slightly complex	Moderately complex	Very complex	Exceedingly complex
Few or no intertextual references; any present are explained or widely familiar; meaning is self-contained.	Some references, allusions, or genre conventions; generally accessible at grade level; meaning is enriched but not dependent on them.	Multiple references central to meaning; assumes familiarity with texts, cultural touchstones, or genre conventions not introduced in the text.	Dense, layered references essential to meaning; assumes deep familiarity with a body of texts, traditions, or discourse not part of the text.

Based on SAP’s ↗ Qualitative Text Complexity Rubric for Informational Text ↗, with a derivative rubric for the intertextuality dimension.

Accuracy and validation

This evaluator is provided as Early access. Comprehensive accuracy measures are still evolving, and validation testing is ongoing.

The evaluator was optimized using 25 passages from the CLEAR Corpus↗, annotated by 2 literacy experts from ANET, were used to iteratively refine the prompt against gold-standard examples and expert reasoning. The result wasvalidated through expert review of 20 additional passages.

Metric	Result
Complexity score accuracy	85% agreement with expert annotations
Expert agreement	80% (8 of 10 sampled outputs passed expert review; threshold 60%)
Reasoning soundness	Average 3.5 / 5
Dataset source	CLEAR Corpus↗

The benchmark dataset does not yet include validated examples of “Very complex” or “Exceedingly complex” texts, which are uncommon and difficult to annotate consistently for grades 3–12. Use caution when applying this evaluator to higher grade levels where such texts are more frequent.The evaluator is also tuned for informational texts and may perform less reliably on literary texts.

Evaluator release history

Date	Changes
June 16, 2026	First release.

​At a glance

​Model and prompt

​Inputs

​Output

​Interpreting results

​Rubric

​Accuracy and validation

​Evaluator release history

At a glance

Model and prompt

Inputs

Output

Interpreting results

Rubric

Accuracy and validation

Evaluator release history