Early Release

This evaluator reflects early-stage work. We’re continuously improving its accuracy and reliability.
Now
  • Increasing confidence in annotations, potentially by adding more annotations per row.
  • Expanding coverage up to Grade 12.