Reviewer Toolkit

A static guide for human reviewers assessing Teleodynamic AI evaluation packets. It covers the review checklist, red flags, claim boundary verification, and evidence assessment procedures.

Human review is required for any claim beyond architectural or framing status.

Review Checklist

Work through this checklist for each evaluation packet under review. Mark each item as verified before approving the packet.

# | Check | What to Look For | Pass Condition
1 | Packet Identity | packetId, createdUtc, agentSlug are present and valid | All three fields non-empty, timestamps ISO 8601
2 | Claim Status | claimStatus matches one of 7 valid values from the claim-status matrix | Exact match to an approved status value
3 | No Artificial Life Claims | No field contains claims of artificial life, consciousness, or sentience | Packet is free of prohibited claims
4 | Resource Budget | resourceBudgetSummary has computeBudget, consumed, threshold, status | All 4 sub-fields present with numeric values
5 | Safety Flags | safetyBoundaryFlags contains 6 boolean properties | All 6 flags present with boolean values
6 | Evidence Links | evidenceLinks point to resolvable public URLs | URLs are well-formed; link rot noted separately
7 | Structural Actions | structuralActions entries have op, target, reason, budgetImpact | Each entry has all 4 fields
8 | Caveats Present | caveats field is non-empty and addresses known limitations | Field present with meaningful content
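
Several checklist items are mechanical enough to pre-screen before a human reads the packet. The Python sketch below assumes the packet is a JSON object already parsed into a dict and uses the field names from the table. The function name check_packet and its return shape are hypothetical, "6 boolean properties" is read as exactly six, and datetime.fromisoformat accepts only a subset of ISO 8601. It covers items 1, 5, 7, and 8 and is not a substitute for human review.

```python
from datetime import datetime


def check_packet(packet: dict) -> dict:
    """Return {item number: passed} for checklist items 1, 5, 7, and 8."""
    results = {}

    # Item 1: identity fields are non-empty strings and createdUtc parses.
    identity_ok = all(
        isinstance(packet.get(k), str) and packet[k].strip()
        for k in ("packetId", "createdUtc", "agentSlug")
    )
    if identity_ok:
        try:
            # Older Python versions reject a trailing "Z", so normalize it.
            datetime.fromisoformat(packet["createdUtc"].replace("Z", "+00:00"))
        except ValueError:
            identity_ok = False
    results[1] = identity_ok

    # Item 5: assumed to mean exactly six flags, each a real boolean.
    flags = packet.get("safetyBoundaryFlags")
    results[5] = (
        isinstance(flags, dict)
        and len(flags) == 6
        and all(isinstance(v, bool) for v in flags.values())
    )

    # Item 7: every structural action carries all four fields.
    actions = packet.get("structuralActions")
    required = {"op", "target", "reason", "budgetImpact"}
    results[7] = isinstance(actions, list) and all(
        isinstance(a, dict) and required.issubset(a) for a in actions
    )

    # Item 8: caveats is present and non-empty. Whether the content is
    # meaningful still needs a human judgment.
    results[8] = bool(packet.get("caveats"))
    return results
```

A reviewer could run this first and then spend their attention on the items that need judgment, such as items 2, 3, and 6.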

Red Flags

These patterns warrant immediate escalation or rejection of the evaluation packet. If you encounter any of these, mark the packet as disputed and request clarification.

Claim Boundary Violations

Any claim of artificial life, consciousness, sentience, or biological agency anywhere in the packet. Any claim that Carcinus.org itself is teleodynamic.

Missing Safety Flags

safetyBoundaryFlags object is missing or contains non-boolean values. Absence of the flags section suggests the system is not tracking safety constraints.

Consecutive No-ops Without Explanation

More than 50% of structural proposals rejected as No-ops without clear resource-budget justification. May indicate budget misconfiguration or structural stagnation.
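
The 50% threshold can be checked with a simple ratio. This is a minimal sketch that assumes No-ops appear as structuralActions entries whose op field equals a known label; the label "no-op" and both function names are hypothetical, since this guide does not specify how a No-op is recorded. Whether the accompanying reason is a clear resource-budget justification remains a reviewer judgment.

```python
def noop_ratio(structural_actions: list, noop_label: str = "no-op") -> float:
    """Fraction of structural actions whose op equals noop_label."""
    if not structural_actions:
        return 0.0
    noops = sum(1 for a in structural_actions if a.get("op") == noop_label)
    return noops / len(structural_actions)


def exceeds_noop_threshold(structural_actions: list, threshold: float = 0.5) -> bool:
    """True when strictly more than `threshold` of the actions are No-ops."""
    return noop_ratio(structural_actions) > threshold
```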

Unresolvable Evidence Links

Evidence links return 404 or point to non-public resources. Evidence must be verifiable by any reviewer with internet access.
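
Before fetching anything, a reviewer can filter out links that are not well-formed. The sketch below uses only the standard library and makes no network request, so it cannot detect a 404 or a non-public resource; those still require opening the link. The function name is hypothetical, and restricting schemes to http and https is an assumption.

```python
from urllib.parse import urlparse


def is_well_formed_url(url: str) -> bool:
    """True if url has an http(s) scheme and a host name."""
    try:
        parsed = urlparse(url)
    except ValueError:
        # urlparse raises on some malformed inputs, e.g. a bad IPv6 host.
        return False
    return parsed.scheme in ("http", "https") and bool(parsed.hostname)
```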

Private Credentials Exposed

Any API key, password, token, or private key visible in any packet field. Packets must be public-safe by design.
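
A rough automated scan can point the reviewer at suspicious fields. The following sketch walks a parsed packet and reports the paths of string values that match a small set of patterns. The patterns and the function name are illustrative assumptions, not an exhaustive or official list, so a clean result does not prove the packet is public-safe.

```python
import re

# Illustrative patterns only; a real scanner would need a much broader set.
SECRET_PATTERNS = [
    re.compile(r"-----BEGIN [A-Z ]*PRIVATE KEY-----"),
    re.compile(r"(?i)\b(api[_-]?key|password|passwd|secret|token)\b\s*[:=]\s*\S+"),
]


def find_suspect_strings(value, path="packet"):
    """Walk nested dicts/lists and return paths of strings matching a pattern."""
    hits = []
    if isinstance(value, dict):
        for key, child in value.items():
            hits.extend(find_suspect_strings(child, f"{path}.{key}"))
    elif isinstance(value, list):
        for i, child in enumerate(value):
            hits.extend(find_suspect_strings(child, f"{path}[{i}]"))
    elif isinstance(value, str):
        if any(p.search(value) for p in SECRET_PATTERNS):
            hits.append(path)
    return hits
```

Reporting paths rather than matched text keeps the scan output itself free of the exposed secret.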

Convergence Drop Without Explanation

Fast loop convergence metric below 0.7 without accompanying caveats or structural justification. May indicate model degradation.
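
As a small illustration of this rule, the sketch below flags a packet when the convergence value is under 0.7 and no caveats accompany it. How the metric is stored in a packet is not specified here, so the value is passed in directly. The function name is hypothetical, and the check does not attempt to recognize a structural justification, which is left to the reviewer.

```python
CONVERGENCE_FLOOR = 0.7  # threshold stated in this guide


def convergence_needs_escalation(convergence: float, caveats) -> bool:
    """True when convergence is below the floor and caveats are empty or missing."""
    return convergence < CONVERGENCE_FLOOR and not caveats
```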

Evidence Assessment Guidelines

When assessing evidence linked from an evaluation packet:

Review Outcome Guidance

Outcome | When to Use | Next Step
reviewed | All 8 checklist items pass, no red flags, evidence confirms claims | Packet is approved for public reference
reviewed-with-notes | Passes checklist but has minor caveats or limitations | Approved with documented caveats in reviewer notes
in-review | Review is underway; awaiting additional evidence or clarification | Update status when resolution is reached
disputed | Red flags found, evidence insufficient, or claims are unverifiable | Return to agent with specific disputed items and requested fixes
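
The outcome table can be read as a small decision procedure. The sketch below is one possible encoding, not an official rule set: the function name and parameters are hypothetical, evidence_confirmed uses None to mean "still pending", and treating an incomplete checklist as in-review is an assumption, since the table does not cover that case.

```python
from typing import Optional


def review_outcome(
    checklist_passed: int,
    red_flags: int,
    evidence_confirmed: Optional[bool],
    has_notes: bool = False,
) -> str:
    """Map review findings to one of the four outcome values."""
    # Red flags, or evidence that fails to support the claims: disputed.
    if red_flags > 0 or evidence_confirmed is False:
        return "disputed"
    # Assumption: an incomplete checklist or pending evidence keeps the
    # packet in review rather than approving or disputing it.
    if checklist_passed < 8 or evidence_confirmed is None:
        return "in-review"
    return "reviewed-with-notes" if has_notes else "reviewed"
```

Checking for red flags first reflects the escalation guidance: a red flag leads to disputed regardless of how many checklist items pass.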