Cortical-Engagement Scoring Contract
This contract pins the experimental commitments of A-013, the lenia-swarm TRIBE overlay addendum. Changes to it require an addendum-internal review.
Frame
TRIBE-engagement is a relative score over Lenia creatures. We do not claim absolute neural realism. The contract is about reproducibility of the scoring procedure, not about cortex-as-ground-truth.
Inputs
- A TRIBE v2 checkpoint identified by HuggingFace revision hash.
- A stimulus manifest. Each entry is content-addressed and carries a class label drawn from
{lenia, biomotion_positive, scrambled, noise, static, rigid_motion, grating}plus stimulus-specific metadata. - For lenia-class stimuli, a reference into the lenia-swarm compendium giving the source trajectory and the morphospace coordinates already computed there. We do not recompute morphospace features inside this addendum; we consume them.
Outputs
For every (checkpoint, stimulus) pair:
- A vector of predicted per-voxel activations on
fsaverage5(20484 vertices, [left|right] hemispheres stacked per TRIBE’s convention). - A reduced vector of per-ROI mean activations for the registered ROI bundle (initial set:
sts,lateral_ot,v1_proxy, defined via Destrieux atlas labels inlenia_tribe_overlay.rois). - A provenance record (TRIBE revision, codebase commit, torch and CUDA versions, machine identifier).
For lenia-class stimuli with a manifest-supplied specimen_id, an overlay report joins the ROI scores to the lenia_terminal_v1 feature space in the lenia-swarm morphospace warehouse and emits one row per linked specimen with ROI scores and the 16 lenia descriptor axes (normalized_value per feature_values.axis_id). Specimens whose specimen_id is not present in the warehouse cause the overlay to fail loudly; the warehouse is the source of truth for axis values.
Outputs are written under $SPECTER_ARTIFACT_ROOT/lenia-tribe-overlay/ if set, otherwise under addenda/lenia-tribe-overlay/.artifacts/.
Redundancy verdict
For lenia-class stimuli with linked specimen_ids, lenia-tribe-correlate computes Pearson r between every ROI score and every lenia_terminal_v1 axis. An ROI is tagged REDUNDANT if its top abs(r) crosses a threshold (default 0.85), candidate-new otherwise. The threshold and the n_specimens floor (currently 4) are part of the contract because they decide the verdict; changing either requires reissuing prior verdicts.
Sanity gate
A run of the experiment is invalid unless the sanity gate, defined in lenia_tribe_overlay.sanity, has been executed against the same TRIBE checkpoint within the last seven days and emitted a pass record with the same checkpoint hash. The gate confirms only one thing:
- Across-stimulus variance over a small OOD probe set exceeds
VARIANCE_FLOOR = 1e-4. TRIBE must produce visibly differentiated whole-cortex predictions for visually distinct probes.
If (1) fails, the gate hard-errors with a diagnostic. Region-specific claims (e.g. STS responds more to a creature than to noise) are not part of the gate, because the bio-motion ROIs are ~1-2% of cortex and a localized signal can be drowned out in any whole-cortex aggregate. Those claims belong to the experiment proper.
Determinism
- Stimulus tensors are precomputed and stored content-addressed; the same stimulus produces the same input tensor every run.
- TRIBE inference is run with deterministic kernels where supported. Where it is not, the non-determinism is recorded and the experiment reports the standard deviation across N replicate runs.
What this contract does not promise
- We do not claim the predicted activations correspond to any particular individual human’s response.
- We do not claim that elevated predicted activation in any ROI for a Lenia stimulus implies that humans perceive that stimulus a particular way. The model predicts a hemodynamic proxy.
- ROI masks are anatomical (Destrieux), not functional.
sts,lateral_ot, andv1_proxyare loose proxies for STS, MT/EBA-lateral OTC, and V1 respectively; small-effect contrasts are not interpretable at this resolution. - We do not commit to any specific clustering or significance test upfront. The analysis layer is allowed to evolve. The contract pins what is measured and recorded, not what is concluded.