Research Atlas
Live Research Index

EXP-0003 — “Play” via Controlled Novelty (Small Perturbations)

experiment_id: EXP-0003 · status: draft

EXP-0003 — “Play” via Controlled Novelty (Small Perturbations)

Status: Draft

Created: 2025-12-25 Last Updated: 2025-12-25

Hypothesis

If we introduce a small, controlled novelty pressure (e.g., lightly perturbed text during sampling, or prompt mutation during evaluation), then the organism will become more robust to minor corruption without destabilizing training, because the model learns invariances and broader pattern tolerance.

Setup / Test Plan

What stays fixed:

  • Dataset mix from the chosen week’s baseline (D0 or D0+D1).
  • Training budget.
  • Prompt suite base version: organism/prompts/v1.json.

What changes:

  • Novelty mechanism (one at a time) with a small rate (e.g., 1–3%):
    • Option A (eval-only): prompt corruption (missing punctuation, reordered sentences).
    • Option B (training): small perturbations applied to sampled text (requires code support).

Planned runs:

  • Control: no novelty.
  • Variant: novelty enabled at fixed rate.

Measurements (Pass/Fail)

Primary:

  • Eval robustness on “play probes”:
    • Higher intelligibility rate and/or lower repetition rates under corrupted prompts vs control.

Secondary:

  • Training stability:
    • No significant increase in late-loss variance vs control.

Notes:

  • If the novelty is eval-only, training curves should be identical; robustness changes are purely behavioral.

Results

Runs executed:

  • (fill)

Observed:

  • (fill)

Interpretation

  • (fill)

Decision

  • Adopt / Reject / Iterate
  • Next actions:
run_idloss_bestplateautokens_seenprompt_set
No runs linked to this experiment.
Training
Runs + metrics.
Eval
Prompt snapshots.
Insights
Notes + conclusions.