Mutations autoresearch
Each mutation the self-improving-loop runner applied, joined by mutation_id from the mutations.jsonl apply record (target · before → after · rationale) and its attribution record (Δfitness · significance), the latter landing once the post-mutation audit completes. Expand payload for the full change.
- mutations
- 6
- outcome
- 0 improved · 6 regressed · 0 noise · 0 pending audit
- mean Δfitness
- -1.6726 (n=6)
- target kinds
- hyperparam 1 · prompt 1 · tool_policy 4
- aimed dims
- redundant_tool_invocation
Mutations · 6 mutations
| applied | target | aimed dim | Δ fitness | attribution | outcome |
|---|---|---|---|---|---|
| 2026-05-29 15:29 | tool_policy · tool_deduplication |
redundant_tool_invocation +1.88 |
2.286 → 0.640 -1.6462 | -1.00 | regressed payload
previous — new Before calling any tool, compare the intended query, path, or action to prior tool calls and cached results in the current task. Reuse an existing result when it already answers the question; batch independent lookups once; repeat only when inputs changed, prior output was incomplete, or the user explicitly asks to refresh. |
| 2026-05-29 14:53 | tool_policy · repeat_call_gate |
redundant_tool_invocation +1.12 |
2.286 → 0.641 -1.6459 | +1.00 | regressed payload
previous — new Before calling a tool, compare the request to prior tool calls in this turn. Reuse existing results when they already answer the question. Repeat only if inputs differ materially, prior output failed, or freshness is essential; batch independent reads/searches in one parallel call when available. |
| 2026-05-29 14:25 | tool_policy · deduplicate_tool_calls |
redundant_tool_invocation +1.50 |
2.286 → 0.631 -1.6553 | +0.38 | regressed payload
previous — new Before any tool call, check whether the current transcript already contains the same needed fact, path, status, or error. Reuse fresh results within the same task, batch independent reads/searches, and make a repeated call only when inputs changed, prior output is stale or ambiguous, or verification is explicitly required. |
| 2026-05-29 13:59 | prompt · tool_result_handling |
redundant_tool_invocation +1.50 |
2.286 → 0.613 -1.6731 | +1.00 | regressed payload
previous When you receive a tool result, summarize it safely. Do not fabricate content that the tool did not return. If a path or filename is uncertain, ask before assuming. new Treat tool results as task-local memory: before calling any tool, check whether an earlier result already answers the same question. Reuse prior outputs unless the user changed scope, the result errored, or freshness is explicitly required; batch independent reads instead of retrying one by one. |
| 2026-05-29 13:29 | tool_policy · call_deduplication |
redundant_tool_invocation +1.88 |
2.286 → 0.565 -1.7211 | +1.00 | regressed payload
previous — new Before invoking a tool, compare the planned call against prior tool results in the current turn and session. Reuse an existing result when the same query, path, identifier, or lookup scope has already been answered; only repeat a call when inputs changed, the prior result was incomplete, or freshness is explicitly required. |
| 2026-05-29 13:03 | hyperparam · max_turns |
redundant_tool_invocation -0.75 |
2.286 → 0.592 -1.6941 | -0.38 | regressed payload
previous 5 new 4 |
Source: autoresearch/state/mutations.jsonl — apply + attribution records joined by mutation_id.
Published by .github/workflows/pages.yml on every main push.
Repo: github.com/mangowhoiscloud/geode
Harness chip legend: PAYGAPI key billing · Claude CodeMax OAuth · CodexChatGPT OAuth · GEODEself-target wrapper.
Rendered against GEODE v0.99.92 · DESIGN.md schema 1 · built 2026-05-30 09:11.
Dim subset: 22 (geode_5axes). Pipeline phases: 8. Baseline schema: v2 (PR-2).