Hypernym Infinite Memory · v0.64 closeout · generated 2026-06-10
The tiny lease passed. The large row was still rejected.
v0.64 tested whether two exact OK chat probes could safely reserve the shared direct-IP lane before resuming memory-pressure rows. The probes passed, but the immediately following 2048-pressure benchmark row returned seven fast 503 busy responses. This is an admission-control finding, not a recall result.
Lane lease
2 / 2
Two exact
OK probes passed.Benchmark rows scored
0
No memory row reached HTTP 200.
First row attempts
7
All fast 503 busy.
Recall claim
None
No success or failure should be inferred.
What We Learned
- A tiny successful probe does not predict admission for a large 2048-pressure row on the shared endpoint.
- The readiness problem is now precise:
/health OK, tiny chatOK x2, large row503. - The harness kept the result out of memory-quality scoring, which preserves the science.
What Not To Claim
- Do not claim story-canon recall failed.
- Do not claim cross-domain recall failed.
- Do not use this as a model-quality datapoint.
Trace
| Step | Status | Elapsed | Meaning |
|---|---|---|---|
| Idle probe 1 | 200 OK | 0.2769s | Exact content OK; success streak 1. |
| Idle probe 2 | 200 OK | 0.3279s | Exact content OK; success streak 2. |
| First benchmark row | 503 x 7 | ~0.50-0.55s each | Backend busy: 1 request(s) already in flight. |
| Classification | capacity only | pre-score | lane_probe_passed_but_benchmark_row_busy. |
CTO Decision
Institutional eval readiness must be request-size-aware or explicitly reserved.
- Best: isolated benchmark lane.
- Strong: server-side lease or queue token.
- Fallback: calibration probe near benchmark-row size.
- Operational fallback: coordinated quiet window with other shared-service users.
Carryforward Memory Evidence
v0.62 remains the latest positive memory-quality result: 4/4 scored research-update rows passed strict and semantic true-north at 1024/2048 pressure, with 247,962 prompt tokens counted on successful rows. v0.64 only changes the serving-readiness requirements.
Data Trace
| Evidence | Path | Use |
|---|---|---|
| v0.64 scores | research/tracks/hypernym-infinite-mim/results/v0.64-lane-lease-unscored-domain-resume/20260610T_lane_lease_unscored_domain_resume_live_codex_v1/scores.json | Aggregate stop reason and classification. |
| Lane lease trace | research/tracks/hypernym-infinite-mim/results/v0.64-lane-lease-unscored-domain-resume/20260610T_lane_lease_unscored_domain_resume_live_codex_v1/idle-probe-before-first-row-attempts.json | Two exact OK successes. |
| First row attempts | research/tracks/hypernym-infinite-mim/results/v0.64-lane-lease-unscored-domain-resume/20260610T_lane_lease_unscored_domain_resume_live_codex_v1/v064-story_canon_supersession-controller_expands_tail_contract-p2048-attempts.json | Seven fast 503 responses. |
| Snapshot | .forge/artifacts/cxdb-hypernym-infinite-mim-post-v064-interrupted-snapshot-20260610T075855Z.md | Durable handoff. |
| v0.62 positive result | research/tracks/hypernym-infinite-mim/results/v0.62-tail-contract-cross-domain-pressure/20260610T_tail_contract_cross_domain_pressure_live_codex_v1/scores.json | Last scored recall evidence. |
Compound Research Chain
| Artifact | Pointer |
|---|---|
| Current public board | https://hypernym-infinite-memory-v09.pages.dev/ |
| Previous immutable v0.63 board | https://df021339.hypernym-infinite-memory-v09.pages.dev/ |
| Working memory | research/tracks/hypernym-infinite-mim/WORKING_MEMORY.md |