Persistence sidecar for EP adopted texts that are indexed but
not yet available (EP Open Data Portal 5–15-day indexing lag).
When Stage B deep-fetch receives UPSTREAM_404: document indexed but content not yet available from the MCP server, the document identifier is recorded
here with { docId, firstObservedAt, lastProbedAt, attempts } so that
subsequent workflow runs can re-probe with exponential back-off instead of
treating the item as a permanent retrieval failure.
Back-off schedule: initial 24 h, doubling each attempt, capped at 72 h.
Documents older than 14 days are escalated (status = ESCALATED) so the
wildcards-blackswans family can handle them.
Description
Persistence sidecar for EP adopted texts that are indexed but not yet available (EP Open Data Portal 5–15-day indexing lag).
When Stage B deep-fetch receives
UPSTREAM_404: document indexed but content not yet availablefrom the MCP server, the document identifier is recorded here with{ docId, firstObservedAt, lastProbedAt, attempts }so that subsequent workflow runs can re-probe with exponential back-off instead of treating the item as a permanent retrieval failure.Back-off schedule: initial 24 h, doubling each attempt, capped at 72 h. Documents older than 14 days are escalated (status = ESCALATED) so the wildcards-blackswans family can handle them.
See
EP Open Data Portal — Developer Corner