Add exponential backoff retry wrapper for OpenAI generation calls — max 3 attempts #206

Closed
opened 2026-04-09 03:31:01 -04:00 by pook · 1 comment
Owner

Replaces stale #128 with a tighter scope. The document generation service calls OpenAI without retry logic, causing user-visible failures on transient 429/500 errors.

Acceptance criteria:

  • Create a retry utility function: max 3 attempts, exponential backoff (1s, 2s, 4s)
  • Retry only on HTTP 429, 500, 502, 503 status codes
  • Wrap the OpenAI API call in the document generation service with this retry
  • Add unit tests for the retry utility (mock fetch, verify attempt count and delay)
  • Do NOT retry on 400/401/403 (client errors)
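
A minimal sketch of what the retry utility could look like, under stated assumptions rather than as the service's actual code. The names `withRetry`, `RetryOptions`, and `statusOf` are hypothetical, and the sketch assumes the wrapped call rejects with an error that exposes a numeric `status` property. Note that with 3 attempts only the 1s and 2s delays are ever used; the 4s step would apply only if a fourth attempt were allowed.

```typescript
// Hypothetical sketch: withRetry, RetryOptions and statusOf are illustrative names,
// not identifiers from the compliancebot codebase.

/** Status codes the acceptance criteria list as retryable. */
const RETRYABLE_STATUS = new Set<number>([429, 500, 502, 503]);

interface RetryOptions {
  maxAttempts?: number; // total attempts, including the first one
  baseDelayMs?: number; // delay before the second attempt
  sleep?: (ms: number) => Promise<void>; // injectable so tests do not have to wait
}

const defaultSleep = (ms: number): Promise<void> =>
  new Promise<void>((resolve) => setTimeout(resolve, ms));

/** Reads a numeric `status` from a thrown value, if it has one (assumed error shape). */
function statusOf(err: unknown): number | undefined {
  if (typeof err === "object" && err !== null && "status" in err) {
    const status = (err as { status: unknown }).status;
    return typeof status === "number" ? status : undefined;
  }
  return undefined;
}

async function withRetry<T>(
  fn: () => Promise<T>,
  options: RetryOptions = {},
): Promise<T> {
  const maxAttempts = options.maxAttempts ?? 3;
  const baseDelayMs = options.baseDelayMs ?? 1000;
  const sleep = options.sleep ?? defaultSleep;

  for (let attempt = 1; ; attempt++) {
    try {
      return await fn();
    } catch (err) {
      const status = statusOf(err);
      const retryable = status !== undefined && RETRYABLE_STATUS.has(status);
      // Give up on client errors (400/401/403, ...) or once attempts are exhausted.
      if (!retryable || attempt >= maxAttempts) throw err;
      // The delay doubles after each failed attempt: 1s, then 2s, ...
      await sleep(baseDelayMs * 2 ** (attempt - 1));
    }
  }
}
```

Injecting `sleep` is a design choice that lets a unit test record the requested delays and assert on the attempt count without real timers. The OpenAI call would be passed in as `fn`, for example `withRetry(() => generateDocument(prompt))`, where `generateDocument` is a placeholder for the existing service function.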

Generated by CEO Planner (priority: 3)

Author
Owner

Bulk-closed 2026-04-10 during pipeline triage.

Context: the CEO agent had created 100 open agent-task issues against compliancebot, most of them duplicates of each other and of the 50 currently open PRs. The root cause was traced to a git-push race in the agent-worker executor: dispatch jobs collided on branch agent/dispatch/* because the jobId prefix was truncated to the literal "dispatch". Fix deployed: runId is now threaded from the Paperclip shim through /dispatch → TaskJob → executor, and branches are keyed on a 12-char unique run key.
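
To illustrate why keying branches on a per-run value avoids the collision, here is a rough sketch. It is not the deployed fix: `branchForRun` is a hypothetical name, and deriving the key by taking the first 12 alphanumeric characters of a random UUID-style runId is an assumption, since the comment does not say how the 12-char key is produced.

```typescript
// Hypothetical sketch: branchForRun and the key derivation are assumptions,
// not the agent-worker's actual implementation.

/** Builds a branch name from a per-run identifier instead of a shared job prefix. */
function branchForRun(runId: string): string {
  // Strip separators so the key contains only characters safe in a branch name.
  const key = runId.replace(/[^a-zA-Z0-9]/g, "").slice(0, 12);
  if (key.length < 12) {
    throw new Error("runId too short to derive a 12-char run key");
  }
  return `agent/dispatch/${key}`;
}

// Two concurrent runs now push to different branches.
const branchA = branchForRun("3f2b8c1e-9a4d-4e7b-b1c2-0d5e6f7a8b9c");
const branchB = branchForRun("a81d0c55-27f3-4b6a-9e10-4c2d7f8e1b3a");
console.log(branchA, branchB);
```

The point of the sketch is only that the key varies per run; any derivation that is unique per run would serve the same purpose.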

What to do next:

  1. Triage the 50 open PRs at https://192.168.183.110:3000/pook/compliancebot/pulls — many are ready to merge
  2. CEO should halt new task creation until open PRs drop below 10
  3. Issues kept open: #313, #314, #315, #341, #342, #350, #351, #352 (PR review/merge tasks)

This issue was superseded, not abandoned. Reopen if still relevant after PR triage.

pook closed this issue 2026-04-10 14:48:27 -04:00
Reference
pook/compliancebot#206