
Handle and retry after rate limits from OpenAI #14

Open
dcadenas opened this issue Oct 5, 2023 · 2 comments

Comments

dcadenas commented Oct 5, 2023

No description provided.

dcadenas commented Oct 6, 2023

Retry with exponential backoff is already enabled on the topic, so any rate-limited message should be retried correctly.
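
For reference, a minimal sketch of the kind of retry policy I mean, assuming the topic is Google Cloud Pub/Sub (the project and subscription IDs are placeholders, and the exact backoff bounds are assumptions, not what we have deployed):

```python
from google.cloud import pubsub_v1
from google.protobuf import duration_pb2, field_mask_pb2

subscriber = pubsub_v1.SubscriberClient()
subscription_path = subscriber.subscription_path("my-project", "openai-moderation-sub")

subscription = pubsub_v1.types.Subscription(
    name=subscription_path,
    retry_policy=pubsub_v1.types.RetryPolicy(
        # Pub/Sub applies exponential backoff between these bounds every time
        # the function nacks a message (e.g. after an OpenAI rate-limit error).
        minimum_backoff=duration_pb2.Duration(seconds=10),
        maximum_backoff=duration_pb2.Duration(seconds=600),
    ),
)

with subscriber:
    subscriber.update_subscription(
        request={
            "subscription": subscription,
            "update_mask": field_mask_pb2.FieldMask(paths=["retry_policy"]),
        }
    )
```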

I'm also adding:

  • A Dead Letter topic, so we don't lose messages that still can't be processed, and to make monitoring and alerting easier.
  • Random jitter inside the function, so that when the exponential backoff elapses we don't fire all the retries at the same time (see the sketch after this list).
  • I'm not sure if we have a Slack alert integration we could use when we detect rate-limit errors or topic backlogs increasing.
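
For the jitter piece, a rough sketch of what I have in mind, assuming a plain Python function wrapping the OpenAI call; the names (`call_with_jittered_backoff`, `send_request`, `RateLimitError`) and the delay values are placeholders, not our actual code:

```python
import random
import time

class RateLimitError(Exception):
    """Placeholder for whatever error the OpenAI client raises on HTTP 429."""

def call_with_jittered_backoff(send_request, max_attempts=5, base_delay=1.0, max_delay=60.0):
    """Retry a rate-limited call with exponential backoff plus full random jitter,
    so retries from many messages don't all hit OpenAI at the same instant."""
    for attempt in range(max_attempts):
        try:
            return send_request()
        except RateLimitError:
            if attempt == max_attempts - 1:
                raise  # give up; the message gets nacked and eventually dead-lettered
            delay = min(max_delay, base_delay * (2 ** attempt))
            # Sleep a random fraction of the backoff window instead of the exact
            # backoff, so concurrent retries spread out over time.
            time.sleep(random.uniform(0, delay))
```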

If this is not enough, in the future we could:

  • Send some kind of token-bounded summary of the content to avoid hitting the token limit. Just grabbing a random chunk of the content would be naive, because the offensive part could be somewhere else, but there may be other options we could investigate (a rough sketch of one idea follows this list).
  • Fill in the Increase Request Limit form, but we'd need to prove we really need it.
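
On the token-limit idea, one direction could be splitting the content into token-bounded chunks so every part of the text still gets checked. A sketch, assuming we count tokens with tiktoken; the model name, chunk size, and function name are assumptions:

```python
import tiktoken

def split_into_token_chunks(content: str, max_tokens: int = 3000, model: str = "gpt-3.5-turbo"):
    """Split content into chunks that each fit under the token limit, so every
    part of the text gets checked instead of grabbing a single random slice."""
    encoding = tiktoken.encoding_for_model(model)
    tokens = encoding.encode(content)
    return [
        encoding.decode(tokens[i : i + max_tokens])
        for i in range(0, len(tokens), max_tokens)
    ]
```

Each chunk could then be sent to the moderation call separately and the results merged, so a flagged chunk anywhere in the content still flags the whole message.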

dcadenas commented Oct 7, 2023

Related #15
