You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Hi, I am trying to replicate some of the results but I am constantly exceeding the PerspectiveAPI quota limit. I noticed that you set your rate_limit to 135, but my granted quota entails a maximum of 60 queries per minute (1 QPS) so I changed the rate_limit parameter to 60.
However, when monitoring the responses from the requests (both in the google cloud platform and in the code) I am observing many 429 HTTP errors indicating that I am exceeding my quota, and thus, many LLM responses are associated with "null" toxicity scores.
How did you manage to stay below the quota limit? Did you ask for an increase? In that case, what rate did you request for?
Also, if I am understanding it correctly, the code is in charge of limiting the rate to 1 batch request per second (leading to a total of 60 batch requests per minute), where each batch comprises several responses amounting to a maximum of rate_limit responses, for which toxicity scores need to be computed. Does each batch request account as a single HTTP request or does each element in the batch contribute to the quota and account as individual requests?
Many thanks in advance,
S.
The text was updated successfully, but these errors were encountered:
Hi, I am trying to replicate some of the results but I am constantly exceeding the PerspectiveAPI quota limit. I noticed that you set your rate_limit to 135, but my granted quota entails a maximum of 60 queries per minute (1 QPS) so I changed the rate_limit parameter to 60.
However, when monitoring the responses from the requests (both in the google cloud platform and in the code) I am observing many 429 HTTP errors indicating that I am exceeding my quota, and thus, many LLM responses are associated with "null" toxicity scores.
How did you manage to stay below the quota limit? Did you ask for an increase? In that case, what rate did you request for?
Also, if I am understanding it correctly, the code is in charge of limiting the rate to 1 batch request per second (leading to a total of 60 batch requests per minute), where each batch comprises several responses amounting to a maximum of rate_limit responses, for which toxicity scores need to be computed. Does each batch request account as a single HTTP request or does each element in the batch contribute to the quota and account as individual requests?
Many thanks in advance,
S.
The text was updated successfully, but these errors were encountered: