-
Notifications
You must be signed in to change notification settings - Fork 21
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Huge amount of upload errors on Grafana Cloud: resource_exhausted push rate limit #63
Comments
Thank you for reporting the issue @f0o. Indeed, Go profiles can be quite large depending on the workload. The upload rate can be changed via the I'm wondering which profile types are enabled. Napkin math shows that each of the apps generates ~100KB of profiling data (uncompressed) every 15 seconds – this is an unexpectedly high data rate. Could you please tell us more about the workload? I'd also like to clarify how many individual processes you're profiling, and what you mean by apps – do you mean 20 instances (processes/hosts/pods) of the same service, or 20 logical services, represented by some fleet? |
I'll look into the UploadRate parameter and tweak it once the retention expires those old profiles. I'm using:
With:
And for clarification it's 3 services amounting to 19-20 pods, each very small in resource consumption (we're talking 0.05 cpu and 32-64mb memory). The workload is best described with signal/data forwarding without processing. I was about to write the processing service when I noticed these errors and started disabling profiling everywhere instead. |
Hi @f0o, thank you for the feedback. I'll double-check everything and report back soon. In the meantime, please consider disabling goroutine, mutex, and block profiles. |
Every few seconds pyroscope client using the reference config in README errors with:
We're not talking about hundreds of apps here; it's only 19-20.
What limit am I hitting and what config should I use to prevent/mitigate it?
//Edit:
Turns out 20 apps running for 4 days == 50G of data. How can I limit the sampling/reporting rate because this is insane
The text was updated successfully, but these errors were encountered: