Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

deps: Use opendal's PrometheusLayer directly #4041

Open
Xuanwo opened this issue May 25, 2024 · 4 comments
Open

deps: Use opendal's PrometheusLayer directly #4041

Xuanwo opened this issue May 25, 2024 · 4 comments
Labels
C-enhancement Category Enhancements

Comments

@Xuanwo
Copy link
Contributor

Xuanwo commented May 25, 2024

What type of enhancement is this?

Refactor, Tech debt reduction

What does the enhancement do?

The current implementation of PrometheusLayer closely resembles that of opendal's PrometheusLayer. We should consider using the upstream implementation directly and improving it together with community.

Implementation challenges

Remove

@Xuanwo Xuanwo added the C-enhancement Category Enhancements label May 25, 2024
@evenyag
Copy link
Contributor

evenyag commented May 27, 2024

We should consider using the upstream implementation directly and improving it together with community.

Sure, we used to have trouble with sharing the same registry between PrometheusLayers because they register its metrics to the same registry multiple times. The underlying prometheus library aborts in that case. But this should be fixed by constructing a global layer once and cloning it. I'll have a try after #4037.

Another blocker is that the layer observes the timer before returning Writer/Reader. So it doesn't record the time spent by the Writer/Reader.
https://github.com/apache/opendal/blob/8d181ef3c6ac78ce409e51eb49e969313e323fb6/core/src/layers/prometheus.rs#L351-L374

        timer.observe_duration();
        // write_res holds a writer.
        write_res.map_err(|e| {
            self.stats
                .increment_errors_total(Operation::Write, e.kind());
            e
        })

Is this an expected behavior? If not, I can help address this on the OpenDAL side. Otherwise, we might still need to maintain a different layer in this project.

@Xuanwo
Copy link
Contributor Author

Xuanwo commented May 27, 2024

Another blocker is that the layer observes the timer before returning Writer/Reader. So it doesn't record the time spent by the Writer/Reader.

The idea of tracking the duration for a reader and writer seems illogical to me because it might include irrelevant time periods. For instance, if you create a new writer and have them write every 10 seconds, measuring their duration doesn't make sense.

The current time only measures the cost of read/write, users can use this to know if they spent too much time on opening a reader/writer.

I think we can add a new metrics called bytes_duration_seconds, so we can observe the time we spent on reading/writing bytes.

@waynexia
Copy link
Member

any update here? @evenyag

@evenyag
Copy link
Contributor

evenyag commented Jul 19, 2024

Blocked by apache/opendal#4854 and tikv/rust-prometheus#495

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
C-enhancement Category Enhancements
Projects
None yet
Development

No branches or pull requests

3 participants