Rename `db.collection.name` to `db.target.name` #1527

lmolkova · 2024-10-29T03:01:00Z

Changes

db.collection.name does not work well for all possible database objects such as stored procedures, user defined functions, triggers, users (e.g. when performing GRANT queries).

This PR renames it to db.target.name to cover all these cases

Merge requirement checklist

CONTRIBUTING.md guidelines followed.
Change log entry added, according to the guidelines in When to add a changelog entry.
- If your PR does not need a change log, start the PR title with [chore]
schema-next.yaml updated with changes to existing conventions.

schema-next.yaml

model/database/spans.yaml

model/database/common.yaml

bogdandrutu · 2024-10-31T14:59:21Z

Few comments here:

There may be multiple tables/functions (only one store procedure). Because of that I think we should make it a list of strings this argument.
Ideally I would separate "storage" targets from "executions" targets. The reason is that the "storage" targets like tables will not have traces, but "executions" targets will have.

If you comment that if traces exists for other executions then the spans will be available, that is not the case in some systems, since the function/procedure are user-code and traces may be in a different system than the platform traces.

model/database/registry.yaml

cbandy · 2024-10-31T18:04:22Z

There may be multiple tables/functions (only one store procedure). Because of that I think we should make it a list of strings this argument.

The same can be said of db.namespace, perhaps.

My opinion is that deeper insights belong in more specific fields. If a database (engine/system) can or wants to indicate that the operation involves multiple tables, indices, tablets, pages, locks, what-have-you; then it can go in a tech-specific field.

Co-authored-by: Trask Stalnaker <[email protected]>

lmolkova · 2024-11-04T21:26:22Z

@bogdandrutu

There may be multiple tables/functions (only one store procedure). Because of that I think we should make it a list of strings this argument.

Array will be problematic on metrics and will be significantly harder to use for noSQL databases which primarily operate on one collection. It also has much higher cardinality.

Having array of operations and array of collections will be even worse for usability.

That's why we introduced db.query.summary - https://github.com/open-telemetry/semantic-conventions/blob/main/docs/database/database-spans.md#generating-a-summary-of-the-query-text which would contain something like INSERT shipping_details SELECT orders for query like

INSERT INTO shipping_details
            (order_id,
            address)
SELECT order_id,
       address
FROM   orders
WHERE  order_id = ?

and will be a better representation of a query than arrays of operations and tables.

Ideally I would separate "storage" targets from "executions" targets. The reason is that the "storage" targets like tables will not have traces, but "executions" targets will have.

I'm not sure I understand what the difference between storage and execution is. CREATE TABLE foo ... or CREATE USER user_name is traced in the same way as SELECT * from foo where .... Instrumentations are not even required to parse query text and will create spans in the same way for them.

If you comment that if traces exists for other executions then the spans will be available, that is not the case in some systems, since the function/procedure are user-code and traces may be in a different system than the platform traces.

We're only defining client DB conventions now.

lmolkova · 2024-11-04T21:37:51Z

@cbandy

The same can be said of db.namespace, perhaps.
My opinion is that deeper insights belong in more specific fields. If a database (engine/system) can or wants to indicate that the operation involves multiple tables, indices, tablets, pages, locks, what-have-you; then it can go in a tech-specific field.

Deeper insights are available on db.query.text. Query text is not available on metrics (by default) due to high-ish cardinality, so we introduced db.query.summary that contains perations and targets - https://github.com/open-telemetry/semantic-conventions/blob/main/docs/database/database-spans.md#generating-a-summary-of-the-query-text

db.namespace (for SQL database) is defined as The database associated with the connection - you should not have more than one at the same time.

Cross-DB queries in common case would have fully-qualified table names, i.e. multiple database names will be captured as a part of each target name and inside db.query.text|summary. Generic instrumentation cannot really say if something in the table name represents an database or a schema or a part of a table name - in case that database supports . in the table name.

lmolkova requested review from a team as code owners October 29, 2024 03:01

trask reviewed Oct 29, 2024

View reviewed changes

schema-next.yaml Outdated Show resolved Hide resolved

model/database/spans.yaml Outdated Show resolved Hide resolved

model/database/common.yaml Outdated Show resolved Hide resolved

model/database/common.yaml Outdated Show resolved Hide resolved

trask approved these changes Oct 29, 2024

View reviewed changes

XSAM approved these changes Oct 29, 2024

View reviewed changes

bogdandrutu reviewed Oct 31, 2024

View reviewed changes

model/database/registry.yaml Outdated Show resolved Hide resolved

lmolkova and others added 6 commits November 4, 2024 13:01

Rename db.collection.name to db.target.name

6627495

Changelog

75ead19

lint

df7938a

Update schema-next.yaml

9a00920

Co-authored-by: Trask Stalnaker <[email protected]>

Apply suggestions from code review

e0ba172

Co-authored-by: Trask Stalnaker <[email protected]>

regenerate tables

dab38f2

review

b23a85c

lmolkova force-pushed the collection-to-target branch from f4e4c06 to b23a85c Compare November 4, 2024 21:40

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Rename `db.collection.name` to `db.target.name` #1527

Rename `db.collection.name` to `db.target.name` #1527

lmolkova commented Oct 29, 2024

bogdandrutu commented Oct 31, 2024

cbandy commented Oct 31, 2024

lmolkova commented Nov 4, 2024

lmolkova commented Nov 4, 2024 •

edited

Loading

Rename db.collection.name to db.target.name #1527

Are you sure you want to change the base?

Rename db.collection.name to db.target.name #1527

Conversation

lmolkova commented Oct 29, 2024

Changes

Merge requirement checklist

bogdandrutu commented Oct 31, 2024

cbandy commented Oct 31, 2024

lmolkova commented Nov 4, 2024

lmolkova commented Nov 4, 2024 • edited Loading

Rename `db.collection.name` to `db.target.name` #1527

Rename `db.collection.name` to `db.target.name` #1527

lmolkova commented Nov 4, 2024 •

edited

Loading