
Change gen_ai.response.finish_reasons type from array to string, use it on metrics #1291

Closed

Conversation

lmolkova
Contributor

Fixes #1277

Changes

While OpenAI supports returning multiple completions (and therefore multiple finish reasons), I did not find models outside of OpenAI that return multiple choices, and many SDKs (e.g. Vercel) don't support returning multiple choices.

So effectively there is usually one finish reason.
Having an array for it:

  • complicates querying
  • makes usage of finish reason on metrics more complicated

This PR:

  • turns the finish reasons attribute into a string (comma-separated if there is more than one)
  • adds it to metrics.

A comma-separated list has high-ish theoretical cardinality in edge cases, but gives the best experience in the common case.
I suggest starting with it and considering making it opt-in on metrics if it proves to be problematic.
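The flattening this PR proposes can be sketched as a small helper (the function name is hypothetical; the conventions define only the attribute value format, not an API):

```python
def flatten_finish_reasons(finish_reasons):
    """Join a response's finish reasons into the single comma-separated
    string proposed for gen_ai.response.finish_reasons.
    (Hypothetical helper, for illustration only.)"""
    return ",".join(finish_reasons)

# Common case: one choice, one reason.
print(flatten_finish_reasons(["stop"]))            # stop
# Edge case: multiple choices, e.g. OpenAI with n > 1.
print(flatten_finish_reasons(["stop", "length"]))  # stop,length
```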

Merge requirement checklist

@@ -266,8 +274,9 @@ This metric SHOULD be specified with [ExplicitBucketBoundaries] of
| [`gen_ai.system`](/docs/attributes-registry/gen-ai.md) | string | The Generative AI product as identified by the client or server instrumentation. [2] | `openai` | `Required` | ![Experimental](https://img.shields.io/badge/-experimental-blue) |
| [`error.type`](/docs/attributes-registry/error.md) | string | Describes a class of error the operation ended with. [3] | `timeout`; `java.net.UnknownHostException`; `server_certificate_invalid`; `500` | `Conditionally Required` if the operation ended in an error | ![Stable](https://img.shields.io/badge/-stable-lightgreen) |
| [`server.port`](/docs/attributes-registry/server.md) | int | Server port number. [4] | `80`; `8080`; `443` | `Conditionally Required` If `server.address` is set. | ![Stable](https://img.shields.io/badge/-stable-lightgreen) |
| [`gen_ai.response.finish_reason`](/docs/attributes-registry/gen-ai.md) | string | The reason(s) the model stopped generating tokens. [5] | `stop`; `stop,length` | `Recommended` | ![Experimental](https://img.shields.io/badge/-experimental-blue) |
Contributor Author

@achandrasekar any thoughts/objections? on adding finish_reason attribute to LLM server metrics?

Contributor

This looks good to me. Adding a finish_reason attribute makes sense. It would be useful in a lot of cases including benchmarking where we can see if we generated to the max_length or stopped early due to the EOS token.

@lmolkova lmolkova changed the title Change gen_ai.responmse.finish_reasons type from array to string, use it on metrics Change gen_ai.response.finish_reasons type from array to string, use it on metrics Jul 29, 2024
@lmolkova
Contributor Author

lmolkova commented Sep 3, 2024

I'm going to close this PR and pursue other approaches that work better for batching and multiple choices, outlined here:

#1277 (comment)

@lmolkova lmolkova closed this Sep 3, 2024
Successfully merging this pull request may close these issues.

GenAI: do we need to support multiple finish reasons?