fix(bedrock): utilize invocation metrics from response body for AI21, Anthropic, Meta models when available to record usage on spans #1286
Conversation
Great work @aannirajpatel, thanks so much! I've been meaning to do that for a while. Left a small comment, and there's a small lint issue to fix 🙏
```
@@ -216,8 +216,18 @@ def _set_anthropic_completion_span_attributes(span, request_body, response_body):
    )

    if Config.enrich_token_usage:
        prompt_tokens = _count_anthropic_tokens([request_body.get("prompt")])
```
The reason we put this under `if Config.enrich_token_usage` is that `_count_anthropic_tokens` is expensive to run and we want to give users an option to disable it. If you're getting the data from the response, there's no need to put it under this `if`.
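Concretely, the suggested pattern looks something like the sketch below; the helper name `_record_anthropic_usage`, the span-attribute plumbing, and the `amazon-bedrock-invocationMetrics` response field are assumptions drawn from the PR description rather than the merged code:

```python
def _record_anthropic_usage(span, request_body, response_body):
    metrics = response_body.get("amazon-bedrock-invocationMetrics")
    if metrics is not None:
        # Actual counts reported by Bedrock in the response body:
        # no config gate needed.
        prompt_tokens = metrics.get("inputTokenCount")
        completion_tokens = metrics.get("outputTokenCount")
    elif Config.enrich_token_usage:
        # Fallback: local tokenization is expensive, so it stays opt-in.
        prompt_tokens = _count_anthropic_tokens([request_body.get("prompt")])
        completion_tokens = _count_anthropic_tokens([response_body.get("completion")])
    else:
        return

    span.set_attribute(SpanAttributes.LLM_USAGE_PROMPT_TOKENS, prompt_tokens)
    span.set_attribute(SpanAttributes.LLM_USAGE_COMPLETION_TOKENS, completion_tokens)
    span.set_attribute(
        SpanAttributes.LLM_USAGE_TOTAL_TOKENS, prompt_tokens + completion_tokens
    )
```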
```
@@ -361,6 +380,17 @@ def _set_llama_span_attributes(span, request_body, response_body):
        span, SpanAttributes.LLM_REQUEST_MAX_TOKENS, request_body.get("max_gen_len")
    )

    if Config.enrich_token_usage and response_body.get("prompt_token_count") is not None and response_body.get("generation_token_count") is not None:
```
Same here, no need for the `Config.enrich_token_usage` check.
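Applied to the Llama path, where the counts arrive in the response body itself, the suggestion would leave roughly this (field names per the diff above; the span-attribute calls are illustrative):

```python
prompt_tokens = response_body.get("prompt_token_count")
completion_tokens = response_body.get("generation_token_count")
if prompt_tokens is not None and completion_tokens is not None:
    # No expensive work here, so nothing to hide behind Config.enrich_token_usage.
    span.set_attribute(SpanAttributes.LLM_USAGE_PROMPT_TOKENS, prompt_tokens)
    span.set_attribute(SpanAttributes.LLM_USAGE_COMPLETION_TOKENS, completion_tokens)
    span.set_attribute(
        SpanAttributes.LLM_USAGE_TOTAL_TOKENS, prompt_tokens + completion_tokens
    )
```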
Thanks for the pointers, Nir! I've addressed the lint issue and consolidated the usage-attribute logic into a function call shared by the ai21, anthropic, and meta paths. I also implemented similar logic for Cohere based on their API documentation for Command R and related models. However, when I ran a local test, the model did not return token counts as the documentation suggested it would, so I've wrapped that logic in a try/except but kept it in place (an O(1) hit at worst, so not much to lose).
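A rough sketch of what that consolidation could look like; the helper names here are hypothetical, and the Cohere field names follow their Command R API docs, which the local test above did not reproduce, hence the try/except:

```python
def _record_usage_to_span(span, prompt_tokens, completion_tokens):
    # Hypothetical shared helper for the ai21, anthropic, cohere, and meta paths.
    span.set_attribute(SpanAttributes.LLM_USAGE_PROMPT_TOKENS, prompt_tokens)
    span.set_attribute(SpanAttributes.LLM_USAGE_COMPLETION_TOKENS, completion_tokens)
    span.set_attribute(
        SpanAttributes.LLM_USAGE_TOTAL_TOKENS, prompt_tokens + completion_tokens
    )


def _set_cohere_usage_attributes(span, response_body):
    # Cohere's docs suggest counts under meta.billed_units, but a local test
    # did not return them, so fail soft rather than break the span.
    try:
        billed = response_body["meta"]["billed_units"]
        _record_usage_to_span(
            span, int(billed["input_tokens"]), int(billed["output_tokens"])
        )
    except (KeyError, TypeError):
        pass
```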
…c, and meta models, add unit test for ai21 model instrumentation
…thub.com/aannirajpatel/openllmetry into fix-bedrock-instrumentation-token-counts
Force-pushed from aabd1e4 to a3cddea.
Great work @aannirajpatel, thank you so much for this!
TL;DR: report actual token counts, when available, when instrumenting Bedrock Anthropic and AI21 models. Add a test to cover instrumentation for Meta's Llama models on Bedrock.
`feat(instrumentation): ...` or `fix(instrumentation): ...`