Updated Cookbook: Example for Fetching Scores from Langfuse by Sohammhatre10 · Pull Request #857 · langfuse/langfuse-docs

Sohammhatre10 · 2024-10-14T08:04:59Z

Description

This update provides an example of using the fetch_scores() function from Langfuse to retrieve evaluation metrics. The example integrates UpTrain and Ragas for model evaluation and demonstrates how to log and fetch scores within Langfuse as mentioned in langfuse/langfuse#3505

Key Features

Evaluation with UpTrain and Ragas:
- Provides examples for evaluating context relevance, factual accuracy, response completeness, context precision, faithfulness, and answer relevancy..
Fetching Scores:
- Shows how to retrieve and filter scores using fetch_scores_from_langfuse.
Correlation Matrix Visualization:
- Adds a section that calculates and visualizes the correlation between UpTrain and Ragas evaluation scores using a heatmap.

Important

Adds an example for using Langfuse to fetch scores, evaluate models with UpTrain and Ragas, and visualize results using a correlation matrix.

Behavior:
- Adds example for using fetch_scores() from Langfuse to retrieve evaluation metrics.
- Demonstrates integration with UpTrain and Ragas for model evaluation.
- Shows how to log and fetch scores within Langfuse.
Visualization:
- Includes a section for calculating and visualizing correlation between evaluation scores using a heatmap.
Misc:
- Minor whitespace changes in dspy.md, instructor.md, example-javascript.md, example-python-langgraph.md, example-python-instrumentation-module.md, example-python.md, example-vercel-ai.md, example_external_evaluation_pipelines.md, integration_dspy.md, integration_instructor.md, integration_langgraph.md, integration_llama-index_instrumentation.md, integration_llama_index_posthog_mistral.md, integration_mirascope.md, integration_mistral_sdk.md, integration_ollama.md, integration_openai_structured_output.md, example-langchain.md, js_integration_langchain.md, js_tracing_example_vercel_ai_sdk.md, prompt_management_langchain.md.

^{This description was created by}^{for 02ebc24. It will automatically update as commits are pushed.}

review-notebook-app · 2024-10-14T08:05:04Z

Check out this pull request on

See visual diffs & provide feedback on Jupyter Notebooks.

Powered by ReviewNB

vercel · 2024-10-14T08:05:04Z

@Sohammhatre10 is attempting to deploy a commit to the langfuse Team on Vercel.

A member of the Team first needs to authorize it.

CLAassistant · 2024-10-14T08:05:07Z

Thank you for your submission! We really appreciate it. Like many open source projects, we ask that you all sign our Contributor License Agreement before we can accept your contribution.
0 out of 2 committers have signed the CLA.

❌ Your Name
❌ Sohammhatre10

Your Name seems not to be a GitHub user. You need a GitHub account to be able to sign the CLA. If you have already a GitHub account, please add the email address used for this commit to your account.
_{You have signed the CLA already but the status is still pending? Let us recheck it.}

ellipsis-dev

👍 Looks good to me! Reviewed everything up to 02ebc24 in 42 seconds

More details

Looked at 1680 lines of code in 26 files
Skipped 3 files when reviewing.
Skipped posting 1 drafted comments based on config settings.

1. pages/docs/integrations/dspy.md:242

Draft comment:
Remove trailing whitespace for cleaner code. This issue is present in multiple files, such as example-javascript.md, example-python-langgraph.md, example-python-instrumentation-module.md, example-python.md, example-vercel-ai.md, external-evaluation-pipelines.md, integration_dspy.md, integration_instructor.md, integration_langgraph.md, integration_llama-index_instrumentation.md, integration_llama_index_posthog_mistral.md, integration_mirascope.md, integration_mistral_sdk.md, integration_ollama.md, integration_openai_structured_output.md, js_integration_langchain.md, js_tracing_example_vercel_ai_sdk.md, prompt_management_langchain.md.
Reason this comment was not posted:
Confidence changes required: 50%
The PR introduces a new example for fetching scores from Langfuse, but there are several instances of trailing whitespace in the markdown files. These should be removed for cleaner code.

Workflow ID: wflow_ON6OiLFA8uvvhNsK

You can customize Ellipsis with 👍 / 👎 feedback, review rules, user-specific overrides, quiet mode, and more.

greptile-apps

Disclaimer: Experimental PR review

PR Summary

This pull request adds a comprehensive example of using the fetch_scores() function from Langfuse to retrieve and analyze evaluation metrics, integrating UpTrain and Ragas for model evaluation.

Added pages/docs/scores/example_usage_of_fetch_score.md with detailed code snippets for setting up, evaluating models, logging scores, and visualizing correlations
Updated pages/guides/cookbook/example_external_evaluation_pipelines.md with a guide on creating external evaluation pipelines using Langfuse, including synthetic data creation and custom evaluations
Made minor formatting and content improvements across multiple integration cookbooks (DSPy, Instructor, LangGraph, etc.) to enhance readability and consistency
Updated various Langchain examples to demonstrate better integration with Langfuse for tracing and prompt management

_{26 file(s) reviewed, 7 comment(s)}
_{Edit PR Review Bot Settings | Greptile}

marcklingen

thanks for the contribution. It seems like you mostly want to showcase correlation analysis of different scores in Langfuse (which is a good notebook example). Are you sure that your example correlates the scores on a single trace basis for the analysis at the bottom of this notebook

Sohammhatre10 · 2024-11-19T05:39:00Z

@marcklingen Yupp, this was based on a single trace, and the scores were fetched accordingly. Haven't used any specifics for traces, but this was the first trace I created, so it defaulted to the first trace. Should I add more specificity for a single trace? Apologies for the late reply.

marcklingen · 2025-02-27T20:35:21Z

@jannikmaierhoefer can you review this?

Your Name and others added 11 commits October 14, 2024 12:36

Updated cookbook for fetch scores example

0142cec

Update example_usage_of_fetch_scores.md

29da93d

Add files via upload

5a7f11a

Delete cookbook/example_usage_of_fetch_scores_files directory

4a87b59

Delete public/images/example_usage_of_fetch_scores_files directory

14ae567

Add files via upload

8644a7e

Update example_usage_of_fetch_scores.md

283c3d8

Update example_usage_of_fetch_scores.md

c14b9ac

Delete cookbook/example_usage_of_fetch_scores.md

32afedf

Create example_usage_of_fetch_score

5a37156

Rename example_usage_of_fetch_score to example_usage_of_fetch_score.md

02ebc24

ellipsis-dev Bot reviewed Oct 14, 2024

View reviewed changes

Sohammhatre10 mentioned this pull request Oct 14, 2024

docs: example python notebook on how to do correlation analysis on scores added to langfuse langfuse/langfuse#3505

Closed

greptile-apps Bot reviewed Oct 14, 2024

View reviewed changes

Comment thread pages/docs/integrations/mirascope/example-python.md Outdated

Merge branch 'main' into main

66b24d9

marcklingen reviewed Nov 16, 2024

View reviewed changes

marcklingen requested a review from jannikmaierhoefer February 27, 2025 20:35

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Updated Cookbook: Example for Fetching Scores from Langfuse#857

Updated Cookbook: Example for Fetching Scores from Langfuse#857
Sohammhatre10 wants to merge 12 commits intolangfuse:mainfrom
Sohammhatre10:main

Sohammhatre10 commented Oct 14, 2024 •

edited by ellipsis-dev Bot

Loading

Uh oh!

review-notebook-app Bot commented Oct 14, 2024

Uh oh!

vercel Bot commented Oct 14, 2024

Uh oh!

CLAassistant commented Oct 14, 2024 •

edited

Loading

Uh oh!

ellipsis-dev Bot left a comment

Uh oh!

greptile-apps Bot left a comment

Uh oh!

Uh oh!

marcklingen left a comment

Uh oh!

Sohammhatre10 commented Nov 19, 2024

Uh oh!

marcklingen commented Feb 27, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Conversation

Sohammhatre10 commented Oct 14, 2024 • edited by ellipsis-dev Bot Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Description

Key Features

Uh oh!

review-notebook-app Bot commented Oct 14, 2024

Uh oh!

vercel Bot commented Oct 14, 2024

Uh oh!

CLAassistant commented Oct 14, 2024 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

ellipsis-dev Bot left a comment

Choose a reason for hiding this comment

Uh oh!

greptile-apps Bot left a comment

Choose a reason for hiding this comment

Disclaimer: Experimental PR review

PR Summary

Uh oh!

Uh oh!

marcklingen left a comment

Choose a reason for hiding this comment

Uh oh!

Sohammhatre10 commented Nov 19, 2024

Uh oh!

marcklingen commented Feb 27, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Sohammhatre10 commented Oct 14, 2024 •

edited by ellipsis-dev Bot

Loading

CLAassistant commented Oct 14, 2024 •

edited

Loading