n8nworkflows.io

Evaluate AI Agent Response Relevance using OpenAI and Cosine Similarity

Categories

AI & Machine Learning

Tags

#OpenAI #Cosine Similarity

Created by

Last edited 208 days ago

This n8n template demonstrates how to calculate the evaluation metric "Relevance", which in this scenario measures how relevant the agent's response is to the user's question.

The scoring approach is adapted from the open-source evaluation project RAGAS. You can see the source here: https://github.com/explodinggradients/ragas/blob/main/ragas/src/ragas/metrics/_answer_relevance.py

How it works

This evaluation works best for Q&A agents.
For our scoring, we analyse the agent's response and ask another AI to generate a question from it. This generated question is then compared to the original question using cosine similarity.
A high score indicates that the response is relevant and the agent answered the question successfully, whereas a low score means the agent may have added too much irrelevant information, gone off script, or hallucinated.
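
The comparison step described above can be sketched in a few lines of Python. This is only an illustration of cosine similarity, not the template's actual node code: the function name `cosine_similarity` and the three short vectors are hypothetical stand-ins for the embeddings an embedding model would return for the original question and for the question generated from the agent's response.

```python
import math
from typing import Sequence


def cosine_similarity(a: Sequence[float], b: Sequence[float]) -> float:
    """Return the cosine of the angle between two equal-length vectors."""
    if len(a) != len(b):
        raise ValueError("vectors must have the same length")
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0.0 or norm_b == 0.0:
        # Assumption for this sketch: a zero vector counts as "no similarity".
        return 0.0
    return dot / (norm_a * norm_b)


# Hypothetical toy embeddings (real embeddings have hundreds of dimensions).
original_question = [0.9, 0.1, 0.3]
generated_on_topic = [0.8, 0.2, 0.4]    # question generated from a relevant answer
generated_off_topic = [0.1, 0.9, -0.2]  # question generated from an off-topic answer

on_topic_score = cosine_similarity(original_question, generated_on_topic)
off_topic_score = cosine_similarity(original_question, generated_off_topic)

print(f"on-topic score:  {on_topic_score:.3f}")
print(f"off-topic score: {off_topic_score:.3f}")
```

With these toy values, the question generated from the relevant answer scores much closer to 1 than the one generated from the off-topic answer, which is the signal the metric relies on.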

Requirements

n8n version 1.94+
Check out this Google Sheet for sample data: https://docs.google.com/spreadsheets/d/1YOnu2JJjlxd787AuYcg-wKbkjyjyZFgASYVV0jsij5Y/edit?usp=sharing

zrGeorge Zargaryan