n8nworkflows.io

Evaluate RAG Response Accuracy with OpenAI: Document Groundedness Metric

Download [20.8KB]

Nodes

+8

Categories

AI & Machine Learning

Tags

Created by

Last edited 208 days ago

This n8n template demonstrates how to calculate the evaluation metric "RAG document groundedness" which in this scenario, measures the ability to provide or reference information included only in retrieved vector store documents.

The scoring approach is adapted from https://cloud.google.com/vertex-ai/generative-ai/docs/models/metrics-templates#pointwise_groundedness

How it works

This evaluation works best for an agent that requires document retrieval from a vector store or similar source.
For our scoring, we need to collect the agent's response and the documents retrieved and use an LLM to assess if the former is based off the latter.
A key factor is to look out information in the response which is not mentioned in the documents.
A high score indicates LLM adherence and alignment whereas a low score could signal inadequate prompt or model hallucination.

Requirements

n8n version 1.94+
Check out this Google Sheet for a sample data https://docs.google.com/spreadsheets/d/1YOnu2JJjlxd787AuYcg-wKbkjyjyZFgASYVV0jsij5Y/edit?usp=sharing

You may also like

Evaluate AI Agent Response Correctness with OpenAI and RAGAS Methodology

Evaluate AI Agent Response Correctness with OpenAI and RAGAS Methodology

Evaluate AI Agent Response Relevance using OpenAI and Cosine Similarity

Evaluate AI Agent Response Relevance using OpenAI and Cosine Similarity

Evaluations Metric: Answer Similarity

Evaluations Metric: Answer Similarity

New to n8n?

Need help building new n8n workflows? Process automation for you or your company will save you time and money, and it's completely free!

Trending

Generate AI viral videos with NanoBanana & VEO3, shared on socials via Blotato

Generate AI viral videos with NanoBanana & VEO3, shared on socials via Blotato

Generate & Publish Professional Video Ads with Veo 3, Gemini & Creatomate

Generate & Publish Professional Video Ads with Veo 3, Gemini & Creatomate

Build a Multichannel Customer Support AI Assistant with Chatwoot & OpenRouter

Build a Multichannel Customer Support AI Assistant with Chatwoot & OpenRouter

zrGeorge Zargaryan