Evaluating the Effectiveness of LLM-Evaluators (aka LLM-as-Judge)Use cases, techniques, alignment, finetuning, and critiques against LLM-evaluators.·eugeneyan.com·Sep 9, 2024Evaluating the Effectiveness of LLM-Evaluators (aka LLM-as-Judge)
RAG Evaluation - Hugging Face Open-Source AI CookbookWe’re on a journey to advance and democratize artificial intelligence through open source and open science.#RAG·huggingface.co·Apr 26, 2024RAG Evaluation - Hugging Face Open-Source AI Cookbook
LlamaIndex: RAG Evaluation Showdown with GPT-4 vs. Open-Source Prometheus Model — LlamaIndex, Data Framework for LLM ApplicationsLlamaIndex is a simple, flexible data framework for connecting custom data sources to large language models (LLMs).·llamaindex.ai·May 22, 2024LlamaIndex: RAG Evaluation Showdown with GPT-4 vs. Open-Source Prometheus Model — LlamaIndex, Data Framework for LLM Applications
Using LLM-as-a-judge 🧑⚖️ for an automated and versatile evaluation - Hugging Face Open-Source AI CookbookWe’re on a journey to advance and democratize artificial intelligence through open source and open science.·huggingface.co·May 22, 2024Using LLM-as-a-judge 🧑⚖️ for an automated and versatile evaluation - Hugging Face Open-Source AI Cookbook
RAG Evaluation - Hugging Face Open-Source AI CookbookWe’re on a journey to advance and democratize artificial intelligence through open source and open science.#RAG·huggingface.co·May 22, 2024RAG Evaluation - Hugging Face Open-Source AI Cookbook