DevLifted

#evaluation

2 articles with this tag.

The Impartial Judge: Inside a Production ML Evaluation Harness

intermediate · Machine Learning Basics

A developer's walkthrough of a real ML eval harness — F1, macro averaging, OOS recall, warmup, and p50/p95/p99 latency — and the design decisions behind each.

April 16, 2026 · 12 min read
#evaluation #metrics #f1-score

Semantic Caching & RAGAS Evaluation: Make Your RAG Pipeline Faster and Measurable

intermediate · Natural Language Processing

Learn how to add semantic caching to your RAG pipeline for lower latency and cost, then measure quality with RAGAS evaluation metrics.

April 14, 2026 · 14 min read
#rag #semantic-caching #ragas