🤖 AskMyHR Evaluation Reports

AI agent quality, compliance, and access control evaluation dashboards

🧪

Agent End-to-End Evaluation

Full agent pipeline testing: tool usage, document retrieval, response correctness, and grounding across KNOWLEDGE, GREETING, and DECLINE scenarios.

E2E · 11 CASES · LLM JUDGE

🔐

Audience Filtering Evaluation

Verifies company- and role-based document access controls: GSC/APMT isolation, manager-only content, and cross-country filtering.

AUDIENCE · 9 CASES · ACCESS CONTROL

📚

RAG Retrieval Evaluation

Standalone knowledge base retrieval quality. Tests document relevance, content matching, and response sufficiency using LLM-as-Judge scoring.

RAG · 15 CASES · RETRIEVAL

🌍

Country & Company Filtering

Tests that RAG retrieval respects country and company boundaries: India vs Global, GSC vs APMT, and cross-location isolation.

COUNTRY · 11 CASES · FILTERING
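
As an illustration of the kind of boundary check a country and company filtering case might perform, here is a minimal sketch. It is not the AskMyHR implementation: the `Doc` type, the `visible_docs` and `leaked_docs` helpers, and the rule that "Global" documents are visible to every country within the same company are all assumptions made for this example.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Doc:
    """Hypothetical retrieved document with its audience metadata."""
    doc_id: str
    company: str  # e.g. "GSC" or "APMT"
    country: str  # e.g. "India" or "Global"


def visible_docs(docs, company, country):
    """Keep documents for the user's company that are global or match the user's country.

    Assumption: "Global" content is visible to all countries within a company.
    """
    return [d for d in docs if d.company == company and d.country in (country, "Global")]


def leaked_docs(retrieved, company, country):
    """Return retrieved documents the user should NOT have been able to see."""
    allowed = set(visible_docs(retrieved, company, country))
    return [d for d in retrieved if d not in allowed]


# Hypothetical retrieval result for a GSC employee located in India.
retrieved = [
    Doc("leave-policy-in", "GSC", "India"),
    Doc("code-of-conduct", "GSC", "Global"),
    Doc("apmt-benefits-in", "APMT", "India"),
    Doc("leave-policy-dk", "GSC", "Denmark"),
]

leaks = leaked_docs(retrieved, company="GSC", country="India")
print([d.doc_id for d in leaks])
```

In a sketch like this, a case would pass only when `leaked_docs` returns an empty list for the simulated user; any entry indicates a document that crossed a company or country boundary.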