Model BreakdownsFrance
OpenDeepMind's FACTS Benchmark Suite: a claim-level framework and quick-start checklist for evaluating LLM factuality
DeepMind's FACTS Benchmark Suite evaluates LLM factuality with claim-level tests, error taxonomies and provenance checks. Includes a 5-item quick-start checklist and decision framework.