Frontier models to evaluate generative AI. Find and fix AI mistakes at scale, and build more reliable AI apps. Use Selene LLM-as-a-Judge to evaluate outputs and test prompts and models.