Opik is an end-to-end LLM evaluation platform designed to help AI developers test, ship, and continuously improve LLM-powered applications.