Contributor   /     LLM Evaluation: Opik with Gideon Mendels

Description

Gideon Mendels (Github: @gidim) is the co-founder and CEO of Comet, the end-to-end model evaluation platform for AI developers. Among the tools in the Comet ecosystem is Opik, an open-source solution for evaluating, testing and monitoring LLM applications. Opik allows users to log traces and spans, define and compute evaluation metrics, score LLM outputs, compare performance across app versions, and more. As a true open-source project, its full featureset is available for use by anyone, completely free. Contributor is looking for a community manager! If you want to know more, shoot us an email at eric@scalevp.com. Subscribe to Contributor on Substack for email notifications! In this episode we discuss: How Opik’s popularity blew up beyond the Comet team’s expectations Why CI/CD is especially important in an end-to-end platform Gideon’s “severe allergy” to “fake open-source” offerings Why the number of dedicated machine learning engineers is actually going down Eric’s thoughts on what it means for venture capital to invest in the LLM space Links: Opik Comet

Summary

Gideon Mendels (Github: @gidim) is the co-founder and CEO of Comet, the end-to-end model evaluation platform for AI developers. Among the tools in the Comet ecosystem is Opik, an open-source solution for evaluating, testing and monitoring LLM applications. Opik allows users to log traces and spans, define and compute evaluation metrics, score LLM outputs, compare performance across app versions, and more. As a true open-source project, its full featureset is available for use by anyone, completely free.

Contributor is looking for a community manager! If you want to know more, shoot us an email at eric@scalevp.com.

Subscribe to Contributor on Substack for email notifications!

In this episode we discuss:

  • How Opik’s popularity blew up beyond the Comet team’s expectations

  • Why CI/CD is especially important in an end-to-end platform

  • Gideon’s “severe allergy” to “fake open-source” offerings

  • Why the number of dedicated machine learning engineers is actually going down

  • Eric’s thoughts on what it means for venture capital to invest in the LLM space

Links:

Subtitle
Duration
37:59
Publishing date
2025-01-15 02:00
Link
https://www.contributor.fyi/opik
Contributors
  Eric Anderson
author  
Enclosures
https://aphid.fireside.fm/d/1437767933/657ccb75-c55f-4363-8892-f45dd46caf80/1d5f151a-1ff9-4f99-a1e2-f6c1082a67c3.mp3
audio/mpeg

Shownotes

Gideon Mendels (Github: @gidim) is the co-founder and CEO of Comet, the end-to-end model evaluation platform for AI developers. Among the tools in the Comet ecosystem is Opik, an open-source solution for evaluating, testing and monitoring LLM applications. Opik allows users to log traces and spans, define and compute evaluation metrics, score LLM outputs, compare performance across app versions, and more. As a true open-source project, its full featureset is available for use by anyone, completely free.

Contributor is looking for a community manager! If you want to know more, shoot us an email at eric@scalevp.com.

Subscribe to Contributor on Substack for email notifications!

In this episode we discuss:

  • How Opik’s popularity blew up beyond the Comet team’s expectations

  • Why CI/CD is especially important in an end-to-end platform

  • Gideon’s “severe allergy” to “fake open-source” offerings

  • Why the number of dedicated machine learning engineers is actually going down

  • Eric’s thoughts on what it means for venture capital to invest in the LLM space

Links: