问HN:有哪些好的大型语言模型可观察性平台?

2作者: seany6221 天前原帖
我的公司经历了三家不同的“LLM可观察性”供应商,但他们都未能满足我们唯一的需求。我们愿意为此付费。 我们唯一关心的是能够: - 记录LLM的输出,并能够在用户界面中按下一个按钮,重新运行完全相同的输出(业内通常称之为“游乐场”)。我们可以以生产环境中的相同方式重新运行这个输出。 我们不关心的内容包括: - “数据集” - “评分” - “提示增强器”
查看原文
My company has been through 3 different &quot;LLM Observability&quot; vendors and they each have failed to give us the one (simple) thing we want. Willing to pay for this.<p>The ONLY thing we care about is the ability to: - Log an LLM completion, and be able to press a button that lets us re-run the exact same completion in a UI (industry seems to call this the &quot;playground&quot;). We can rerun this completion exactly how it was in production.<p>What we DO NOT care about: - &quot;datasets&quot; - &quot;scores&quot; - &quot;prompt enhancers&quot;