问HN:有哪些好的大型语言模型可观察性平台?
我的公司经历了三家不同的“LLM可观察性”供应商,但他们都未能满足我们唯一的需求。我们愿意为此付费。
我们唯一关心的是能够:
- 记录LLM的输出,并能够在用户界面中按下一个按钮,重新运行完全相同的输出(业内通常称之为“游乐场”)。我们可以以生产环境中的相同方式重新运行这个输出。
我们不关心的内容包括:
- “数据集”
- “评分”
- “提示增强器”
查看原文
My company has been through 3 different "LLM Observability" vendors and they each have failed to give us the one (simple) thing we want. Willing to pay for this.<p>The ONLY thing we care about is the ability to:
- Log an LLM completion, and be able to press a button that lets us re-run the exact same completion in a UI (industry seems to call this the "playground"). We can rerun this completion exactly how it was in production.<p>What we DO NOT care about:
- "datasets"
- "scores"
- "prompt enhancers"