Agentic QA - 开源中间件,用于对循环中的代理进行模糊测试

2作者: Saurabh_Kumar_2 个月前原帖
我之所以构建这个,是因为我看到我的 LangChain 代理在一夜之间因无限循环而消耗了大约 50 美元的 OpenAI 额度。 这是一个中间件 API,充当“飞行模拟器”。你可以将代理的提示发送给它,它会进行对抗性攻击(红队测试),以在部署之前捕捉循环和个人信息泄露。 代码和仓库: [https://github.com/Saurabh0377/agentic-qa-api](https://github.com/Saurabh0377/agentic-qa-api) 在线演示: [https://agentic-qa-engine.onrender.com/docs](https://agentic-qa-engine.onrender.com/docs) 欢迎反馈你们见过的其他失败模式!
查看原文
I built this because I watched my LangChain agent burn ~$50 in OpenAI credits overnight due to an infinite loop.<p>It&#x27;s a middleware API that acts as a &#x27;Flight Simulator&#x27;. You send it your agent&#x27;s prompt, and it runs adversarial attacks (Red Teaming) to catch loops and PII leaks before deployment.<p>Code &amp; Repo: https:&#x2F;&#x2F;github.com&#x2F;Saurabh0377&#x2F;agentic-qa-api Live Demo: https:&#x2F;&#x2F;agentic-qa-engine.onrender.com&#x2F;docs<p>Would love feedback on other failure modes you&#x27;ve seen!