HackerNews中文版

TDD（测试驱动开发）已经成为AI编码代理的默认工作流程，原因有很多。<p>我最近开始做的一件我非常喜欢的事情是，在构建一个功能时，我会编写验收标准，然后确保代理拥有完成质量保证（QA）和自行验证所需的一切。与之前最大的不同在于，我不再自己进行质量保证（虽然我仍然会做，但几乎总是有效的），而是让代理能够自行执行整个流程并验证验收标准。这需要更多的工作，因为我通常需要设置MCP（最小可行产品）、账户、凭证等，但最终的输出效果更好。<p>我的问题是……是否有人对此有好的方法？这是一个常见的工作流程吗？我看到一些关于这个话题的讨论称为“工具工程”；有些称为“反馈循环工程”；还有一些初创公司（如Shiplight AI、Autosana、Ranger）。我想找到更多的讨论和最佳实践，看看其他人是如何思考这个问题的。

查看原文

TDD has become the default workflow for AI coding agents, for a lot of reasons.<p>Something I recently started doing that I really like is, when building a feature, I write acceptance criteria and then I make sure the agents have everything they need to do QA and verify it themselves. The main difference from before is that instead of QAing it myself (which I still do but it's almost always working), the agent can exercise the whole flow itself and verify the acceptance criteria. It takes more work because I often have to set up MCPs, accounts, credentials, etc - but the output is better.<p>My question... do people have good approaches to this? Is this a common workflow? I see some of this conversation called harness engineering; I see some called feedback-loop engineering; I see some startups (Shiplight AI, Autosana, Ranger). I'm trying to find more discussion & best practices here, and see how others are thinking about this.

问HN：以代理驱动的质量保证（QA）是否存在？