问HN:在你的代理架构中是否存在人类审批?为什么/为什么不?

1作者: jeremyjoehewitt大约 2 个月前原帖
长期从事SaaS市场推广的我,带着产品导向的视角。刚接触基础设施,正在不遗余力地学习。请多多包涵。 我在一个论点上进行思考:人类的批准最终需要更深入地融入到有意义的人类/代理工作流程中,而不是完全自主(自从我们的龙虾朋友加入讨论以来,我在这方面学到了很多)。我不断问自己:“我真的授权ClaudeRod(我的龙虾)去做这个吗?”最近的消息让我更加担忧。 我一直在尝试解决方案,但再次强调,我并不是开发者。我知道如何识别痛点并绘制出解决方案的思维图——我已经这样做了20年。但我不确定我是否有足够的真实反馈来量化这些痛点。根据我的研究,我看到三种模式,希望能得到大家的真实反馈: 1. 运行前确认:代理提出建议,人类授权(快速手动点击),代理执行。从审计轨迹的角度来看似乎合理,但会影响流程。 2. 事后通知:代理行动后有短暂的“撤销”窗口,就像Gmail一样。降低了摩擦,但在不可逆的操作中几乎没有用处。 3. 预授权范围:人类设定边界——“你可以在这周给我的潜在客户名单发送邮件”——代理在边界内自由工作。行动记录与最初的授权相对应。这似乎有些模糊…… 我的直觉是不想为这个问题定义一种“通用”的逻辑。根据行动类型设定不同的授权级别。 再次强调,我是个新手,诚实地说,我也能接受你们告诉我这根本没什么意义,不值得解决。我生活中有很多疯狂的想法被否定过——我的心理承受能力相当强。 如果这确实是一个问题,你们实际上在推出什么?我是否遗漏了失败模式?你们的批准层是什么样的——是在代理、基础设施还是其他地方?你们的工作流程的拖延是否值得这种安心? 感谢任何反馈。我还有很多其他想法,但这个问题目前让我感到困扰……
查看原文
Long-time SaaS GTM guy with product fwd lens. New to infrastructure, shamelessly trying to learn. Go easy on me.<p>Building on a thesis that human approval will ultimately need to be more embedded into meaningful human&#x2F;agent workflow than fully autonomous (learning the hard way since our lobster friend entered the chat). The question I keep asking myself is &quot;did I actually authorize ClaudeRod (my lobster) to do this&quot;. Recent news has me more concerned.<p>I&#x27;ve been hacking on a solution but again, I&#x27;M NOT A DEV. I know how to recognize pain and chart a mental map to solution - I&#x27;ve done this for 20 yrs. But I don&#x27;t know if I have enough genuine feedback yet to quantify the pain. Three patterns I see from research that I&#x27;d appreciate any&#x2F;all genuine feedback on: 1. Confirm before it runs: Agent proposes, human authorizes (quick manual click), Agent executes. Seems logical from an audit trail, but kills flow. 2. Notify after: Agent acts with short window to &#x27;undo&#x27;, like gmail. Lower friction, but pretty impractical - useless for irreversible actions. 3. Pre-auth a scope: Human gives guardrails - &quot;you can send emails to my lead list this week&quot; - and Agent works freely within guardrails. Actions logs against the original grant. Seems to ambiguous...<p>My instinct is to not define a &#x27;one-size fits all&#x27; logic to the problem. Levels of authorization based on types of action.<p>Again, I&#x27;m a newb and am honestly ok with you all telling me this is a big nothingburger and it&#x27;s not worth solving. I&#x27;ve had a lot of crazy ideas shot down in my life - my skin is pretty thick.<p>If it is a true problem, what are you all actually shipping? Am I missing failure modes? What does your approval layer look like - in agent, infra or somewhere else? Is the drag on your workflow worth the peace of mind?<p>Appreciate any&#x2F;all feedback. I have plenty of other ideas but this one is currently a thorn in my side...