问HN:在你的代理架构中是否存在人类审批?为什么/为什么不?
长期从事SaaS市场推广的我,带着产品导向的视角。刚接触基础设施,正在不遗余力地学习。请多多包涵。
我在一个论点上进行思考:人类的批准最终需要更深入地融入到有意义的人类/代理工作流程中,而不是完全自主(自从我们的龙虾朋友加入讨论以来,我在这方面学到了很多)。我不断问自己:“我真的授权ClaudeRod(我的龙虾)去做这个吗?”最近的消息让我更加担忧。
我一直在尝试解决方案,但再次强调,我并不是开发者。我知道如何识别痛点并绘制出解决方案的思维图——我已经这样做了20年。但我不确定我是否有足够的真实反馈来量化这些痛点。根据我的研究,我看到三种模式,希望能得到大家的真实反馈:
1. 运行前确认:代理提出建议,人类授权(快速手动点击),代理执行。从审计轨迹的角度来看似乎合理,但会影响流程。
2. 事后通知:代理行动后有短暂的“撤销”窗口,就像Gmail一样。降低了摩擦,但在不可逆的操作中几乎没有用处。
3. 预授权范围:人类设定边界——“你可以在这周给我的潜在客户名单发送邮件”——代理在边界内自由工作。行动记录与最初的授权相对应。这似乎有些模糊……
我的直觉是不想为这个问题定义一种“通用”的逻辑。根据行动类型设定不同的授权级别。
再次强调,我是个新手,诚实地说,我也能接受你们告诉我这根本没什么意义,不值得解决。我生活中有很多疯狂的想法被否定过——我的心理承受能力相当强。
如果这确实是一个问题,你们实际上在推出什么?我是否遗漏了失败模式?你们的批准层是什么样的——是在代理、基础设施还是其他地方?你们的工作流程的拖延是否值得这种安心?
感谢任何反馈。我还有很多其他想法,但这个问题目前让我感到困扰……
查看原文
Long-time SaaS GTM guy with product fwd lens. New to infrastructure, shamelessly trying to learn. Go easy on me.<p>Building on a thesis that human approval will ultimately need to be more embedded into meaningful human/agent workflow than fully autonomous (learning the hard way since our lobster friend entered the chat). The question I keep asking myself is "did I actually authorize ClaudeRod (my lobster) to do this". Recent news has me more concerned.<p>I've been hacking on a solution but again, I'M NOT A DEV. I know how to recognize pain and chart a mental map to solution - I've done this for 20 yrs. But I don't know if I have enough genuine feedback yet to quantify the pain. Three patterns I see from research that I'd appreciate any/all genuine feedback on:
1. Confirm before it runs: Agent proposes, human authorizes (quick manual click), Agent executes. Seems logical from an audit trail, but kills flow.
2. Notify after: Agent acts with short window to 'undo', like gmail. Lower friction, but pretty impractical - useless for irreversible actions.
3. Pre-auth a scope: Human gives guardrails - "you can send emails to my lead list this week" - and Agent works freely within guardrails. Actions logs against the original grant. Seems to ambiguous...<p>My instinct is to not define a 'one-size fits all' logic to the problem. Levels of authorization based on types of action.<p>Again, I'm a newb and am honestly ok with you all telling me this is a big nothingburger and it's not worth solving. I've had a lot of crazy ideas shot down in my life - my skin is pretty thick.<p>If it is a true problem, what are you all actually shipping? Am I missing failure modes? What does your approval layer look like - in agent, infra or somewhere else? Is the drag on your workflow worth the peace of mind?<p>Appreciate any/all feedback. I have plenty of other ideas but this one is currently a thorn in my side...