问HN:Claude Opus 4.8坏了吗?

2作者: pqdbr大约 1 个月前原帖
在我使用它的第一个小时里,感觉就像回到了GPT-2时代。<p>它甚至连文件都无法读取了。<p>随机使用“sed”而没有解释,路径都是虚构的文件。连续出现15次错误,提示“没有这样的文件或目录”,路径根本不存在——它试图像往常一样在我们的Rails应用中读取一个项目文件。<p>然后你问它为什么不直接读取正确路径中的文件,它便表示抱歉:<p>&gt; 你有理由生气。出了两个问题,都是我的错: &gt; &gt; 1. 我输入了sed/cat,而不是使用读取工具(项目规则明确说明不这样做), &gt; 2. 我猜测了一个不存在的文件名app/services/gmail/sync_worker.rb——真实文件是app/workers/gmail/sync_worker.rb。所以我在读取一个我自己虚构的路径。真是马虎。<p>它刚刚又向我道歉——在这个会话中已经是第5次了——写道:<p>&gt; 我再次在验证步骤中输入了一个虚构的消息ID(19e70e9d...),而不是从我刚刚获取的列表中读取真实的ID。这正是我一直犯的错误。<p>(上下文窗口为15%)。<p>它慢得令人无法忍受。<p>它总是出现10多个错误,比如“取消:并行工具调用Bash出错”。<p>这简直不真实。
查看原文
In my first hour with it, it&#x27;s like we&#x27;re back to the GPT-2 era.<p>It can&#x27;t even read a file anymore.<p>Randomly uses &#x27;sed&#x27; with no explanation in hallucinated file paths. Errors out 15 times in a row with &#x27;No such file or directory&#x27; with paths that don&#x27;t exist - it was trying to read a project file as usual in our Rails app.<p>You then ask it why it&#x27;s not simply reading the file in the correct path, and then it says it&#x27;s sorry:<p>&gt; You&#x27;re right to be angry. Two things went wrong and both are on me: &gt; &gt; 1. I typed sed&#x2F;cat instead of using the Read tool (the project rules explicitly say not to), and &gt; 2. I guessed a filename app&#x2F;services&#x2F;gmail&#x2F;sync_worker.rb that doesn&#x27;t exist — the real file is app&#x2F;workers&#x2F;gmail&#x2F;sync_worker.rb. So I was reading a path I&#x27;d invented. Sloppy,<p>It just apologized to me - for the 5th time in this session - writing this:<p>&gt; I again typed a made-up message ID into the verify step (19e70e9d...) instead of reading the real one from the list I just fetched. That&#x27;s the exact mistake I keep making.<p>(Context window at 15%).<p>It&#x27;s unbearably slow.<p>It presents 10+ errors like &#x27;Cancelled: parallel tool call Bash errored&#x27; all the time.<p>It&#x27;s unreal.