Codex的精准度和对细节的关注在正确设置后是*惊人的*。

3作者: ditchfieldcaleb大约 2 个月前原帖
最近,我一直在使用Codex开发一款塔防游戏,部分原因是为了学习游戏开发的工作原理,部分原因是想看看仅用Codex能做到什么,而不进行任何手动编码。我已经有了我的AGENTS.md和CODESTYLE.md,以及六个其他的全大写md文件等,并且正在进行一些重构,以保持代码库的整洁和文件大小的低廉等等。 然后我在我最新重构的ExecPlan中看到这个: --- # 观察 - 观察:重构后,屏幕截图在基线正确重新捕获后像素完全相同。 证据:sha256sum screenshots/before-implementation-x.png screenshots/after-implementation-x.png报告在前后对比的1、2和3对之间的哈希值匹配。 --- 这真是太疯狂了!我从来没有告诉Codex对应用程序的前后屏幕截图进行<i>SHA比较</i>,但我在我的PLANS.md中确实有指示,要求在游戏的网页应用程序上拍摄前后屏幕截图,以确保我们避免前端回归(它使用GPT-Image-2进行分析)。因此,对于不影响前端的更改,当然在游戏开始时在<i>相同时间戳</i>拍摄的屏幕截图之间不应该有任何不同。 但进行显式的SHA比较——这真是……我从来没有想过的事情。太疯狂了。
查看原文
Lately I&#x27;ve been working on a Tower Defense game with Codex, in part to learn how game development works and in part to see how far I can get using <i>just</i> Codex, no manual coding at all. I&#x27;ve got my AGENTS md &amp; my CODESTYLE md &amp; six other ALLCAPS md files etc, and am working on some refactoring to keep the codebase clean &amp; file sizes low, etc.<p>And then I see this in the ExecPlan for my latest refactor:<p>---<p># Observations<p>- Observation: The refactor made the screenshots pixel-identical after the baseline was recaptured correctly.<p>Evidence: sha256sum screenshots&#x2F;before-implementation-x.png screenshots&#x2F;after-implementation-x.png reported matching hashes for before&#x2F;after pairs 1, 2, and 3.<p>---<p>Which is crazy! I&#x27;ve never told Codex to do an <i>sha compare</i> on before&#x2F;after screenshots of the app, but I do have instructions in my PLANS.md to take before &amp; after screenshots of the webapp for the game to make sure we avoid frontend regressions (it uses GPT-Image-2 for analysis). So for non-frontend impacting changes, of course nothing should be different between screenshots taken at <i>identical timestamps</i> into the game start.<p>But doing an explicit SHA compare - that&#x27;s just...not something I would&#x27;ve <i>ever</i> thought of. Wild.