OpenAI may be training on Responses API data.
The quote from their Chief Scientist in the official documentation is quite suspicious:<p><pre><code> the hidden chain of thought allows us to “read the mind” of the model and understand its thought process. For example, in the future we may wish to monitor the chain of thought for signs of manipulating the user.
</code></pre>
Even if they don't train on it, they are definitely reading the reasoning tokens.<p>https://developers.openai.com/blog/responses-api