HackerNews中文版

首先让我澄清一下。我并不讨厌大型语言模型（LLMs），我会向它们提问，触发代理执行一些我大致知道最终目标的任务，并对应用程序的小部分进行分析。话虽如此，每当我给它一些稍微复杂的任务，比如在单个文件脚本中执行某些操作时，它总是让我失望。要么代码质量很差，要么方法糟糕得像是一个完全不知道该怎么做的人，或者它完全开始做我在初始提示中明确表示不想要的事情。有时，当这种情况发生时，我会请我的LLM爱好者的同事来帮忙，但他们也无法“修复”它，而我却被认为是因为“提示错误”或“缺乏正确的上下文”而做错了。我创建了很多“Agents.md”文件，把文件放入上下文窗口……但都没有用。当我需要进行全新项目或概念验证（PoC）时，它的反应速度很快，但将其应用于现有的大型应用程序时却失败了。我唯一感到“高效”的时候，是在使用我完全不熟悉的语言或技术时，但我也不知道最终得到的功能代码是否存在我不知道的问题。你们当中有谁真的在使用LLMs为大型企业应用程序创建完整的功能吗？

查看原文

Let me be clear first. I don't dislike LLMs, I query them, trigger agents to do stuff where I kind of know what the end goal is and to make analisys of small parts of an application.That said, everytime I give it something a little more complex that do something in a single file script it fails me horribly. Either the code is really bad, or the approach is as bad a someone who doesn't really know what to do or it plains start doing things that I explicitly said not to do in the initial prompt.I have sometimes asked my LLM fan's coworkers to come and help when that happens and they also are not able to "fix it", but somehow I am the one doing it wrong due "wrong prompt" or "lack of correct context".I have created a lot of "Agents.md" files, drop files into the context window... Nothing.When I need to do green field stuff, or PoCs it delivers fast, but then applying it to work inside an existent big application fails.The only place where I feel as "productive" as I heard from other people is when I do stuff in languages or technologies I don't know at all, but then again, I also don't know if that functional code I get at the end is broken in things I am not aware of.Are any of you guys really using LLMs to create full features in big enterprise apps?

你们中有没有人使用大型语言模型（LLMs）来为大型企业应用程序创建完整的功能？