你们中有没有人使用大型语言模型(LLMs)来为大型企业应用程序创建完整的功能?
首先让我澄清一下。我并不讨厌大型语言模型(LLMs),我会向它们提问,触发代理执行一些我大致知道最终目标的任务,并对应用程序的小部分进行分析。
话虽如此,每当我给它一些稍微复杂的任务,比如在单个文件脚本中执行某些操作时,它总是让我失望。要么代码质量很差,要么方法糟糕得像是一个完全不知道该怎么做的人,或者它完全开始做我在初始提示中明确表示不想要的事情。
有时,当这种情况发生时,我会请我的LLM爱好者的同事来帮忙,但他们也无法“修复”它,而我却被认为是因为“提示错误”或“缺乏正确的上下文”而做错了。
我创建了很多“Agents.md”文件,把文件放入上下文窗口……但都没有用。
当我需要进行全新项目或概念验证(PoC)时,它的反应速度很快,但将其应用于现有的大型应用程序时却失败了。
我唯一感到“高效”的时候,是在使用我完全不熟悉的语言或技术时,但我也不知道最终得到的功能代码是否存在我不知道的问题。
你们当中有谁真的在使用LLMs为大型企业应用程序创建完整的功能吗?
查看原文
Let me be clear first. I don't dislike LLMs, I query them, trigger agents to do stuff where I kind of know what the end goal is and to make analisys of small parts of an application.<p>That said, everytime I give it something a little more complex that do something in a single file script it fails me horribly. Either the code is really bad, or the approach is as bad a someone who doesn't really know what to do or it plains start doing things that I explicitly said not to do in the initial prompt.<p>I have sometimes asked my LLM fan's coworkers to come and help when that happens and they also are not able to "fix it", but somehow I am the one doing it wrong due "wrong prompt" or "lack of correct context".<p>I have created a lot of "Agents.md" files, drop files into the context window... Nothing.<p>When I need to do green field stuff, or PoCs it delivers fast, but then applying it to work inside an existent big application fails.<p>The only place where I feel as "productive" as I heard from other people is when I do stuff in languages or technologies I don't know at all, but then again, I also don't know if that functional code I get at the end is broken in things I am not aware of.<p>Are any of you guys really using LLMs to create full features in big enterprise apps?