Why are there no LoRAs for large language models (LLMs)?

1 | by instagraham | about 1 month ago
When you look at LoRAs for image generation models (like on Civitai), you get a very powerful range of different visual styles - allowing a base model to be constantly refined to suit whatever is trending at the time.

What makes text LLMs different, then? For example, why are there no specific LoRAs for things like Python coding, sci-fi writing, and so on?

I've read discussions about text LoRAs being specific to their base model - which is honestly also largely the case with image gen.

A rich LoRA culture would make a lot of models more versatile and reduce the need to keep downloading new checkpoints.