HackerNews中文版

当你查看用于图像生成模型的LoRA（比如在Civitai上），你会发现各种强大的视觉风格，这使得基础模型能够不断调整，以适应当下的流行趋势。那么，文本大型语言模型（LLM）有什么不同呢？例如，为什么没有针对Python编程、科幻写作等特定领域的LoRA呢？我看到过关于文本LoRA与其基础模型密切相关的讨论——这在图像生成领域也是很常见的。丰富的LoRA文化将使更多模型变得多功能，并减少不断下载新检查点的需求。

查看原文

When you look at LoRAs for image generation model (like on Civitai), you get a very powerful range of different visual styles - allowing a base model to be constantly refined to suit whatever is trending at the time.<p>What makes text LLMs different, then? For example, why are there no specific LoRAs for things like python coding, or sci-fi writing, or so on?<p>I've read discussions about text LoRAs being specific to their base model - which is honestly also largely the case with image gen.<p>A rich LoRA culture would make a lot of models more versatile and reduce the need to keep downloading new checkpoints

为什么没有针对大型语言模型（LLMs）的LoRA？