展示HN:将原始屏幕录制转换为带注释的精确指南截图

1作者: docuagent大约 1 个月前原帖
在你说这是另一个RAG克隆之前,请先听我说几句。 <p>问题: 作为创作者:你需要录制屏幕、编辑、注释,然后再进行展示。如果有任何变化,你就得重新做一遍这个过程。 作为最终用户:你需要观看一段5分钟的视频,而你只需要知道其中5秒钟的内容来完成特定任务。 <p>解决方案: 对于创作者:录制并上传你的原始屏幕捕捉,无需进一步的努力。 对于最终用户:你提问后,会得到与你的具体问题相关的文档,并附有注释的截图。 <p>这与Scribe或RAG有什么不同? * 与Scribe相比:Scribe是用于主动捕捉(在工作时点击)。DocuFine则是用于被动提取——它在事后将你现有的原始视频或演示转换为指南。 * 与RAG相比:大多数视频RAG仅搜索转录文本。DocuFine通过大型语言模型(LLM)“看”用户界面,然后使用光学字符识别(OCR)将注释“贴”到实际按钮上,因此即使视频没有声音,指南也能在空间上准确。 <p>网站尚未上线——我目前正在收集对这个概念和演示的反馈,待优化LLM成本和提取逻辑后再开放。 <p>演示链接: - 初始录制:<a href="https://streamable.com/c5gom5" rel="nofollow">https://streamable.com/c5gom5</a> - 提问内容:如何找到客户下的订单? - 生成的输出指南:<a href="https://streamable.com/9c4ncj" rel="nofollow">https://streamable.com/9c4ncj</a> <p>端到端演示:<a href="https://streamable.com/hqb6te" rel="nofollow">https://streamable.com/hqb6te</a> <p>非常感谢你的诚实反馈!
查看原文
Before you say another RAG clone, please hear me out for a second.<p>The Problem: As a creator: You have to screen record, edit, annotate, and then present. If anything changes, you redo the process. As an end user: You have to watch a 5-minute video when all you need to know is 5 seconds of that video to perform a specific task.<p>The Solution: For creators: Record and upload your raw screen captures. No further effort. For end users: You ask a question, and you get exactly the document for your specific question with annotated screenshots.<p>How is this different from Scribe or RAG? * vs. Scribe: Scribe is for active capture (clicking while you work). DocuFine is for passive extraction—it turns your existing raw videos or demos into guides after the fact. * vs. RAG: Most video RAG just searches transcripts. DocuFine &quot;sees&quot; the UI using an LLM and then uses OCR to &quot;snap&quot; the annotations to the actual buttons, so the guides are spatially accurate even if the video is silent.<p>The site isn&#x27;t live yet—I&#x27;m currently gathering feedback on the concept and demo before opening it up, as I&#x27;m still optimizing the LLM costs and extraction logic.<p>Demo Links: - Initial Recording: <a href="https:&#x2F;&#x2F;streamable.com&#x2F;c5gom5" rel="nofollow">https:&#x2F;&#x2F;streamable.com&#x2F;c5gom5</a> - Query Asked: How do I find orders placed by a customer? - Generated Output Guide: <a href="https:&#x2F;&#x2F;streamable.com&#x2F;9c4ncj" rel="nofollow">https:&#x2F;&#x2F;streamable.com&#x2F;9c4ncj</a><p>End-to-End-Demo: <a href="https:&#x2F;&#x2F;streamable.com&#x2F;hqb6te" rel="nofollow">https:&#x2F;&#x2F;streamable.com&#x2F;hqb6te</a><p>Honest feedback appreciated!