
I manage several HubSpot instances, and getting data from PDF contracts into the CRM is a massive bottleneck.

Standard OCR + regex was too brittle, so I built Scanny AI.

It listens for Drive webhooks, uses a vision model to extract keys (like "Total Value") regardless of layout, enforces a strict JSON schema, and sends PATCH requests to the HubSpot API.

It handles about 5k pages/hour.

Docs and API keys: scanny-ai.com
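For readers wondering what the "strict JSON schema, then PATCH" step might look like, here is a minimal sketch. It is not Scanny AI's actual code. The schema fields, the helper names `validate_strict` and `build_deal_patch`, and the mapping of "Total Value" onto the deal's `amount` property are all assumptions for illustration. The request is only built, not sent.

```python
import json
import urllib.request

# Hypothetical schema: field name -> accepted Python type(s) after json.loads.
CONTRACT_SCHEMA = {
    "total_value": (int, float),
    "currency": str,
    "counterparty": str,
}


def validate_strict(payload: dict, schema: dict) -> dict:
    """Reject missing keys, unexpected keys, and values of the wrong type."""
    missing = schema.keys() - payload.keys()
    extra = payload.keys() - schema.keys()
    if missing or extra:
        raise ValueError(f"missing={sorted(missing)} extra={sorted(extra)}")
    for key, expected in schema.items():
        value = payload[key]
        # bool is a subclass of int in Python, so rule it out explicitly.
        if isinstance(value, bool) or not isinstance(value, expected):
            raise ValueError(f"{key!r} has type {type(value).__name__}")
    return payload


def build_deal_patch(deal_id: str, fields: dict, token: str) -> urllib.request.Request:
    """Build (but do not send) a PATCH request for a HubSpot deal record."""
    # Assumption: the contract total maps to the deal's "amount" property.
    body = {"properties": {"amount": str(fields["total_value"])}}
    return urllib.request.Request(
        url=f"https://api.hubapi.com/crm/v3/objects/deals/{deal_id}",
        data=json.dumps(body).encode("utf-8"),
        headers={
            "Authorization": f"Bearer {token}",
            "Content-Type": "application/json",
        },
        method="PATCH",
    )


# Example: model output arrives as a JSON string and is validated before use.
raw = '{"total_value": 12500.5, "currency": "USD", "counterparty": "Acme GmbH"}'
fields = validate_strict(json.loads(raw), CONTRACT_SCHEMA)
request = build_deal_patch("123", fields, token="YOUR_TOKEN")
```

Validating before building the request means a malformed extraction (say, `"total_value": "12,500"` as a string) fails loudly instead of writing bad data into the CRM. Passing `request` to `urllib.request.urlopen` would actually send it.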


A vision-based PDF parser to validate contracts in Google Drive