A vision-based PDF parser to validate contracts in Google Drive

1 point | by scannyai | 28 days ago | original post
I manage several HubSpot instances, and getting data from PDF contracts into the CRM is a massive bottleneck.

Standard OCR + Regex was too brittle. So I built Scanny AI.

It listens for Drive webhooks, uses a vision model to extract keys (like "Total Value") regardless of layout, enforces a strict JSON schema, and patches the HubSpot API.

It handles about 5k pages/hour.

Docs and API keys: scanny-ai.com
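To make the "strict JSON schema" step concrete, here is a minimal sketch of what validating the vision model's output and shaping a HubSpot update body might look like. This is not Scanny AI's actual code: the schema fields (`counterparty`, `total_value`, `currency`), the helper names, and the HubSpot property names are all assumptions chosen for illustration, and the sketch only builds the request body rather than calling any API.

```python
from decimal import Decimal, InvalidOperation

# Hypothetical schema (an assumption, not Scanny AI's): field -> required type.
CONTRACT_SCHEMA = {
    "counterparty": str,
    "total_value": Decimal,
    "currency": str,
}


def parse_amount(raw: str) -> Decimal:
    """Turn a string such as '$12,500.00' into Decimal('12500.00')."""
    cleaned = raw.replace("$", "").replace(",", "").strip()
    try:
        return Decimal(cleaned)
    except InvalidOperation as exc:
        raise ValueError(f"not a number: {raw!r}") from exc


def validate_contract(extracted: dict) -> dict:
    """Reject missing or unexpected keys, then check each field's type."""
    missing = CONTRACT_SCHEMA.keys() - extracted.keys()
    extra = extracted.keys() - CONTRACT_SCHEMA.keys()
    if missing or extra:
        raise ValueError(f"missing={sorted(missing)} extra={sorted(extra)}")

    record = dict(extracted)
    # The model is assumed to return amounts as text; normalise before checking.
    if isinstance(record["total_value"], str):
        record["total_value"] = parse_amount(record["total_value"])

    for key, expected in CONTRACT_SCHEMA.items():
        if not isinstance(record[key], expected):
            raise ValueError(f"{key}: expected {expected.__name__}")
    return record


def to_hubspot_patch(record: dict) -> dict:
    """Build a CRM update body; the property names here are assumptions."""
    return {
        "properties": {
            "dealname": record["counterparty"],
            "amount": str(record["total_value"]),
            "deal_currency_code": record["currency"],
        }
    }
```

Rejecting unexpected keys, rather than silently dropping them, is what makes the schema "strict" in this sketch: a hallucinated or misnamed field fails loudly before anything is written to the CRM.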