
Document Scan OCR: Copy the text in your scans

August 11, 2023
Collin Pham

Today we introduced the ability to copy the text in your document scans. We also now deliver that text to you programmatically via our API + Webhooks.

Document Scans

Before this update, our PDFs were really just JPEGs under the hood: they didn’t contain any actual text or metadata.

Now, our PDFs contain real text. Specifically, we extract the text we detect in the image using OCR, and then overlay the detected text directly on the image at the position where it was found, sort of like a stamp. Surprisingly, this stamping method is very common with PDFs.
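As a rough illustration of the stamping idea, here is a minimal Python sketch, not the actual pipeline. It assumes the OCR step yields word boxes in image pixels with the origin at the top-left, and that the scan resolution is known. The names `OcrWord` and `text_layer_ops` are hypothetical, as are the font name `/F1` and the choice of invisible text (render mode 3), which is one common way to build such a layer. The sketch emits standard PDF content-stream operators that place each word over its box.

```python
from dataclasses import dataclass


@dataclass
class OcrWord:
    """One OCR-detected word (hypothetical shape); box in pixels, origin top-left."""
    text: str
    left: float
    top: float
    width: float
    height: float


def escape_pdf_string(text: str) -> str:
    """Escape the characters that are special inside a PDF literal string."""
    return text.replace("\\", "\\\\").replace("(", "\\(").replace(")", "\\)")


def text_layer_ops(words, image_height_px: float, dpi: float = 300.0) -> str:
    """Build content-stream operators that stamp each word over its box.

    PDF user space is measured in points (1/72 inch) with the origin at the
    bottom-left, so pixel coordinates are scaled and the y axis is flipped.
    """
    scale = 72.0 / dpi
    ops = ["BT", "3 Tr"]  # begin text; render mode 3 = invisible (an assumption)
    for w in words:
        x = w.left * scale
        y = (image_height_px - (w.top + w.height)) * scale  # bottom edge of the box
        size = w.height * scale  # crude font size: the height of the box
        ops.append(f"/F1 {size:.2f} Tf")          # /F1 must exist in page resources
        ops.append(f"1 0 0 1 {x:.2f} {y:.2f} Tm")  # move to the word's position
        ops.append(f"({escape_pdf_string(w.text)}) Tj")
    ops.append("ET")
    return "\n".join(ops)


# Made-up sample: one word on a 3300 px tall page scanned at 300 DPI.
sample_ops = text_layer_ops(
    [OcrWord("Invoice (copy)", left=300, top=150, width=600, height=75)],
    image_height_px=3300,
)
```

A real implementation would also need to handle font encoding for non-ASCII text and stretch each word to match its box width; a PDF library would normally take care of those details.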

This update affects the PDF files we generate, which means you can copy the text wherever the PDF is rendered — in the Stable Dashboard, our notifications, or anywhere else you may have downloaded the file.

API + Webhooks

The OCR results from each document scan will be included in our API + Webhook payloads. These results will contain the text and the location of the text within the document.
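To make this concrete, here is a minimal Python sketch of consuming such a payload. The payload shape is an assumption made for illustration: the field names `ocr_results`, `bounding_box`, `page`, `x`, `y`, `width`, and `height` are hypothetical and are not taken from the actual API, so check the API documentation for the real schema. The sample body is made up.

```python
import json
from typing import NamedTuple


class TextBlock(NamedTuple):
    """A piece of detected text and where it sits on the page."""
    text: str
    page: int
    x: float       # left edge, as a fraction of page width (assumed)
    y: float       # top edge, as a fraction of page height (assumed)
    width: float
    height: float


def parse_ocr_blocks(raw_body: str) -> list:
    """Parse a webhook body (hypothetical shape) into TextBlock records."""
    payload = json.loads(raw_body)
    blocks = []
    for item in payload.get("ocr_results", []):  # hypothetical field name
        box = item["bounding_box"]               # hypothetical field name
        blocks.append(TextBlock(
            text=item["text"],
            page=item.get("page", 1),
            x=box["x"], y=box["y"],
            width=box["width"], height=box["height"],
        ))
    return blocks


def full_text(blocks) -> str:
    """Join blocks in reading order: by page, top to bottom, then left to right."""
    ordered = sorted(blocks, key=lambda b: (b.page, b.y, b.x))
    return " ".join(b.text for b in ordered)


# Made-up sample body, deliberately out of reading order.
sample_body = json.dumps({
    "ocr_results": [
        {"text": "Total due: $250.00", "page": 1,
         "bounding_box": {"x": 0.10, "y": 0.40, "width": 0.30, "height": 0.03}},
        {"text": "Invoice", "page": 1,
         "bounding_box": {"x": 0.10, "y": 0.05, "width": 0.15, "height": 0.03}},
        {"text": "#1042", "page": 1,
         "bounding_box": {"x": 0.30, "y": 0.05, "width": 0.10, "height": 0.03}},
    ]
})

blocks = parse_ocr_blocks(sample_body)
print(full_text(blocks))
```

Keeping the location alongside the text is what makes position-based extraction possible, for example reading only the blocks near the top of the first page to find an invoice number.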

We hope this information will enable new use cases for document handling like invoice resolution, custom data extraction, document classification, and more.

Let us know what you think of this feature once you try it out!
