Inbox Ledger
FeaturesDocuments

Pre-classification

How Inbox Ledger filters non-financial PDFs out of the pipeline before processing.

Pre-classification

What pre-classification does

Every PDF that arrives from an email source (Gmail, Outlook, or IMAP) is checked by an AI model before it is stored. The model labels it financial (an invoice or receipt) or non-financial. When the model is confident the file is non-financial, at a confidence of 0.7 or higher, Inbox Ledger skips it: nothing is stored, no extraction runs, and no credit is spent.

Why it exists

Inboxes carry a lot that is not an invoice: newsletters, product brochures, signed contracts, and marketing attachments. Pre-classification keeps your credit balance and your Documents list focused on the files you actually care about.

What gets skipped

These are the kinds of PDF pre-classification filters out:

  • Marketing material such as brochures, catalogs, and whitepapers
  • Contracts and signed agreements with no totals or line items
  • Newsletters and announcements that ship with a PDF
  • Manuals, technical documents, blank pages, and plain letters

Scanned files with very little readable text are harder to classify, so a genuine invoice with low text density is occasionally skipped. If that happens, the bypass routes below get it processed.

When the check errors

If the classifier call fails for any reason, a timeout, an API error, or a malformed response, the document is treated as financial and goes through the full pipeline. Pre-classification only ever removes work. An infrastructure hiccup never blocks a real invoice.

How to bypass it

Three routes skip the classifier, because each one signals that you meant to send the file:

  • Manual upload. Anything you drag into the Documents page is always processed.
  • Forwarding addresses. Email sent to your @fw.inboxledger.app address is always processed.
  • Reprocess. Re-running the pipeline on a document from the detail view forces a full extraction pass.

If pre-classification skipped something it should have kept, forward it to your forwarding address or upload it by hand. Both routes bypass the check. Seeing the same false positive repeat? Send the PDF to support@inboxledger.app. The reports we get help us tune the classifier.

Where to see the count

Each sync run reports how many files it skipped this way in a counter named documentsSkippedNonFinancial. You will find it in the sync stats on the Sources page after a sync finishes.

Sync result showing the skipped-document count
Dashboard then Sources. Show a finished sync's stats with the documents-skipped-as-non-financial count visible.

Ready to try this?

Inbox Ledger turns your inbox into clean accounting data. The free tier includes 10 credits, refilled every 30 days.

Start free

On this page