How to Budget for a Large Document Scanning Project
Budgeting for a large scanning project can feel like guessing. You know you have a lot of paper, but how much it will cost to digitise depends on factors you may not have considered — document condition, preparation requirements, indexing depth, and the gap between what you think you have and what you actually have. This guide helps you build a realistic budget.
Step 1: Estimate Your Volume
You need a reasonable estimate of how many pages you are dealing with. The easiest way:
- Archive boxes: A standard archive box holds approximately 2,000-2,500 A4 sheets. Count your boxes and multiply
- Filing cabinets: A full four-drawer filing cabinet holds approximately 8,000-10,000 sheets (roughly 3-4 archive boxes per drawer)
- Lever arch files: A full lever arch file holds approximately 400-500 sheets
- Expansion files and wallets: Variable, but typically 100-300 sheets each
Do not count too precisely at this stage — you need an order-of-magnitude estimate. Knowing you have 50,000-75,000 pages is far more useful for budgeting than spending weeks trying to count to the exact page.
Step 2: Assess Document Condition
Open a sample of boxes and evaluate the contents. This is the step most people skip, and it is the reason most budgets are wrong. Look at:
- Are documents loose or in folders/binders?
- Are they stapled?
- Are there mixed paper sizes?
- Is the paper in good condition or damaged/fragile?
- Are there sticky notes, annotations, or attachments?
- How well organised are they — sorted or random?
A box of clean, loose A4 sheets costs a fraction of a box of stapled, folded, mixed-format documents in lever arch files. If you send a sample box to your scanning provider, they can give you a much more accurate quote than if you just tell them a number.
Step 3: Define Your Requirements
Before requesting quotes, decide:
- Resolution: 300 DPI is the standard. Only specify higher if you have a specific need
- Colour mode: Auto-colour detect is the best default — colour where needed, black and white otherwise
- OCR: Almost always yes — the cost is minimal and the benefit is significant
- Indexing: What level? Basic file naming (cheapest), document-level indexing (moderate), or full metadata capture (most expensive)?
- Output format: PDF is standard. Multi-page PDFs per document, or per folder, or per box?
- Delivery: USB drive, cloud upload, or integration with your document management system?
- Originals: Return after scanning, keep in storage, or securely destroy?
Step 4: Build the Budget
A realistic budget for a large project should include all these line items:
- Document preparation: 3-12p per page (the biggest variable)
- Scanning: 4-10p per page (depends on volume and document type)
- OCR: 1-3p per page
- Indexing: 0-20p per document (depends on depth)
- Quality assurance: Usually included in the per-page rate, but confirm
- Collection and delivery of physical documents: £100-£500 depending on volume and distance
- Output delivery: Free for USB drive, variable for cloud upload or system integration
- Secure destruction of originals (if applicable): £5-£15 per box
- Contingency: 15-20% of the total, because volumes are always estimates and document condition always has surprises
Step 5: Get Multiple Quotes
Request quotes from at least three providers, giving each the same brief. Compare on:
- Total project cost (not just per-page rate — some providers quote low per-page but charge extra for preparation, OCR or indexing)
- What is included in the per-page rate and what is extra
- Turnaround time — some providers are faster but more expensive
- Quality assurance process — what checking is done and by whom?
- Security credentials — ISO 27001, DBS checks, data handling procedures
- References from similar projects
Common Budget Mistakes
- Underestimating volume: People consistently undercount. A “small archive” of 20 boxes is 50,000 pages — not a small scanning project
- Ignoring preparation: The scanning rate looks affordable until preparation doubles it
- Forgetting indexing: Scanning without indexing creates a pile of unnamed files that are almost as hard to search as the original paper
- No contingency: Volumes turn out higher than estimated, document condition is worse than expected, and scope creep adds requirements. Budget at least 15% contingency
- Comparing only on price: The cheapest provider may cut corners on quality, leaving you with an unusable archive
Get a Free Quote
Every project is different, so the best way to understand your costs is to get in touch with our team. We provide clear, no-obligation quotes — usually within the same day.
Call us on 01691 650355 or use the form below.





