Best Way to Digitise Historical Archives
Digitising historical archives is fundamentally different from scanning modern business documents. Historical records may be centuries old, physically fragile, oversized, bound, faded or written in hands that modern OCR cannot read. The priorities shift from speed and cost-efficiency to preservation, accuracy and long-term accessibility. Getting this wrong can damage irreplaceable material.
Understanding the Challenges
Physical Condition
Historical documents present handling challenges that modern paper does not:
- Fragile paper: Acid paper from the 19th and early 20th centuries becomes brittle and can crumble when handled. Parchment and vellum from earlier periods may be stiff, warped or cracked
- Faded ink: Iron gall ink (common before the 20th century) fades over time and can eat through the paper. Pencil entries may be barely visible
- Bound volumes: Ledgers, minute books, parish registers and other bound records cannot pass through a document feeder. Pages near the spine may be difficult to flatten without damaging the binding
- Unusual sizes: Historical records come in every conceivable size — from tiny receipt slips to large maps and plans. Standard A4/A3 scanners cannot handle much of this material
- Mixed media: A single archive may contain paper, parchment, photographs, maps, drawings, fabric samples, pressed flowers and other non-standard items
Specialist Handling Requirements
Handling historical material safely requires:
- Clean cotton or nitrile gloves to prevent oils from skin damaging the paper
- Supporting fragile documents on acid-free board during transport
- Never forcing a bound volume flat — use a book cradle or V-shaped scanner that allows the spine to remain partially closed
- Working in a clean, temperature-controlled environment (ideally 16-20C, 45-55% relative humidity)
- Minimising handling — each time a document is touched is a risk
Scanning Equipment and Methods
Flatbed Scanners
For loose single sheets in reasonable condition, a high-quality flatbed scanner (such as an Epson Expression or similar) provides excellent results. The document lies flat on the glass with no mechanical stress. Flatbed scanning is slow — one page at a time, manually positioned — but safe and accurate.
Overhead and Planetary Scanners
For bound volumes and fragile material, overhead (planetary) scanners are the standard. The document sits on a flat surface or book cradle beneath a camera that captures the image from above. The document is never pressed against glass or fed through rollers. High-end models (Zeutschel, i2S) used by national archives produce exceptional results at resolutions up to 800 DPI.
Large-Format Scanners
Maps, plans and oversized documents require wide-format scanners with feed widths of 36-44 inches. For particularly fragile large items, overhead capture using a calibrated DSLR camera on a copy stand may be safer than feeding through any mechanical scanner.
Resolution and Format Choices
Heritage digitisation typically requires higher specifications than business scanning:
- Resolution: 300 DPI minimum for text documents. 400-600 DPI for documents with fine detail, small text or illustrations. 600-800 DPI for photographs, artwork and material where maximum detail capture is required
- Master format: Uncompressed TIFF is the established standard for preservation masters. These files are large (a 600 DPI colour A3 scan produces a file of approximately 200-300MB) but preserve every detail captured by the scanner
- Access copies: JPEG 2000 or PDF/A for access and delivery. These are compressed for practical use while maintaining good quality
- Colour mode: Full colour for most historical material, even documents that appear black and white. Colour capture preserves paper tone, ink colour variations, stains and other features that may have historical or forensic significance
Metadata Standards
Historical archives require richer metadata than business records. Standard fields include:
- Unique identifier (reference number within the archive catalogue)
- Title or description of the document
- Date or date range (which may be approximate for older material)
- Creator or author (if known)
- Physical description (size, material, condition, number of pages)
- Subject keywords or classification
- Copyright status and access restrictions
- Technical metadata (scanner model, resolution, colour profile, date of digitisation)
The Dublin Core metadata standard is widely used in the heritage sector. For UK archives, cataloguing should follow ISAD(G) (General International Standard Archival Description) and be compatible with the National Archives’ Discovery catalogue where relevant.
UK Heritage Guidelines
Several UK organisations provide guidance on digitisation standards:
- The National Archives (TNA): Publishes detailed digitisation guidance including technical specifications, metadata requirements and quality assurance procedures. TNA’s standards are considered the benchmark for UK archive digitisation
- Historic England: Provides guidance on digitising heritage records, particularly architectural drawings, photographs and archaeological records
- JISC: Offers guidance for higher education and research institutions on digitisation projects, including technical standards and project management
- The British Library: Shares technical specifications and lessons learned from its large-scale digitisation programmes
Following established standards is not just good practice — it ensures your digital archive is compatible with national catalogues, interoperable with other collections and meets the expectations of researchers and funding bodies.
Cost Expectations
Historical archive digitisation costs significantly more than commercial document scanning:
- Loose documents, good condition: 30-75p per page (flatbed scanning at high resolution with metadata)
- Bound volumes: 50p-£2 per page (overhead scanning, careful handling, book cradle)
- Fragile or damaged material: £1-£5 per page (additional handling time, conservation support)
- Large-format maps and plans: £3-£15 per item (specialist equipment, manual handling)
- Metadata creation: £1-£5 per item (research and cataloguing by qualified archivists)
A typical project to digitise 10,000 pages of mixed historical material with full metadata might cost £10,000-£30,000. Projects involving fragile bound volumes or extensive cataloguing can be significantly more. Heritage Lottery Fund (now National Lottery Heritage Fund) grants have funded many UK archive digitisation projects and may be worth investigating.
Get a Free Quote
Every project is different, so the best way to understand your options is to get in touch with our team. We provide clear, no-obligation advice — usually within the same day.
Call us on 01691 650355 or use the form below.





