metadata
license: mit
language:
- en
tags:
- computer-vision
- object-detection
- yolov8
- document-analysis
- historical-documents
- heritage-ai
- pytorch
- ultralytics
pipeline_tag: object-detection
ποΈ YOLOv8 β Historical Document Ornament Detector
Automatic detection of typographic ornaments in 16thβ18th century printed documents. Developed as part of the TypoRef project at PolyTech Tours.
π Model Summary
| Property | Details |
|---|---|
| ποΈ Architecture | YOLOv8 (Ultralytics) |
| π― Task | Object Detection |
| π mAP@50 | 95% |
| ποΈ Dataset | 50+ expert-annotated historical document pages |
| π Document period | 16th β 18th century printed books |
| βοΈ Framework | PyTorch + Ultralytics |
| π Processing speedup | 20% faster than manual workflow |
| π License | MIT |
π§ What This Model Does
This model detects and localizes typographic ornaments and decorative graphic elements in scanned pages of early modern European printed books.
It was built to replace a slow, fully manual cataloguing process for the TypoRef digital humanities project, enabling automated analysis of thousands of document pages that would otherwise require extensive expert annotation.
Detected classes: typographic ornaments, decorative initials, vignettes, and other graphic elements typical of 16thβ18th century printing.
π Performance
| Metric | Score |
|---|---|
| mAP@50 | 95% |
| Training duration | 6 months iterative refinement |
| Annotations integrated | 50+ pages in 2 months |
| Processing time reduction | 20% vs previous pipeline |
π How to Use
from ultralytics import YOLO
# Load the model
model = YOLO("best.pt")
# Run inference on a document scan
results = model("your_document_scan.jpg", conf=0.35)
# Show results
results[0].show()
# Save annotated image
results[0].save("output.jpg")
ποΈ Training Data
- Source: Historical printed books from the TypoRef corpus (16thβ18th century)
- Annotations: Expert-annotated by digital humanities researchers at PolyTech Tours
- Volume: 50+ annotated document pages
- Augmentation: Standard YOLOv8 augmentation pipeline
β οΈ Limitations
- Optimized for black-and-white or greyscale document scans
- Performance may degrade on very low-resolution scans (< 150 DPI)
- Trained on Western European printing conventions β may generalize poorly to other traditions
π Related Resources
- π€ Live Demo Space
- π» GitHub Repository
π€ Author
Martin Badrous β Computer Vision & Deep Learning Engineer