DL models that take a document image file as input, locate the position of paragraphs, lines, images, etc. with their labels and confidence scores.

swapnil-ahlawat/Document_Layout_Analysis-MonkAI

Document Layout Detection using MonkAI Object Detection Library

Deep learning models that take a document image file as input, locate the position of paragraphs, lines, images, etc. with their labels and confidence scores.
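
As a minimal sketch of what such output could look like, the snippet below models one detected region (label, confidence score, bounding box) and filters a list of them by confidence. The `Detection` class, its field names, and the `filter_detections` helper are hypothetical illustrations, not part of this repository or of the Monk library.

```python
from dataclasses import dataclass
from typing import List, Tuple


@dataclass
class Detection:
    """One detected layout region (hypothetical structure for illustration)."""
    label: str                       # e.g. "paragraph", "line", "image"
    confidence: float                # score in [0, 1]
    box: Tuple[int, int, int, int]   # (x_min, y_min, x_max, y_max) in pixels


def filter_detections(detections: List[Detection],
                      min_confidence: float = 0.5) -> List[Detection]:
    """Keep detections at or above the threshold, highest confidence first."""
    kept = [d for d in detections if d.confidence >= min_confidence]
    return sorted(kept, key=lambda d: d.confidence, reverse=True)
```

A caller would pass the raw detections for one page and a threshold chosen for the task; the default of 0.5 here is an arbitrary assumption.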

Choice of architecture

Inspired by the blog post: https://medium.com/@Intellica.AI/a-comparative-study-of-custom-object-detection-algorithms-9e7ddf6e765e

YOLOv3, Faster R-CNN, and SSD are three of the most widely used model architectures for object detection. For this task, the predictions and confidence scores that these three architectures produce on inference images were compared.
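
One simple way to put such a comparison into numbers is to summarize the confidence scores each model reports on the same set of inference images. The sketch below assumes the scores have already been collected into a dictionary keyed by model name; the `summarize_confidences` function and that input layout are hypothetical and do not reflect how this repository performs the comparison.

```python
from statistics import mean
from typing import Dict, List


def summarize_confidences(
    scores_by_model: Dict[str, List[float]]
) -> Dict[str, Dict[str, float]]:
    """Return detection count and mean confidence for each model."""
    summary = {}
    for model, scores in scores_by_model.items():
        summary[model] = {
            "count": len(scores),
            # An empty list means the model detected nothing; report 0.0.
            "mean_confidence": mean(scores) if scores else 0.0,
        }
    return summary
```

Mean confidence alone does not show whether the predicted boxes are correct, so a summary like this is best read alongside a visual check of the predictions.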

Tutorial Blog

https://medium.com/@swapnil.ahlawat/object-detection-document-layout-analysis-using-monk-object-detection-toolkit-6c57200bde5
