video699.page.vgg16
This module implements a page detector that matches the last hidden VGG16 layer activations of document page images against the last hidden VGG16 layer activations of projection screen images. Related classes and functions are also implemented.
Module Contents
video699.page.vgg16.LOGGER
video699.page.vgg16.CONFIGURATION
video699.page.vgg16.VGG16_INPUT_SIZE = 224
video699.page.vgg16.VGG16_OUTPUT_SIZE = 25088
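The two size constants above are consistent with the standard VGG16 architecture, assuming the "last hidden layer" is the flattened output of the final pooling layer (an assumption; the module source is not shown here). The short sketch below reproduces the arithmetic. The names NUM_POOLING_LAYERS and LAST_BLOCK_CHANNELS are illustrative and not part of this module.

```python
# Illustrative arithmetic only; these helper names are not part of video699.
VGG16_INPUT_SIZE = 224      # width and height of the input image, in pixels
NUM_POOLING_LAYERS = 5      # VGG16 has five 2x2 max-pooling layers
LAST_BLOCK_CHANNELS = 512   # channels produced by the last convolutional block

# Each pooling layer halves the spatial resolution: 224 / 2**5 = 7.
feature_map_side = VGG16_INPUT_SIZE // 2 ** NUM_POOLING_LAYERS

# Flattening a 7 x 7 x 512 feature map gives one vector per image.
flattened_size = feature_map_side * feature_map_side * LAST_BLOCK_CHANNELS
```

Under these assumptions, flattened_size equals the documented VGG16_OUTPUT_SIZE.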
video699.page.vgg16.VGG16_MODEL
Produces the last hidden VGG16 layer activations for images.
Parameters: images (iterable of ImageABC) – Images.
Returns: activations – The last hidden VGG16 layer activations for the images.
Return type: iterable of array_like
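As a rough illustration of how such activations might be produced, the sketch below flattens the output of a headless VGG16 model for a batch of images. It is not the module's implementation: the function names build_keras_vgg16 and last_hidden_activations are hypothetical, the use of include_top=False is an assumption inferred from the output size of 25088, and image preprocessing (resizing, channel ordering, mean subtraction) is omitted.

```python
import numpy as np

VGG16_INPUT_SIZE = 224
VGG16_OUTPUT_SIZE = 25088


def build_keras_vgg16():
    """Build a VGG16 model without its classifier head.

    Hypothetical helper. Keras is imported lazily, so the rest of this
    sketch runs without TensorFlow installed.
    """
    from tensorflow.keras.applications.vgg16 import VGG16
    return VGG16(
        weights='imagenet',
        include_top=False,  # assumption: drop the fully connected layers
        input_shape=(VGG16_INPUT_SIZE, VGG16_INPUT_SIZE, 3),
    )


def last_hidden_activations(model, rgb_images):
    """Return one flattened activation vector per image.

    `model` is any object with a Keras-style predict(batch) method, and
    `rgb_images` has shape (n, 224, 224, 3).
    """
    batch = np.asarray(rgb_images, dtype=np.float32)
    outputs = np.asarray(model.predict(batch))
    # Collapse the (7, 7, 512) feature map of each image into a vector.
    return outputs.reshape(len(batch), -1)
```

With a real model, `last_hidden_activations(build_keras_vgg16(), batch)` would return an array of shape (n, 25088) under the assumptions above.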
class video699.page.vgg16.KerasVGG16PageDetector(documents)
Bases: video699.interface.PageDetectorABC
A page detector using approximate nearest neighbor search of last VGG16 layer activations.
The VGG16 model is based on the paper by Simonyan and Zisserman [Simonyan15].
[Simonyan15] Simonyan, Karen and Andrew Zisserman. “Very Deep Convolutional Networks for Large-Scale Image Recognition.” arXiv. 2015. URL
Parameters: documents (set of DocumentABC) – The provided document pages.
detect(self, frame, appeared_screens, existing_screens, disappeared_screens)
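The matching idea behind KerasVGG16PageDetector can be sketched as a nearest neighbor lookup between screen activations and page activations. The sketch below uses an exact brute-force search with NumPy as a stand-in for the approximate index that the class is described as using; the function name nearest_pages is hypothetical, and the Euclidean distance is an assumption.

```python
import numpy as np


def nearest_pages(page_activations, screen_activations):
    """For each screen vector, find the closest page vector.

    Exact brute-force search, used here in place of an approximate
    nearest neighbor index. Returns (indices, distances).
    """
    pages = np.asarray(page_activations, dtype=np.float64)
    screens = np.asarray(screen_activations, dtype=np.float64)
    # Pairwise Euclidean distances, shape (num_screens, num_pages).
    differences = screens[:, None, :] - pages[None, :, :]
    distances = np.linalg.norm(differences, axis=2)
    indices = distances.argmin(axis=1)
    return indices, distances[np.arange(len(screens)), indices]


# Toy two-dimensional vectors instead of 25088-dimensional activations.
pages = [[0.0, 0.0], [10.0, 10.0]]
screens = [[9.0, 9.0], [1.0, 0.0]]
indices, distances = nearest_pages(pages, screens)
```

Here the first screen is matched to page 1 and the second to page 0. A brute-force search builds the full difference array in memory, which is one reason an approximate index is attractive for high-dimensional activations and many pages.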