video699.page.vgg16

This module implements a page detector that matches the last hidden VGG16 layer activations of document page images against those of projection screen images. Related classes and functions are also implemented.

Module Contents

video699.page.vgg16.LOGGER
video699.page.vgg16.CONFIGURATION
video699.page.vgg16.VGG16_INPUT_SIZE = 224
video699.page.vgg16.VGG16_OUTPUT_SIZE = 25088
video699.page.vgg16.VGG16_MODEL
video699.page.vgg16._last_hidden_vgg16_layer(images)

Produces the last hidden VGG16 layer activations for images.

Parameters: images (iterable of ImageABC) – Images.
Returns: activations – The last hidden VGG16 layer activations for the images.
Return type: iterable of array_like
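
The two size constants above appear to be related by the VGG16 architecture. The following sketch assumes that VGG16_OUTPUT_SIZE is the flattened output of the final pooling block, which the source does not state explicitly. It relies on the standard architecture having five 2×2 max-pooling stages and 512 channels in its last convolutional block; the helper name flattened_output_size is hypothetical and not part of this module.

```python
# Sketch only: derives the flattened size of the final VGG16 pooling
# output from the input side length. The helper name is hypothetical.

VGG16_INPUT_SIZE = 224    # input side length, as listed in this module
NUM_POOLING_STAGES = 5    # standard VGG16 has five 2x2 max-pooling stages
FINAL_CHANNELS = 512      # channels in the last VGG16 convolutional block


def flattened_output_size(input_size,
                          num_pools=NUM_POOLING_STAGES,
                          channels=FINAL_CHANNELS):
    """Return side * side * channels after halving the side num_pools times."""
    side = input_size
    for _ in range(num_pools):
        side //= 2  # each 2x2 max-pooling stage halves the spatial side
    return side * side * channels


# 224 -> 112 -> 56 -> 28 -> 14 -> 7, and 7 * 7 * 512 = 25088
vgg16_output_size = flattened_output_size(VGG16_INPUT_SIZE)
```

Under this assumption, each image maps to one activation vector of length 25088, which is the value listed for VGG16_OUTPUT_SIZE.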
class video699.page.vgg16.KerasVGG16PageDetector(documents)

Bases: video699.interface.PageDetectorABC

A page detector using approximate nearest neighbor search of last hidden VGG16 layer activations.

The VGG16 model is based on the paper by Simonyan and Zisserman [Simonyan15].

[Simonyan15] Simonyan, Karen, and Andrew Zisserman. “Very Deep Convolutional Networks for Large-Scale Image Recognition.” arXiv. 2015. URL

Parameters: documents (set of DocumentABC) – The provided document pages.
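
To illustrate the matching idea, the sketch below pairs a screen activation vector with the closest page activation vector. It is not the detector's implementation: it substitutes an exact brute-force Euclidean search for the approximate nearest neighbor search, uses short toy vectors in place of real VGG16 activations, and the function name nearest_page and the page keys are hypothetical.

```python
import math


def nearest_page(screen_activation, page_activations):
    """Return (key, distance) of the page vector closest to the screen vector.

    Exact brute-force search, used here as a stand-in for the approximate
    nearest neighbor search that the detector is described as using.
    """
    best_key, best_distance = None, math.inf
    for key, activation in page_activations.items():
        distance = math.dist(screen_activation, activation)  # Euclidean
        if distance < best_distance:
            best_key, best_distance = key, distance
    return best_key, best_distance


# Toy activation vectors (hypothetical); real ones would be much longer.
pages = {
    "page-1": [0.0, 1.0, 0.0],
    "page-2": [1.0, 0.0, 0.5],
}
screen = [0.9, 0.1, 0.4]

match, distance = nearest_page(screen, pages)
```

Here the screen vector lies closest to "page-2". A brute-force scan costs time linear in the number of pages per query, which is the usual reason to prefer an approximate index for large document sets.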
detect(self, frame, appeared_screens, existing_screens, disappeared_screens)