Пожалуйста, используйте этот идентификатор, чтобы цитировать или ссылаться на этот документ:
https://elib.bsu.by/handle/123456789/289841
Заглавие документа: | Object Detection in Video Surveillance Based on Multiscale Frame Representation and Block Processing by a Convolutional Neural Network |
Авторы: | Bohush, Rykhard Ma, Guangdi Weichen, Yang Ablameyko, Sergey |
Тема: | ЭБ БГУ::ЕСТЕСТВЕННЫЕ И ТОЧНЫЕ НАУКИ::Математика ЭБ БГУ::ЕСТЕСТВЕННЫЕ И ТОЧНЫЕ НАУКИ::Кибернетика |
Дата публикации: | 2022 |
Издатель: | Pleiades journals |
Библиографическое описание источника: | Pattern Recogn Image Anal 2022;32(1). |
Аннотация: | A method for detecting objects in high-resolution images is proposed that is based on representing an image as a set of its copies of decreasing scale, splitting it into blocks with overlap at each level of the image pyramid except for the top one, detecting objects in the blocks, and analyzing objects at the boundaries of adjacent blocks to merge them. The number of pyramid layers is determined by the size of the image and the input layer of the convolutional neural network (CNN). At all levels except for the top one, a block splitting is performed, and the use of overlap allows one to improve the correct classification of objects, which are divided into fragments and located in adjacent blocks. The decision to merge such fragments is made based on the analysis of the metric of intersection over union and membership in the same class. The proposed approach is evaluated for 4K and 8K images. To carry out experiments, a database is prepared with objects of two classes, person and vehicle, marked in such images. Networks of the You Only Look Once (YOLO) family of the third and fourth versions are used as CNNs. A quantitative assessment of the detection efficiency of objects is performed using the mAP metric for various combinations of parameters such as the degree of threshold confidence of the CNN and the percentage of intersection of blocks in the hierarchical representation of images. The results of the investigations are presented. |
URI документа: | https://elib.bsu.by/handle/123456789/289841 |
DOI документа: | 10.1134/S1054661822010035 |
Scopus идентификатор документа: | 85126797120 |
Лицензия: | info:eu-repo/semantics/openAccess |
Располагается в коллекциях: | Статьи факультета прикладной математики и информатики |
Все документы в Электронной библиотеке защищены авторским правом, все права сохранены.