Paper
13 June 2024 A video object removal method based on pointwise spatial attention mechanism
Zhanli Li, Daren Wang, Qi Mu
Author Affiliations +
Proceedings Volume 13180, International Conference on Image, Signal Processing, and Pattern Recognition (ISPP 2024); 131802M (2024) https://doi.org/10.1117/12.3034087
Event: International Conference on Image, Signal Processing, and Pattern Recognition (ISPP 2024), 2024, Guangzhou, China
Abstract
In the advancing fields of computer vision and deep learning, video dynamic object removal technology is an important research area. The goal of this technology is to remove specific dynamic objects from a continuous video sequence while filling in the removed background. This paper will explore a new video dynamic object removal method based on feature pyramid and point by point spatial attention mask extraction network. This method can detect dynamic objects and fill the holes caused by dynamic object removal in the video. We have designed and implemented an end-to-end dynamic target removal system based on the above methods. The system first applies RCNN for dynamic target detection and generates corresponding masks. Then, a generative adversarial network is used to complete video restoration of the removed target area. In addition, the system focuses on optimizing performance to ensure processing speed and result quality. The experimental results of this article indicate that this method has high accuracy and effectiveness in removing and repairing dynamic targets in videos. This indicates that this new visual technology method has broad rule-based capabilities in handling moving objects and repairing their backgrounds. In the future, this technology can be widely applied in various video processing and repair projects, such as video editing, security monitoring, and other fields.
(2024) Published by SPIE. Downloading of the abstract is permitted for personal use only.
Zhanli Li, Daren Wang, and Qi Mu "A video object removal method based on pointwise spatial attention mechanism", Proc. SPIE 13180, International Conference on Image, Signal Processing, and Pattern Recognition (ISPP 2024), 131802M (13 June 2024); https://doi.org/10.1117/12.3034087
Advertisement
Advertisement
RIGHTS & PERMISSIONS
Get copyright permission  Get copyright permission on Copyright Marketplace
KEYWORDS
Video

Detection and tracking algorithms

Convolution

Feature extraction

Semantics

Image segmentation

Back to Top