Paper
4 January 2021 Improvement of U-Net architecture for image binarization with activation functions replacement
Author Affiliations +
Proceedings Volume 11605, Thirteenth International Conference on Machine Vision; 116050Y (2021) https://doi.org/10.1117/12.2587027
Event: Thirteenth International Conference on Machine Vision, 2020, Rome, Italy
Abstract
In this work we study the effect of activation functions in a neural network. We consider how activation functions with different properties and their combination affect the final quality of the model. Due to optimization and speed performance issues with most of bounded functions that are represented by sigmoids, we propose the generalized version of SoftSign function - ratio function (rf). Its shape greatly depends on introduced degree parameter, which in theory leads to new interesting property - contraction to zero. For evaluation, we chose image binarization problem: based on UNet architecture of DIBCO-2017 winners, we conducted all experiments with replacing activation functions only. Our research has led us to the state-of-the-art results in binarization quality on DIBCO-2017 test dataset. U-Net with modified activation functions significantly outperforms all existing solutions in all metrics.
© (2021) COPYRIGHT Society of Photo-Optical Instrumentation Engineers (SPIE). Downloading of the abstract is permitted for personal use only.
Alexander V. Gayer, Alexander V. Sheshkus, Dmitri P. Nikolaev, and Vladimir V. Arlazarov "Improvement of U-Net architecture for image binarization with activation functions replacement", Proc. SPIE 11605, Thirteenth International Conference on Machine Vision, 116050Y (4 January 2021); https://doi.org/10.1117/12.2587027
Advertisement
Advertisement
RIGHTS & PERMISSIONS
Get copyright permission  Get copyright permission on Copyright Marketplace
Back to Top