Image representation is the key part of image classification, and Fisher kernel has been considered as one of the most effective image feature coding methods. For the Fisher encoding method, there is a critical issue that the single GMM only models features within a rough granularity space. In this paper, we propose a method that is named Multi-scale and Multi-GMM Pooling (MMP), which could effectively represent the image from various granularities. We first conduct pooling using the multi-GMM instead of a single GMM. Then, we introduce multi-scale images to enrich the model’s inputs, which could improve the performance further. Finally, we validate out proposal on PASCAL VOC2007 dataset, and the experimental results show an obvious superiority over the basic Fisher model.
Access to the requested content is limited to institutions that have purchased or subscribe to SPIE eBooks.
You are receiving this notice because your organization may not have SPIE eBooks access.*
*Shibboleth/Open Athens users─please
sign in
to access your institution's subscriptions.
To obtain this item, you may purchase the complete book in print or electronic format on
SPIE.org.
INSTITUTIONAL Select your institution to access the SPIE Digital Library.
PERSONAL Sign in with your SPIE account to access your personal subscriptions or to use specific features such as save to my library, sign up for alerts, save searches, etc.