Image clustering using pretrained neural networks

A.S. Kuznetsov, E.Y. Semenov, L.D. Matrosova


The article deals with the stages of solving image clustering problems using pre-trained neural networks. Some composite solutions of the clustering problem are presented, where clustering methods are used at the last stage, and most of the work is the extraction of features and their preprocessing. The analysis of modern approaches to feature extraction from images, including classical methods of pattern recognition theory and computer vision and feature extraction methods using convolutional neural networks. The paper provides recommendations for choosing the most effective architecture for convolutional neural networks, depending on the nature of the tasks. A classification of methods for reducing the dimension of images by types: preserving the distance between points when displaying from high-dimensional to low-dimensional space; preserving the global structure of data; search for the nearest vectors in large-dimensional spaces. We consider a special method of clustering images DeepCluster, which iteratively groups the features using a standard clustering algorithm. The obtained results can serve in further research in this area, as well as in solving problems in the preparation of pre-trained models of convolutional neural networks.

Full Text:

PDF (Russian)


Quackenbush, Lindi J. "A review of techniques for extracting linear features from imagery." Photogrammetric Engineering & Remote Sensing 70.12 (2004): 1383-1392.

Kumar, Gaurav, and Pradeep Kumar Bhatia. "A detailed review of feature extraction in image processing systems." 2014 Fourth international conference on advanced computing & communication technologies. IEEE, 2014.

Vasil'ev, V.N., I. P. Gurov, and A. S. Potapov. "Matematicheskie metody i algoritmicheskoe obespechenie analiza i raspoznavanija izobrazhenij v informacionno-telekommunikacionnyh sistemah." Vserossijskij konkursnyj otbor obzornoanaliticheskih statej po prioritetnomu napravleniju" Informacionno-telekommunikacionnye sistemy 46 (2008).

Gladilin, Sergej Aleksandrovich, et al. "Postroenie ustojchivyh priznakov detekcii i klassifikacii ob"ektov, ne obladajushhih harakternymi jarkostnymi kontrastami." Informacionnye tehnologii i vychislitel'nye sistemy 1 (2014): 53-60.

Kuchuganov, Aleksandr Valer'evich. "Bioinspirirovannye algoritmy vydelenija informativnyh priznakov izobrazhenij." Izvestija Tomskogo politehnicheskogo universiteta. Inzhiniring georesursov 321.5 (2012).

De, Sourav, et al. "Dimension Reduction Using Image Transform for Content-Based Feature Extraction." Feature Dimension Reduction for Content-Based Image Identification. IGI Global, 2018. 26-40.

Zhao, Wenzhi, and Shihong Du. "Spectral–spatial feature extraction for hyperspectral image classification: A dimension reduction and deep learning approach." IEEE Transactions on Geoscience and Remote Sensing 54.8 (2016): 4544-4554.

Krizhevsky, Alex, Ilya Sutskever, and Geoffrey E. Hinton. "Imagenet classification with deep convolutional neural networks." Advances in neural information processing systems. 2012.

Simonyan, Karen, and Andrew Zisserman. "Very deep convolutional networks for large-scale image recognition." arXiv preprint arXiv:1409.1556 (2014).

He, Kaiming, et al. "Deep residual learning for image recognition." Proceedings of the IEEE conference on computer vision and pattern recognition. 2016.

Huang, Gao, et al. "Densely connected convolutional networks." Proceedings of the IEEE conference on computer vision and pattern recognition. 2017.

Benchmarks for popular CNN models. URL:

McInnes, Leland, John Healy, and James Melville. "Umap: Uniform manifold approximation and projection for dimension reduction." arXiv preprint arXiv:1802.03426 (2018).

Obzor novogo algoritma umen'shenija razmernosti UMAP. Dejstvitel'no li on luchshe i bystree, chem t-SNE?

Chan, David M., et al. "t-SNE-CUDA: GPU-Accelerated t-SNE and its Applications to Modern Data." arXiv preprint arXiv:1807.11824(2018).

Johnson, Jeff, Matthijs Douze, and Hervé Jégou. "Billion-scale similarity search with GPUs." arXiv preprint arXiv:1702.08734 (2017).

Caron, Mathilde, et al. "Deep clustering for unsupervised learning of visual features." Proceedings of the European Conference on Computer Vision (ECCV). 2018.


  • There are currently no refbacks.

Abava  Absolutech Convergent 2020

ISSN: 2307-8162