<?xml version='1.0' encoding='utf-8'?>
<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Publishing DTD v1.2 20190208//EN" "http://jats.nlm.nih.gov/publishing/1.2/JATS-journalpublishing1.dtd">
<article article-type="research-article" dtd-version="1.2" xml:lang="ru" xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink"><front><journal-meta><journal-id journal-id-type="issn">2518-1092</journal-id><journal-title-group><journal-title>Research result. Information technologies</journal-title></journal-title-group><issn pub-type="epub">2518-1092</issn></journal-meta><article-meta><article-id pub-id-type="doi">10.18413/2518-1092-2025-10-3-0-2</article-id><article-id pub-id-type="publisher-id">3901</article-id><article-categories><subj-group subj-group-type="heading"><subject>INFORMATION SYSTEM AND TECHNOLOGIES</subject></subj-group></article-categories><title-group><article-title>&lt;strong&gt;FORMATION OF INVESTMENT PORTFOLIOS USING CLUSTER ANALYSIS&lt;/strong&gt;</article-title><trans-title-group xml:lang="en"><trans-title>&lt;strong&gt;FORMATION OF INVESTMENT PORTFOLIOS USING CLUSTER ANALYSIS&lt;/strong&gt;</trans-title></trans-title-group></title-group><contrib-group><contrib contrib-type="author"><name-alternatives><name xml:lang="ru"><surname>Nazarov</surname><given-names>Timur Rafikovich</given-names></name><name xml:lang="en"><surname>Nazarov</surname><given-names>Timur Rafikovich</given-names></name></name-alternatives><email>TimNazarovya@gmail.com</email></contrib></contrib-group><pub-date pub-type="epub"><year>2025</year></pub-date><volume>10</volume><issue>3</issue><fpage>0</fpage><lpage>0</lpage><self-uri content-type="pdf" xlink:href="/media/information/2025/3/ИТ_НР_10_3_2.pdf" /><abstract xml:lang="ru"><p>The study is devoted to testing the hypothesis about the possibility of creating a diversified portfolio of stocks by clustering them according to risk and sustainability indicators. Within the framework of this problem, three clustering methods were used: the K-means method, the Gaussian mixture model, and hierarchical clustering by the Ward method. We also selected three criteria for stock clustering: Beta index, volatility, and ROE. According to the data from 2018 to 2023, clusterization was carried out for each period with the selection of the optimal number of clusters, and portfolios of financial assets were created according to predefined rules. Alternatively, stock portfolios were created for each year without prior clustering according to the rules of Markowitz portfolio theory. After that, each portfolio was tested over the next year. The results based on the metrics of profitability and risk show that the hypothesis has a right to exist. Portfolios created with pre-clustering outperformed the classic options in terms of profitability. The problem has scope for further investigation. In particular, a different set of features for stock segmentation or the use of clustering methods in combination with other machine learning or neural network algorithms can also produce high-quality results.</p></abstract><trans-abstract xml:lang="en"><p>The study is devoted to testing the hypothesis about the possibility of creating a diversified portfolio of stocks by clustering them according to risk and sustainability indicators. Within the framework of this problem, three clustering methods were used: the K-means method, the Gaussian mixture model, and hierarchical clustering by the Ward method. We also selected three criteria for stock clustering: Beta index, volatility, and ROE. According to the data from 2018 to 2023, clusterization was carried out for each period with the selection of the optimal number of clusters, and portfolios of financial assets were created according to predefined rules. Alternatively, stock portfolios were created for each year without prior clustering according to the rules of Markowitz portfolio theory. After that, each portfolio was tested over the next year. The results based on the metrics of profitability and risk show that the hypothesis has a right to exist. Portfolios created with pre-clustering outperformed the classic options in terms of profitability. The problem has scope for further investigation. In particular, a different set of features for stock segmentation or the use of clustering methods in combination with other machine learning or neural network algorithms can also produce high-quality results.</p></trans-abstract><kwd-group xml:lang="ru"><kwd>clustering</kwd><kwd>stocks</kwd><kwd>volatility</kwd><kwd>Beta index</kwd><kwd>K-means</kwd><kwd>hierarchical clustering</kwd><kwd>Gaussian mixture method</kwd><kwd>Markowitz portfolio theory</kwd></kwd-group><kwd-group xml:lang="en"><kwd>clustering</kwd><kwd>stocks</kwd><kwd>volatility</kwd><kwd>Beta index</kwd><kwd>K-means</kwd><kwd>hierarchical clustering</kwd><kwd>Gaussian mixture method</kwd><kwd>Markowitz portfolio theory</kwd></kwd-group></article-meta></front><back><ref-list><title>Список литературы</title><ref id="B1"><mixed-citation>The number of individuals with brokerage accounts on the Moscow Stock Exchange increased by 5.4 million in 2024. URL: https://www.interfax.ru/business/1003027 (accessed: 05/26/2025).</mixed-citation></ref><ref id="B2"><mixed-citation>Bayuk D., Bayuk O., Berzin D., Ivanyuk V. Practical application of clustering, classification and approximation methods based on neural networks. &amp;ndash; 1st ed. &amp;ndash; Moscow: Prometheus, 2020. &amp;ndash; 448 p.</mixed-citation></ref><ref id="B3"><mixed-citation>Dmitriev N.D. Clusterization of Russian companies in the technology sector based on market and financial indicators / N.D. Dmitriev, A.V. Tyulkova, V.V. Sorokozherdyev // Bulletin of the Chelyabinsk State University. &amp;ndash; 2024. &amp;ndash; № 10(492). &amp;ndash; Pp. 116-127.</mixed-citation></ref><ref id="B4"><mixed-citation>Tyukhova E.M. The use of cluster analysis for the formation of a securities portfolio in investment systems (roboadvisors) / E.M. Tyukhova, D.S. Sizykh // Management of large-scale systems development (MLSD&amp;#39;2019): Proceedings of the twelfth International conference, Moscow, October 01-03, 2019 / Under the general editorship of S.N. Vasiliev, A.D. Tsvirkun. Moscow: International Research Institute of Management Problems of the Russian Academy of Sciences, &amp;ndash; 2019. &amp;ndash; pp. &amp;ndash; 210-220.</mixed-citation></ref><ref id="B5"><mixed-citation>Broad Market Index. URL: https://www.moex.com/ru/index/MOEXBMI/about (accessed: 05/24/2024).</mixed-citation></ref><ref id="B6"><mixed-citation>Zhdanov I.Y. Investment assessment of projects and business: a textbook / I.Y. Zhdanov, V.Y. Zhdanov. Moscow: Prospekt Publ., &amp;ndash; 2019, &amp;ndash; 120 p.</mixed-citation></ref><ref id="B7"><mixed-citation>Borisova L.V., Sagaeva I.D. Models of optimal investment: an educational and methodical manual, Saratov. URL: http://elibrary.sgu.ru/uch_lit/1347.pdf (accessed: 05/23/2025).</mixed-citation></ref><ref id="B8"><mixed-citation>Clusterization and partial learning. URL: http://www.machinelearning.ru/wiki/images/5/52/Voron-ML-Clustering-SSL-slides.pdf (accessed: 05/22/2025).</mixed-citation></ref><ref id="B9"><mixed-citation>Takhir M.A., Ahmed A.U., Egorova L.D. On algorithms for the K-medoid problem // Actual problems of aviation and cosmonautics. &amp;ndash; 2021. &amp;ndash; No. 1. &amp;ndash; pp. 194-196.</mixed-citation></ref><ref id="B10"><mixed-citation>Dekanova K.V., Gate A.A. Methodology for forming an investment portfolio taking into account the individual characteristics of a private investor // Economics: yesterday, today, tomorrow. &amp;ndash; 2023. &amp;ndash; Volume 13. &amp;ndash; No. 7A. &amp;ndash; pp. 311-329.</mixed-citation></ref><ref id="B11"><mixed-citation>Nourahmadi M., Sadeqi H. Portfolio Diversification Based on Clustering Analysis // Iranian Journal of Accounting, Auditing and Finance. &amp;ndash; 2023. &amp;ndash; №7(3) &amp;ndash; P. 1-16.</mixed-citation></ref><ref id="B12"><mixed-citation>Szczepocki, Piotr Clustering companies listed on the Warsaw Stock Exchange according to time-varying beta // Econometrics. &amp;ndash; 2023. &amp;ndash; №23 &amp;ndash; P. 63-79.</mixed-citation></ref><ref id="B13"><mixed-citation>Lim T., Ng H. Financial Portfolio Management Based on Shaped-Based Unsupervised Machine Learning: A Dynamic Time Warping Baycenter Averaging Approach to International Markets and Periods of Downside Event Risks // The Journal of Investing. &amp;ndash; 2023. &amp;ndash; №32. &amp;ndash; P. 74-96.</mixed-citation></ref><ref id="B14"><mixed-citation>Hastie T., Tibshirani R., Friedman J. The elements of statistical learning. &amp;ndash; 12th ed. &amp;ndash; Springer, 2017. &amp;ndash; 745&amp;nbsp;p.</mixed-citation></ref><ref id="B15"><mixed-citation>Fama, E. F., &amp;amp; French, K. R. The Cross-Section of Expected Stock Returns // Journal of Finance. &amp;ndash; 1992.&amp;nbsp;&amp;ndash; №47(2). &amp;ndash; P. 427-465.</mixed-citation></ref><ref id="B16"><mixed-citation>Asness C.S., Moskowitz T., Pedersen L. Value and Momentum Everywhere // Journal of Finance. &amp;ndash; 2013.&amp;nbsp;&amp;ndash; 68. &amp;ndash; P. 929-985.</mixed-citation></ref></ref-list></back></article>