Forming an ensemble of key concepts for a text with marked semantic parts (a text with a table of contents) reduces to a stratification process made up of three procedures: preparing the text, extracting key concepts from the semantic parts, and dividing the entire text into fragments related to the key concepts found. The quantitative characteristics of the ensemble (the number of words in related fragments) make it possible to solve a number of problems, including determining the predominant content of a text, calculating text proximity parameters, identifying concepts of interest to the reader, and forming fragments for training neural networks while preserving the author's style. The article briefly describes the formation procedures and gives three examples of using ensembles of key concepts obtained by stratification. The first example determines which key concepts are covered most fully (by the number of words in related fragments) in "Project Management" textbooks in English, German, French and Russian. These results can be used, for example, to justify the choice of a specific textbook. The second example calculates proximity parameters for ten Russian-language educational and methodological publications on project management, including the normalized length of the difference vector, the angle between the ensemble vectors, and the normalized integral characteristic. These results can be used in selecting materials for educational programs and individual courses. The third example finds, in the textbooks from the first example, the longest continuous text fragments by word count, which are suitable for LLM training.
Published in | American Journal of Education and Information Technology (Volume 9, Issue 1) |
DOI | 10.11648/j.ajeit.20250901.13 |
Page(s) | 19-24 |
Creative Commons |
This is an Open Access article, distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution and reproduction in any medium or format, provided the original work is properly cited. |
Copyright |
Copyright © The Author(s), 2025. Published by Science Publishing Group |
Keywords | TOC Text, Statistical Text Processing, Ensemble of Key Concepts |
| Text | Dominant key concepts | Share of words (23-25% in total) |
|---|---|---|
| English, 100% ≈ 72.1 thousand words [9] | project schedule | 6% |
| | project team members | 4% |
| | managing a project | 4% |
| | project cost | 4% |
| | project completion | 3% |
| | needed for a project | 3% |
| German, 100% ≈ 80.7 thousand words [10] | kosten | 9% |
| | et cetera | 5% |
| | soziale kompetenz | 4% |
| | dass projektmanagement | 3% |
| | werden müssen | 3% |
| French, 100% ≈ 29.7 thousand words [11] | livrables du projet | 7% |
| | méthodologie PM | 6% |
| | doit être | 5% |
| | peuvent être | 5% |
| Russian, 100% ≈ 45.5 thousand words [12] | принятие решений | 6% |
| | критический путь | 5% |
| | сетевая модель | 3% |
| | пакеты работ | 3% |
| | цели должны | 3% |
| | текущая стоимость | 3% |
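As a quick arithmetic check, the shares listed for each textbook can be summed to see how the 23-25% figure in the table header arises. The sketch below copies the percentages and total word counts from the table; the names `DOMINANT_SHARES`, `TOTAL_WORDS`, `combined_share` and `approx_words` are illustrative and do not come from the article, and the word counts it derives are only rough because the published shares are rounded to whole percent.

```python
# Shares (percent of all words) of the dominant key concepts, copied from the table.
DOMINANT_SHARES = {
    "English [9]": {
        "project schedule": 6, "project team members": 4, "managing a project": 4,
        "project cost": 4, "project completion": 3, "needed for a project": 3,
    },
    "German [10]": {
        "kosten": 9, "et cetera": 5, "soziale kompetenz": 4,
        "dass projektmanagement": 3, "werden müssen": 3,
    },
    "French [11]": {
        "livrables du projet": 7, "méthodologie PM": 6,
        "doit être": 5, "peuvent être": 5,
    },
    "Russian [12]": {
        "принятие решений": 6, "критический путь": 5, "сетевая модель": 3,
        "пакеты работ": 3, "цели должны": 3, "текущая стоимость": 3,
    },
}

# Approximate total length of each text in words, also from the table.
TOTAL_WORDS = {
    "English [9]": 72_100, "German [10]": 80_700,
    "French [11]": 29_700, "Russian [12]": 45_500,
}


def combined_share(text: str) -> int:
    """Sum of the listed shares for one text, in percent."""
    return sum(DOMINANT_SHARES[text].values())


def approx_words(text: str, concept: str) -> int:
    """Rough word count behind one share (shares are rounded, so this is approximate)."""
    return round(TOTAL_WORDS[text] * DOMINANT_SHARES[text][concept] / 100)


if __name__ == "__main__":
    for text in DOMINANT_SHARES:
        print(text, combined_share(text), "%")
```

Summing the rows gives 24% for the English and German textbooks and 23% for the French and Russian ones, so in every text the listed concepts together account for roughly a quarter of the words.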
| Characteristics of closeness | Minimum difference: value | Minimum difference: texts | Maximum difference: value | Maximum difference: texts |
|---|---|---|---|---|
| Normalized length of the difference vector (%) | 40.6 | [13] ↔ [17] | 72.7 | [17] ↔ [20] |
| Angle between ensemble vectors (degrees) | 29 | [20] ↔ [21] | 71 | [15] ↔ [19] |
| Normalized integral characteristic (%) | 54.6 | [18] ↔ [21] | 79.3 | [17] ↔ [20] |
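This page names the proximity parameters but does not give their formulas. As a minimal sketch, assuming each ensemble is represented as a mapping from key concept to the number of words in its related fragments, the angle can be computed as the arccosine of the normalized dot product. The normalization used for the difference vector here (dividing by the sum of the two vector lengths) is only one plausible choice and is an assumption, not the authors' definition. The normalized integral characteristic is left out because its definition is not stated. All function names and sample counts are hypothetical.

```python
import math


def aligned_vectors(a: dict, b: dict):
    """Place two ensembles (concept -> word count) on a shared concept axis."""
    concepts = sorted(set(a) | set(b))
    return [a.get(c, 0) for c in concepts], [b.get(c, 0) for c in concepts]


def length(v) -> float:
    """Euclidean length of a vector."""
    return math.sqrt(sum(x * x for x in v))


def angle_degrees(a: dict, b: dict) -> float:
    """Angle between two ensemble vectors, in degrees."""
    va, vb = aligned_vectors(a, b)
    denom = length(va) * length(vb)
    if denom == 0:
        raise ValueError("each ensemble needs at least one non-zero count")
    cosine = sum(x * y for x, y in zip(va, vb)) / denom
    cosine = max(-1.0, min(1.0, cosine))  # guard against rounding just outside [-1, 1]
    return math.degrees(math.acos(cosine))


def normalized_difference_percent(a: dict, b: dict) -> float:
    """Length of (a - b) divided by |a| + |b|, in percent (assumed normalization)."""
    va, vb = aligned_vectors(a, b)
    total = length(va) + length(vb)
    if total == 0:
        raise ValueError("at least one ensemble needs a non-zero count")
    diff = [x - y for x, y in zip(va, vb)]
    return 100.0 * length(diff) / total


if __name__ == "__main__":
    # Hypothetical word counts, not data from the article.
    text_a = {"critical path": 300, "decision making": 400}
    text_b = {"decision making": 400, "work packages": 300}
    print(round(angle_degrees(text_a, text_b), 1))
    print(round(normalized_difference_percent(text_a, text_b), 1))
```

Because word counts are non-negative, the angle computed this way always lies between 0 and 90 degrees, and by the triangle inequality the assumed normalized difference lies between 0% and 100%.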
| Text | Key concept | No. of words |
|---|---|---|
| [9] | requirements describe the characteristics of the final deliverable | 1656 |
| | skills that the project management | 1525 |
| [10] | module planning and project management I bis II | 1472 |
| | soziale kompetenz | 687 |
| [11] | compétences en gestion de projet | 662 |
| | méthodologie PM | 569 |
| [12] | принятие решений | 1294 |
| | критический путь | 1165 |
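Selecting the longest fragments can be sketched as follows, assuming stratification has already produced a list of continuous fragments, each tagged with its key concept. The `Fragment` type, the whitespace-based word count and the `longest_fragments` helper are hypothetical illustrations; the article's own tokenization and fragment boundaries may differ.

```python
from typing import NamedTuple


class Fragment(NamedTuple):
    concept: str  # key concept the fragment relates to
    text: str     # continuous run of text assigned to that concept


def word_count(text: str) -> int:
    """Count words by splitting on whitespace (a simplifying assumption)."""
    return len(text.split())


def longest_fragments(fragments, top_n=2):
    """Return up to top_n (concept, words) pairs, one per concept, longest first."""
    best = {}
    for frag in fragments:
        n = word_count(frag.text)
        if n > best.get(frag.concept, 0):
            best[frag.concept] = n  # keep the longest fragment seen for this concept
    ranked = sorted(best.items(), key=lambda item: item[1], reverse=True)
    return ranked[:top_n]


if __name__ == "__main__":
    # Toy fragments for illustration only, not taken from the textbooks.
    sample = [
        Fragment("critical path", "a b c"),
        Fragment("decision making", "a b c d e"),
        Fragment("critical path", "a b c d"),
        Fragment("network model", "a"),
    ]
    print(longest_fragments(sample))
```

Keeping one entry per concept mirrors the layout of the table, which lists the two longest fragments for each textbook by concept and word count.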
LLM | Large Language Model |
TOC | Table of Contents |
[1] | The Chicago Manual of Style (17th ed.). University of Chicago Press. 2017. ISBN 9780226287058. LCCN 2017020712. OCLC 1055308068. |
[2] | Youri Arzumanyan, Mikhail Wolfson, Alexander Sotnikov, Arian Zakharov, Using quantitative methods to analyze the educational program, 9th International Conference on Advanced Infotelecommunications ICAIT, 2020, Conference Proceedings, vol. 2, pp. 601-605. Available from: http://www.sut.ru/doci/nauka/1AEA/APINO/9-APINO-2020,%20%D0%A2.2.pdf |
[3] | Youri Arzumanyan, Arian Zakharov, Yana Sokolova, Comparative analysis of information characteristics of academic disciplines, 9th International Conference on Advanced Infotelecommunications ICAIT, 2020, Conference Proceedings, vol. 2, pp. 606-609. Available from: http://www.sut.ru/doci/nauka/1AEA/APINO/9-APINO-2020,%20%D0%A2.2.pdf |
[4] | Youri Arzumanyan, Mikhail Wolfson, Alexander Sotnikov, Galia Katasonova, Arian Zakharov, Features of modeling educational programs in the development of educational trajectories for training IT specialists, XIX conference "Teaching information technologies in the Russian Federation", Moscow, 19-20 May 2021, Conference Proceedings, pp. 294-295. |
[5] | Youri Arzumanyan, Mikhail Wolfson, Galia Katasonova, Alexander Sotnikov, Arian Zakharov, Models of educational programs for optimization problems in the design of individual educational trajectories, X International Conference on Advanced Infotelecommunications ICAIT, 2021, Conference Proceedings, vol. 3, pp. 330-335. Available from: https://www.sut.ru/doci/nauka/1AEA/APINO/10-APINO-2021.%20T.3.pdf |
[6] | Youri Arzumanyan, Mikhail Wolfson, Galia Katasonova, Arian Zakharov, Alexander Sotnikov, Vector representation of educational programs, XI International Conference on Advanced Infotelecommunications ICAIT, 2022, Conference Proceedings, vol. 3, pp. 557-561. Available from: https://www.sut.ru/doci/nauka/1AEA/APINO/11-APINO-2022.%20%D0%A2.3.pdf (accessed 19 January 2025) |
[7] | Youri Arzumanyan, Mikhail Wolfson, Arian Zakharov, Alexander Sotnikov, Comparative Analysis of SPbSUT Educational Programs in 2022, XII International Conference on Advanced Infotelecommunications ICAIT, 2023, Conference Proceedings, vol. 4, pp. 15-21. |
[8] | Youri Arzumanyan, Mikhail Wolfson, Arian Zakharov, Alexander Sotnikov, Methods of analysis and design of educational programs using the tools of the Ensemble of Key Concepts, Information Processes: Conceptual Basis of Digital Transformation of the Economy, St. Petersburg, 2024, pp. 44-62. |
[9] | Watts, A. Project Management - 2nd Edition. Victoria, B.C.: BCcampus. (Website) Available from: https://opentextbc.ca/projectmanagement/ (accessed 19 January 2025) |
[10] | Kluge, F. Projektmanagement in Praxis und Lehre der (Landschafts) Architektur … ein wenig Chaos gehört dazu [Project management in practice and teaching of (landscape) architecture … a little chaos is part of it]. Münster (Westfalen), 2008. - 286 p. (Book) Available from: https://publications.rwth-aachen.de/record/51297/files/Kluge_Florian.pdf (accessed 19 January 2025) |
[11] | Le Guide de la Méthodologie de Gestion de Projet PM² 3.0.1 [The PM² Project Management Methodology Guide]. Commission européenne, Centre d’Excellence en Gestion de Projets (CoEPM²), Bruxelles, Luxembourg, March 2021. - 148 p. (Book) Available from: https://www.pm2alliance.eu/wp-content/uploads/2023/10/Methodologie-de-gestion-de-projet-pm%C2%B2-NO0921037FRN_c.pdf (accessed 19 January 2025) |
[12] | Abramov N. V., Motovilov N. V., Naumov N. D. Upravlenie proektami [Project Management]: Textbook - Nizhnevartovsk, 2008. - 197 p. (Book) Available from: https://files.student-it.ru/download/275982 (accessed 19 January 2025) |
[13] | Aleshin A. V., An'shin V. M., Bagrationi K. A. et al. Upravlenie proektami: fundamental'nyj kurs [Project Management: Fundamental Course]: Textbook, ed. by V. M. Anshin, O. N. Ilyina, National Research University “Higher School of Economics” - Moscow: Publishing House of the Higher School of Economics, 2013. - 620 p. (Book) Available from: https://publications.hse.ru/mirror/pubs/share/folder/nvs1ctzplo/direct/148559151.pdf (accessed 19 January 2025) |
[14] | Boronina L. N., Senuk Z. V. Osnovy upravleniya proektami [Fundamentals of Project Management]: Textbook, Ministry of Education and Science of the Russian Federation, Ural Federal University - Yekaterinburg: Ural Federal University Press, 2015. - 112 p. (Book) Available from: https://elar.urfu.ru/bitstream/10995/30881/1/978-5-7996-1416-4.pdf (accessed 19 January 2025) |
[15] | Denisenko V. I. Upravlenie proektami [Project Management]: Textbook, ed. by Dr. of Technical Sciences, Prof. V. I. Denisenko, Dr. of Economics, Prof. N. M. Filimonova, A. G. and N. G. Stoletov Vladimir State University - Vladimir: VlSU Publishing House, 2015. - 108 p. (Book) Available from: https://dspace.www1.vlsu.ru/bitstream/123456789/4337/1/01451.pdf (accessed 19 January 2025) |
[16] | Ivasenko A. G., Nikonova Ya. I., Sizova A. O. Upravlenie proektami [Project Management]: Textbook - Novosibirsk: SGGA, 2007. - 202 p. (Book) |
[17] | Mazur I. I., Shapiro V. D., Ol'derogge N. G., Polkovnikov A. V. Upravlenie proektami [Project Management]: Textbook for students studying in the specialty “Management of organization”, ed. by I. I. Mazur and V. D. Shapiro, 6th ed. - Moscow: Omega-L Publishing House, 2010. - 960 p. (Book) Available from: https://topuch.com/download/i-i-mazur-v-d-shapiron-g-olederogge-a-v-polkovnikovupravleniep.pdf (accessed 19 January 2025) |
[18] | Osipov D. V. Upravlenie proektami [Project Management]: Textbook for Masters in Management - Moscow: RUT (MIIT), 2017. - 170 p. (Book) |
[19] | Strelina E. N. Upravlenie proektami [Project Management]: Textbook for the enlarged group of training directions and specialties 38.00.00 Economics and management - Donetsk: DONNU, 2022. - 310 p. (Book) Available from: http://repo.donnu.ru:8080/jspui/bitstream/123456789/4962/1/4371.pdf (accessed 19 January 2025) |
[20] | Testina Ya. S., Chumakov V. N. Upravlenie proektami [Project Management]: Textbook for Universities - Gatchina: GIEFPT Publishing House, 2023. - 69 p. (Book) Available from: https://sovman.ru/wp-content/uploads/2023/09/ss125_compressed.pdf (accessed 19 January 2025) |
[21] | Cycarova N. M. Upravlenie proektami [Project Management]: Textbook, Ulyanovsk State Technical University - Ulyanovsk: UlGTU, 2021. - 105 p. (Book) Available from: https://lib.ulstu.ru/venec/disk/2021/21.pdf (accessed 19 January 2025) |
APA Style
Arzumanyan, Y., Wolfson, M., Sotnikov, A., Zakharov, A. (2025). Stratification the Text with Table of Contents. American Journal of Education and Information Technology, 9(1), 19-24. https://doi.org/10.11648/j.ajeit.20250901.13
ACS Style
Arzumanyan, Y.; Wolfson, M.; Sotnikov, A.; Zakharov, A. Stratification the Text with Table of Contents. Am. J. Educ. Inf. Technol. 2025, 9(1), 19-24. doi: 10.11648/j.ajeit.20250901.13
AMA Style
Arzumanyan Y, Wolfson M, Sotnikov A, Zakharov A. Stratification the Text with Table of Contents. Am J Educ Inf Technol. 2025;9(1):19-24. doi: 10.11648/j.ajeit.20250901.13
@article{10.11648/j.ajeit.20250901.13,
  author = {Youri Arzumanyan and Mikhail Wolfson and Alexander Sotnikov and Arian Zakharov},
  title = {Stratification the Text with Table of Contents},
  journal = {American Journal of Education and Information Technology},
  volume = {9},
  number = {1},
  pages = {19-24},
  doi = {10.11648/j.ajeit.20250901.13},
  url = {https://doi.org/10.11648/j.ajeit.20250901.13},
  eprint = {https://article.sciencepublishinggroup.com/pdf/10.11648.j.ajeit.20250901.13},
  year = {2025}
}
TY  - JOUR
T1  - Stratification the Text with Table of Contents
AU  - Youri Arzumanyan
AU  - Mikhail Wolfson
AU  - Alexander Sotnikov
AU  - Arian Zakharov
Y1  - 2025/04/14
PY  - 2025
N1  - https://doi.org/10.11648/j.ajeit.20250901.13
DO  - 10.11648/j.ajeit.20250901.13
T2  - American Journal of Education and Information Technology
JF  - American Journal of Education and Information Technology
JO  - American Journal of Education and Information Technology
SP  - 19
EP  - 24
PB  - Science Publishing Group
SN  - 2994-712X
UR  - https://doi.org/10.11648/j.ajeit.20250901.13
VL  - 9
IS  - 1
ER  -