Directory

Maksym Vakulenko - Academia.edu

Skip to main content

Maksym Vakulenko

Followers

577

Following

18

Mentions

6

Public Views

Oleksandr Ishchenko

National Academy of Sciences of Ukraine

Johannes Gutenberg-Universität Mainz

Nataliya Tsisar

Humboldt Universität zu Berlin

Olga Mazurkevich

MGU

Ulana Holovatch

Ukrainian Catholic University

Rostyslav Paranko

Ivan Franko Lviv National University

Надежда Педченко

InterestsView All (12)

Uploads

Papers by Maksym Vakulenko

Transliteration of Non-Latin Texts: From Everyday Practice to Linguistic Technologies

Proceedings of the World Conference on Innovative Research in Education, 2024

This paper discusses various transcoding systems that convert non-Latin texts into Latin script. ... more This paper discusses various transcoding systems that convert non-Latin texts into Latin script. Particularly significant is the Romanization of Slavic languages. The Latinization systems generally fall into two classes: those based on the transliteration approach and those based on practical transcription. During transliteration, the main attention is paid to simple correspondence (mutual unambiguity) between original and converted characters that allows for text reversibility, i. e. the ability to restore the original text after re-transliteration. During practical transcription, the primary concern is the sound of words. In the last scenario, it is not always possible to restore the original text. The significance of transliteration extends to historical texts written in non-Latin scripts as well. Latinization systems are broadly utilized in multilingual Natural Language Processing tasks, resulting in their wider use and enhanced need.

Positional Variations of Ukrainian Back Vowel Formants

Proceedings of the 8th International Conference on Modern Research in Social Sciences (ICMRSS), 2024

The purpose of this paper is to study variations in the location of formants of Ukrainian back vo... more The purpose of this paper is to study variations in the location of formants of Ukrainian back vowels caused by different phonetic features. To do so, the oscillograms and two-dimensional spectrograms of recorded sounds in speech flow were analyzed. The corresponding invariant acoustic parameters being the ratios between frequencies of the formants rising due to the tube resonance are determined. It is shown that different phonetic environments cause different formant configurations in vowels which is a case of sound assimilation. The following phonetic features were found to give rise to variations of Ukrainian back vowels: coda, distant assimilation by the next or preceding vowel, and progressive assimilation by the preceding hard or palatalized consonant. In particular, high vowels make the other vowels in the phonetic word move higher, and the front ones make the back ones move forward. Such formant shifts are more pronounced in unstressed positions and especially in a coda.

Transliteration

Encyclopedia of Slavic Languages and Linguistics Online, Editor-in-Chief: Marc L. Greenberg, General Editor: Lenore A. Grenoble, 2024

Several transcoding systems aim to convert non-Latin texts in various languages into Latin script... more Several transcoding systems aim to convert non-Latin texts in various languages into Latin script. This process, called “romanization” or “latinization,” is important for Slavic languages. In general, romanization systems fall into two classes: those based on transliteration and those based on practical transcription. With transliteration, attention is paid to simple correspondence (mutual unambiguity) between the original and the converted characters, allowing for text reversibility, i.e., restoration of the original text after reverse transliteration. With practical transcription, attention is paid to the sound of words in the original or in some other language (e.g., English, French). In the latter case, the original text cannot always be restored. For Slavic languages, transliteration is also used for historical texts written in Cyrillic, Glagolitic, and Arabic scripts. Latinization systems are widely applied in multilingual natural-language-processing systems, resulting in their wider use and demand for their application. The present entry also addresses some issues in the history of latinization and uses the example of Ukrainian as an illuminating case study.

Towards Ukrainian WordNet: Incorporation of an Existing Thesaurus in the Domain of Physics

Proceedings of the 19th Conference on Natural Language Processing (KONVENS 2023), 2023

In this paper, we represent the first version of the Ukrainian wordnet-Ukrajinet 1.0. It contains... more In this paper, we represent the first version of the Ukrainian wordnet-Ukrajinet 1.0. It contains 3,360 sets of full synonyms in the field of physics, consisting of 8,700 words. This knowledge base will help incorporate the Ukrainian language into multilingual scenarios of Natural Language Processing that need information about lexical-semantic relations.

Normalization of Ukrainian Letters, Numerals, and Measures for Natural Language Processing (preprint version)

Digital Scholarship in the Humanities, 2022

There is still little linguistic data on the Ukrainian language in the context of Natural Languag... more There is still little linguistic data on the Ukrainian language in the context of Natural Language Processing. At the same time, in view of real perspectives of European integration of Ukraine, its state language will soon probably become one of the European Community languages. This fact draws more and more interest to the possible inclusion of the Ukrainian language in multilingual Natural Language Processing scenarios. This article aims to provide some important data that is necessary for text-to-speech conversion of the Ukrainian language. Relying on Ukrainian phonetics and grammar, we formulate the basic rules governing the normalization of Ukrainian letters, numerals, and measures, that help obtain Ukrainian speech data. The results are presented in tables and strings convenient for coding and further implementation into linguistic tools. These rules are necessary for the text-to-speech tasks arising in various Natural Language Processing tasks associated with speech synthesis, machine translation, information retrieval, etc.

Deep Contextual Disambiguation of Homonyms and Polysemants (preprint version)

Digital Scholarship in the Humanities, 2022

A new metric method for word sense disambiguation is put forward, being formulated within a pheno... more A new metric method for word sense disambiguation is put forward, being formulated within a phenomenological analogy to the wave function of an observable quantity in quantum mechanics where the actual meaning of a multivalued word (a homonym or a polysemant) is determined by its context. The choice of the actualized sense is determined by the minimal semantic distance between the semantics of possible word senses and that of the context, where the meanings of the word in question and the context follow from their semantic fields based on lexicographic hyperchains. In the contrast to the common models, our method accounts for the semantic polarity. The formulated method showed good results in disentangling polysemy, which was not possible to achieve within the BERT-based contextualized embedding approach.

Ізоморфна латинізація староукраїнських і новоукраїнських текстів: від історичної писемної спадщини до сучасних лінгвістичних технологій

Balcania et Slavia 2, 2022

The development of information and linguistic tools for natural language processing raises, in pa... more The development of information and linguistic tools for natural language processing raises, in particular, the need for automated systems of simple-correspondent romanization of non-Latin alphabets, which would ensure full equivalence of source and transliterated texts. In order to preserve the historical written heritage, such systems should also be suitable for alphabets that have been used during the historical development of the language.
This article analyzes the Old Ukrainian and New Ukrainian alphabets and proposes a system of their isomorphic (simple-correspondent) transliteration into the Latin script, which is necessary for the effective inclusion of Ukraine in international cooperation in the field of information and linguistic technologies, preservation of historical written monuments, development of multilingual corpus linguistics and computational lexicography. Particular attention is paid to the letters that, during the historical development of the Ukrainian language, have had different readings, particularly <г>, <є>, <и>, <ъ>, <ь>.
Keywords: Ukrainian alphabets, Cyrillic-Latin transliteration, isomorphic latinization, Old Ukrainian language, New Ukrainian language, historical written monuments, linguistic technologies.

Semantic Comparison of Texts by the Metric Approach (preprint version)

Digital Scholarship in the Humanities, 2022

A novel approach to the semantic comparison of texts based on the metric method to calculate sema... more A novel approach to the semantic comparison of texts based on the metric method to calculate semantic distances between lexical units is put forward. The supplementary semantic information is provided through semes of the words composing the texts, or through their semantic fields. The proposed method takes into account semantic polarity and yields, for two paraphrase sentences, more feasible results than the conventional approaches based on word occurrences. The described approach may be useful for linguistic theory as well as for a variety of Natural Language Processing tasks based on supervised learning that require semantic information: computer lexicography, semantic analysis, information search and retrieval, document classification, text summarization, and understanding, machine translation, and others.

Використання національного транслітераційного стандарту ДСТУ 9112:2021 в інформаційному забезпеченні бібліотечної роботи

Бібліотека. Наука. Комунікація. Інноваційні трансформації ресурсів і послуг: матеріали Міжнар. наук. конф. (04 жовтня 2022 року). Київ, 2022

The national transliteration standard DSTU 9112:2021 ensures the implementation of the basic prin... more The national transliteration standard DSTU 9112:2021 ensures the implementation of the basic principles of transliteration and complete equivalence of the original and transliterated texts. It is necessary for library work and beyond, especially in view of the euro integration of Ukraine.
Національний транслітераційний стандарт ДСТУ 9112:2021 забезпечує виконання основних принципів транслітерації та цілковиту еквівалентність вихідного й транслітерованого текстів. Євроінтеграція України підвищує його важливість не тільки для бібліотечної роботи, а й для інших сфер.
Ключові слова: взаємнооднозначна транслітерація, принципи транслітерації, транслітераційний стандарт, бібліотечна робота, євроінтеграція України

A Tool for Scholarly Phonemic Transcription of Ukrainian Texts

Proceedings of the 2022 12th International Conference on Advanced Computer Information Technologies (ACIT), 2022

This article aims to present an instrument for phonemic transcription of Ukrainian texts that is ... more This article aims to present an instrument for phonemic transcription of Ukrainian texts that is necessary for phonetic studies, lexicographic work, and in text-to-speech conversion, machine translation, information retrieval, automatic speech recognition, language model transfer, and other natural language processing tasks. The tool provides an automatic scholarly transcription of Ukrainian texts into the International Phonetic Alphabet (IPA) and the Extended Speech Assessment Methods Phonetic Alphabet (X-SAMPA).

Unified Parametrization of Phonetic Features and Numerical Calculation of Phonetic Distances between Speech Sounds (preprint version)

Journal of Quantitative Linguistics, 2022

A metric method to numerically measure phonetic and phonemic distances, or contrasts, between spe... more A metric method to numerically measure phonetic and phonemic distances, or contrasts, between speech sounds is put forward. The feature values of the compared phones taken from the standard IPA charts, are treated as independent parameters that give rise to corresponding Euclidean distances. As an illustration, the general phone set is mapped to Ukrainian phonemes. The proposed model agrees well with the historical linguistic facts, experimental phonetic data, and with general intuition. The described approach may find its due applications in various fields of linguistics and speech technologies, including historical and typological linguistics, language acquisition, phonetic studies, computational phonology, machine translation, information retrieval, text-to-speech.

On the grammatical restrictions of the use of feminatives in the Ukrainian language.

In: Arghumenty suchasnoji filologhiji. Obraz zhinky: zhinoche, feministsjke, feminne [Arguments of modern philology. An image of a woman: womannish, feminist, feminine]. Ed. By Prof. S. K. Kryvoruchko. Materials of the 2nd International scientific conference, 7-8 April 2022. Kharkiv, 2022

This article is focused on the grammatical and social aspects of the use of feminatives in the Uk... more This article is focused on the grammatical and social aspects of the use of feminatives in the Ukrainian language. It was shown that the unreasonable use of feminatives may result in a number of inconsistencies leading to violation of grammatical and phonological rules and to the degradation of the language expressive ability. Helping little to make a woman more visible, this practice leaves no room to correctly denote non-binary persons and personified objects. It was shown also that the feminatives with the final «-kynja» are questionable from the phonological view. To resolve the grammatical issues resulting from the feminative usage, we propose to recognize the general gender that, similarly to the common one, is a complex gender. In contrast to the common gender, the general gender combines simple genders by means of the logical «OR» instead of the «AND». The corresponding names should be used in place of the masculine nouns in the generalizing function.

DSTU 9112: 2021 / ДСТУ 9112:2021

Cyrillic-Latin transliteration and Latin-Cyrillic retransliteration of Ukrainian texts. Writing rules, 2022

From 01 April 2022, the national standard DSTU 9112: 2021 comes into force: «Cyrillic-Latin trans... more From 01 April 2022, the national standard DSTU 9112: 2021 comes into force: «Cyrillic-Latin transliteration and Latin-Cyrillic retransliteration of Ukrainian texts. Writing rules». The standard was proposed and developed by specialists of the State Scientific and Technical Library of Ukraine and the Technical Committee 144 "Information and Documentation".
This standard applies not only to bibliographic records but also to all Ukrainian texts from the 10th century to the present day, including proper names. So, we have every legal reason to reproduce the Ukrainian letter "ї" not in the Russian manner (Kyiv, Ukraina), but according to the history, phonetics, and grammar of the Ukrainian language: Kyïv, Ukraïna (system A), Kyjiv, Ukrajina (system B). We have the transliteration rules that ensure the complete equivalence of Cyrillic and Latin writings. This standard is highly important given recent EU decisions on Ukraine, as Ukrainian will soon, along with other European languages, take its rightful place in multilingual natural language processing scenarios, including machine translation.
Slava Ukraïni!

Від 01.04.2022 набуває чинності національний стандарт ДСТУ 9112:2021 «Кирилично-латинична транслітерація і латинично-кирилична ретранслітерація українських текстів. Правила написання». Стандарт запропонували й розробили фахівці Державної науково-технічної бібліотеки України та Технічного комітету 144 «Інформація і документація».
Цей стандарт поширюється не тільки на бібліографічні записи, а й на всі українські тексти від X століття до наших днів, включаючи власні назви. Отже, маємо всі законні підстави відтворювати літеру «ї» не за російським зразком (Kyiv, Ukraina), а так, як це відповідає історії, фонетиці та граматиці української мови: Kyïv, Ukraïna (система А), Kyjiv, Ukrajina (система Б). Маємо транслітераційні правила, які забезпечують повну еквівалентність кириличного й латиничного записів. Значення цього стандарту особливо зростає з огляду на недавні рішення Євросоюзу щодо України, адже невдовзі українська мова, нарівні з іншими європейськими мовами, займе гідне місце у сценаріях багатомовного опрацювання природної мови, зокрема машинного перекладу.
Slava Ukraïni!

From Semantic Metrics to Semantic Fields

Proceedings of the 2021 IEEE 16th International Conference on Computer Science and Information Technologies (CSIT), 22–25 September 2021, Lviv, Ukraine., Dec 27, 2021

A new approach to build semantic fields based on the metric method to calculate semantic distance... more A new approach to build semantic fields based on the metric method to calculate semantic distances between lexical units is put forward. The general case of different weights of semes contributing to the word meaning is considered. It is assumed that the word semes are fed from their definitions or explanations available in a sufficiently large and comprehensive dictionary. We repeatedly apply the formula for semantic representation of a given word treating its semes as new words having their own semes, thus forming a lexicographic semantic tree. Due to hypercycles rising in real dictionaries, the number of iterations is finite which allows one to exactly calculate the weights of semes and, henceforth, to build semantic fields. It is demonstrated that the comprehensive componential analysis of the word should be based on the semantic fields. The described approach may be useful in a variety of Natural Language Processing tasks based on supervised learning that require semantic information: computer lexicography, semantic analysis, information search and retrieval, Machine Translation, etc.

Calculation of Phonetic Distances between Speech Sounds (Preprint Version)

Vakulenko, Maksym. 2019. “Calculation of Phonetic Distances between Speech Sounds.” In: Journal of Quantitative Linguistics [WoS, Scopus. 2018 impact factor 0.824]. Published online: October 23 , 2019

A new formalism to numerically measure phonetic differences between speech sounds treating featur... more A new formalism to numerically measure phonetic differences between speech sounds treating feature values of the compared phones as independent parameters that give rise to corresponding Euclidean distances, is put forward. The articulatory and acoustic methods within this formalism were compared, where the corresponding results display good agreement. The more reliable and more universal character of the acoustic approach is provided by robust and precise acoustic parameters used therein. The theoretical model and the findings of this article comply also with the experimental phonetic results. The proposed approach contributes to formalization of the procedure of phone comparison and mapping needed for automatic text and speech processing.

Key words: phonetic distance measurement, speech sounds comparison, phonetic feature values, robust parameters, phone mapping, IPA charts.

Науковий підхід до передачі англійських наукометричних термінів українською мовою.

Вакуленко М. О. Науковий підхід до передачі англійських наукометричних термінів українською мовою. Бібліотека. Наука. Комунікація : актуальні тенденції у цифрову епоху : матеріали Міжнар. наук. конф. (8-10 жовтня 2019 р.) : у 2-х т. Київ, 2019. Т. 1. 466 с. С. 413-417, 2019

A scholarly approach to the development and normalization of the system of Ukrainian scientometri... more A scholarly approach to the development and normalization of the system of Ukrainian scientometric terms as an integral part of the national terminology system is formulated. This approach is based on the application of methods of terminology-science and on the apt term features. Practical recommendations for the translation of some English scientometric terms are given. In particular, it is recommended to translate the English term Hirsch index as the “індекс Хірша”.
Keywords: scientometric terms, apt term features, terminological translation, phonetic analysis, voiceless vocoid, Ukrainian national term system.

У статті сформульовано науковий підхід до розвитку та унормування системи українських наукометричних термінів як невід'ємної частини національної терміносистеми. Цей підхід базується на застосуванні методів науки термінології та на ознаках вдалого терміна. Надано практичні рекомендації щодо перекладу деяких англійських наукометричних термінів. Зокрема, рекомендовано перекладати термін «Hirsch index» як «індекс Хірша».
Ключові слова: наукометричні терміни, ознаки вдалого терміна, термінологічний переклад, фонетичний аналіз, глухий вокоїд, українська національна терміносистема.

Ukrainian Consonant Phones in the IPA Context with Special Reference to /v/ and /gh/. In: Linguistica online 22: 36-61.

The acoustic and articulatory properties of Ukrainian consonant phones were investigated, and a f... more The acoustic and articulatory properties of Ukrainian consonant phones were investigated, and a full set of relevant IPA notations was proposed for these and compiled in a table. Acoustic correspondence of Ukrainian phones to those appearing in European languages was analyzed and discussed. Special attention was paid to the phonemes /v/ (represented in Cyrillic script as "/в/") and /gh/ (ren-dered in Cyrillic script as "/г/") that cause the most difficulties in their description. In particular, our experiments and observations suggest that a standard Ukrainian phoneme /v/ is realised as labiodental fricatives [v] and [v j ] before vowels and also as sonorant bilabial approximants [β̞ β̞ ˛ β̞ ɔ] between a vowel and a consonant, in the initial position before consonants and after a vowel at the end of a word, and sometimes is devoiced to [v̥ ] in the coda after a voiceless consonant. In some utterances after a vowel (before a consonant and in the coda), a strongly rounded bilabial approximant [β̞ ɔ] may approach a non-syllabic semivowel [ṷ]. These conclusions are in good agreement with the consonantal status of the Ukrainian language and with the general tendencies of sound combinations in the world languages. The findings of this research contribute to better understanding of Ukrain-ian and its special features in comparison with other world languages that may have substantial practical use in various phonetic and translation studies, as well as in modern linguistic technologies aimed at artificial intelligence development, machine translation incorporating text-to-speech conversion, automatic speech analysis, recognition and synthesis, and in other areas of applied linguistics.

Міфи і реалії правописного процесу 1928 року: документальні свідчення // Наукові праці Національної бібліотеки України імені В. І. Вернадського (Слов’янські обрії) : збірник наукових праць. Київ, 2019. Вип. 51(10). 320 с. С. 5-21.

Міфи і реалії правописного процесу 1928 року: документальні свідчення, 2019

An attempt to analyze unclear facts and processes that took place during adoption of the Ukrainia... more An attempt to analyze unclear facts and processes that took place during adoption of the Ukrainian spelling of 1928, is made. A previously commonly unknown letter of the State Spelling Commission Presidium member is taken into account. This letter states that Mykola Skrypnyk, Head of the Spelling Commission, repeatedly acted in the unfair manner in order to adopt unjustified and «revolutionary» changes to the Ukrainian spelling: he used to involve stranger persons to vote for the spelling, insisted on the ill-grounded expansion of the use of «ґ» in spite of the Ukrainian language practice, entered into Presidium a person who was not an expert in linguistics, and did not invite a Presidium member to the sessions. The author of this letter expresses displeasure by these actions and estimates the changes to the Ukrainian spelling carried out by Mykola Skrypnyk, as a «revolution». It is conjectured that there are no grounds to treat the Pravopys-1928 as legitimate and scholarly justified.

У статті зроблено спробу проаналізувати неоднозначні факти і процеси, що відбувалися під час прийняття українського правопису 1928 року, з урахуванням невідомого широкому загалу архівного документа – листа члена Президії Державної Правописної Комісії. Виявлено численні порушення регламенту в правописній процедурі, що свідчить про відсутність вагомих підстав уважати Правопис 1928 року легітимним і науково обґрунтованим.
Долю Правопису 1928 року фактично вирішувало не 80 кваліфікованих мовознавців, а лише двоє осіб: нарком освіти Микола Скрипник і товариш Антін Приходько. Перший фігурує в своїх анкетних даних як "професійний революціонер" із Донбасу, а другий – як юрист із Кубані.

Ukrainian vowel phones in the IPA context. In: Govor 35 (2): 189-214.

Acoustic and articulatory properties of Ukrainian vowels are investigated in this study and a ful... more Acoustic and articulatory properties of Ukrainian vowels are investigated in this study and a full set of relevant IPA notations are proposed. The notations are shown in the vowel diagram and the table. The results of the earlier acoustic invariant speech analysis based on special software, auditory and spectrum analysis were used and the results are discussed in the context of general and Ukrainian phonetic laws governing language evolution and acoustic properties of non-stressed vowels in relation to their stressed cognates. Such combined approach resulted in a more detailed vowel inventory than proposed heretofore. The findings of this research contribute to better understanding of Ukrainian language and its special features in comparison with other world languages that may have substantial practical use in various phonetic and translation studies, as well as in modern linguistic technologies aimed at artificial intelligence development, machine translation incorporating text-to-speech conversion, automatic speech analysis, recognition and synthesis, and in other areas of applied linguistics.

Calculation of semantic distances between words: From synonymy to antonymy (AOM)

Journal of Quantitative Linguistics, 2019

A new approach to numerically measure the semantic distances between lexical units (words and col... more A new approach to numerically measure the semantic distances between lexical units (words and collocations) based on the geometric analogies and analytical calculations, is put forward. Having considered the cases of equal and different weights of semes, we obtained exact algebraic formulas describing different levels of the meanings proximity, ranging from absolute synonymy to full antonymy. It was emphasized that absolute synonymy arises when the compared units contain equal numbers of semes that fully coincide and have equal weights in the corresponding pairs. Calculation of the semes weights helps to locate the unit more precisely on the semantic sphere. It was shown that the level of synonymy and antonymy decreases if different semes are accentuated, while the semantic distance between the units without identical semes cannot be influenced by seme boosting. It was observed that depending on the context, a word may wander over this sphere, thus modifying its lexical semantic relations with other units. As the proposed approach contributes to formalization of the units comparison procedure, it is advisable for relevant automatic tools, particularly for WordNets and FrameNet. The obtained results may be useful for various linguistic and associated studies including automatic text analysis and processing, computer lexicography, information search and retrieval, machine translation and other NLP applications that are related to the artificial intelligence problem.

Transliteration of Non-Latin Texts: From Everyday Practice to Linguistic Technologies

Proceedings of the World Conference on Innovative Research in Education, 2024

This paper discusses various transcoding systems that convert non-Latin texts into Latin script. ... more This paper discusses various transcoding systems that convert non-Latin texts into Latin script. Particularly significant is the Romanization of Slavic languages. The Latinization systems generally fall into two classes: those based on the transliteration approach and those based on practical transcription. During transliteration, the main attention is paid to simple correspondence (mutual unambiguity) between original and converted characters that allows for text reversibility, i. e. the ability to restore the original text after re-transliteration. During practical transcription, the primary concern is the sound of words. In the last scenario, it is not always possible to restore the original text. The significance of transliteration extends to historical texts written in non-Latin scripts as well. Latinization systems are broadly utilized in multilingual Natural Language Processing tasks, resulting in their wider use and enhanced need.

Positional Variations of Ukrainian Back Vowel Formants

Proceedings of the 8th International Conference on Modern Research in Social Sciences (ICMRSS), 2024

The purpose of this paper is to study variations in the location of formants of Ukrainian back vo... more The purpose of this paper is to study variations in the location of formants of Ukrainian back vowels caused by different phonetic features. To do so, the oscillograms and two-dimensional spectrograms of recorded sounds in speech flow were analyzed. The corresponding invariant acoustic parameters being the ratios between frequencies of the formants rising due to the tube resonance are determined. It is shown that different phonetic environments cause different formant configurations in vowels which is a case of sound assimilation. The following phonetic features were found to give rise to variations of Ukrainian back vowels: coda, distant assimilation by the next or preceding vowel, and progressive assimilation by the preceding hard or palatalized consonant. In particular, high vowels make the other vowels in the phonetic word move higher, and the front ones make the back ones move forward. Such formant shifts are more pronounced in unstressed positions and especially in a coda.

Transliteration

Encyclopedia of Slavic Languages and Linguistics Online, Editor-in-Chief: Marc L. Greenberg, General Editor: Lenore A. Grenoble, 2024

Several transcoding systems aim to convert non-Latin texts in various languages into Latin script... more Several transcoding systems aim to convert non-Latin texts in various languages into Latin script. This process, called “romanization” or “latinization,” is important for Slavic languages. In general, romanization systems fall into two classes: those based on transliteration and those based on practical transcription. With transliteration, attention is paid to simple correspondence (mutual unambiguity) between the original and the converted characters, allowing for text reversibility, i.e., restoration of the original text after reverse transliteration. With practical transcription, attention is paid to the sound of words in the original or in some other language (e.g., English, French). In the latter case, the original text cannot always be restored. For Slavic languages, transliteration is also used for historical texts written in Cyrillic, Glagolitic, and Arabic scripts. Latinization systems are widely applied in multilingual natural-language-processing systems, resulting in their wider use and demand for their application. The present entry also addresses some issues in the history of latinization and uses the example of Ukrainian as an illuminating case study.

Towards Ukrainian WordNet: Incorporation of an Existing Thesaurus in the Domain of Physics

Proceedings of the 19th Conference on Natural Language Processing (KONVENS 2023), 2023

In this paper, we represent the first version of the Ukrainian wordnet-Ukrajinet 1.0. It contains... more In this paper, we represent the first version of the Ukrainian wordnet-Ukrajinet 1.0. It contains 3,360 sets of full synonyms in the field of physics, consisting of 8,700 words. This knowledge base will help incorporate the Ukrainian language into multilingual scenarios of Natural Language Processing that need information about lexical-semantic relations.

Normalization of Ukrainian Letters, Numerals, and Measures for Natural Language Processing (preprint version)

Digital Scholarship in the Humanities, 2022

There is still little linguistic data on the Ukrainian language in the context of Natural Languag... more There is still little linguistic data on the Ukrainian language in the context of Natural Language Processing. At the same time, in view of real perspectives of European integration of Ukraine, its state language will soon probably become one of the European Community languages. This fact draws more and more interest to the possible inclusion of the Ukrainian language in multilingual Natural Language Processing scenarios. This article aims to provide some important data that is necessary for text-to-speech conversion of the Ukrainian language. Relying on Ukrainian phonetics and grammar, we formulate the basic rules governing the normalization of Ukrainian letters, numerals, and measures, that help obtain Ukrainian speech data. The results are presented in tables and strings convenient for coding and further implementation into linguistic tools. These rules are necessary for the text-to-speech tasks arising in various Natural Language Processing tasks associated with speech synthesis, machine translation, information retrieval, etc.

Deep Contextual Disambiguation of Homonyms and Polysemants (preprint version)

Digital Scholarship in the Humanities, 2022

A new metric method for word sense disambiguation is put forward, being formulated within a pheno... more A new metric method for word sense disambiguation is put forward, being formulated within a phenomenological analogy to the wave function of an observable quantity in quantum mechanics where the actual meaning of a multivalued word (a homonym or a polysemant) is determined by its context. The choice of the actualized sense is determined by the minimal semantic distance between the semantics of possible word senses and that of the context, where the meanings of the word in question and the context follow from their semantic fields based on lexicographic hyperchains. In the contrast to the common models, our method accounts for the semantic polarity. The formulated method showed good results in disentangling polysemy, which was not possible to achieve within the BERT-based contextualized embedding approach.

Ізоморфна латинізація староукраїнських і новоукраїнських текстів: від історичної писемної спадщини до сучасних лінгвістичних технологій

Balcania et Slavia 2, 2022

The development of information and linguistic tools for natural language processing raises, in pa... more The development of information and linguistic tools for natural language processing raises, in particular, the need for automated systems of simple-correspondent romanization of non-Latin alphabets, which would ensure full equivalence of source and transliterated texts. In order to preserve the historical written heritage, such systems should also be suitable for alphabets that have been used during the historical development of the language.
This article analyzes the Old Ukrainian and New Ukrainian alphabets and proposes a system of their isomorphic (simple-correspondent) transliteration into the Latin script, which is necessary for the effective inclusion of Ukraine in international cooperation in the field of information and linguistic technologies, preservation of historical written monuments, development of multilingual corpus linguistics and computational lexicography. Particular attention is paid to the letters that, during the historical development of the Ukrainian language, have had different readings, particularly <г>, <є>, <и>, <ъ>, <ь>.
Keywords: Ukrainian alphabets, Cyrillic-Latin transliteration, isomorphic latinization, Old Ukrainian language, New Ukrainian language, historical written monuments, linguistic technologies.

Semantic Comparison of Texts by the Metric Approach (preprint version)

Digital Scholarship in the Humanities, 2022

A novel approach to the semantic comparison of texts based on the metric method to calculate sema... more A novel approach to the semantic comparison of texts based on the metric method to calculate semantic distances between lexical units is put forward. The supplementary semantic information is provided through semes of the words composing the texts, or through their semantic fields. The proposed method takes into account semantic polarity and yields, for two paraphrase sentences, more feasible results than the conventional approaches based on word occurrences. The described approach may be useful for linguistic theory as well as for a variety of Natural Language Processing tasks based on supervised learning that require semantic information: computer lexicography, semantic analysis, information search and retrieval, document classification, text summarization, and understanding, machine translation, and others.

Використання національного транслітераційного стандарту ДСТУ 9112:2021 в інформаційному забезпеченні бібліотечної роботи

Бібліотека. Наука. Комунікація. Інноваційні трансформації ресурсів і послуг: матеріали Міжнар. наук. конф. (04 жовтня 2022 року). Київ, 2022

The national transliteration standard DSTU 9112:2021 ensures the implementation of the basic prin... more The national transliteration standard DSTU 9112:2021 ensures the implementation of the basic principles of transliteration and complete equivalence of the original and transliterated texts. It is necessary for library work and beyond, especially in view of the euro integration of Ukraine.
Національний транслітераційний стандарт ДСТУ 9112:2021 забезпечує виконання основних принципів транслітерації та цілковиту еквівалентність вихідного й транслітерованого текстів. Євроінтеграція України підвищує його важливість не тільки для бібліотечної роботи, а й для інших сфер.
Ключові слова: взаємнооднозначна транслітерація, принципи транслітерації, транслітераційний стандарт, бібліотечна робота, євроінтеграція України

A Tool for Scholarly Phonemic Transcription of Ukrainian Texts

Proceedings of the 2022 12th International Conference on Advanced Computer Information Technologies (ACIT), 2022

This article aims to present an instrument for phonemic transcription of Ukrainian texts that is ... more This article aims to present an instrument for phonemic transcription of Ukrainian texts that is necessary for phonetic studies, lexicographic work, and in text-to-speech conversion, machine translation, information retrieval, automatic speech recognition, language model transfer, and other natural language processing tasks. The tool provides an automatic scholarly transcription of Ukrainian texts into the International Phonetic Alphabet (IPA) and the Extended Speech Assessment Methods Phonetic Alphabet (X-SAMPA).

Unified Parametrization of Phonetic Features and Numerical Calculation of Phonetic Distances between Speech Sounds (preprint version)

Journal of Quantitative Linguistics, 2022

A metric method to numerically measure phonetic and phonemic distances, or contrasts, between spe... more A metric method to numerically measure phonetic and phonemic distances, or contrasts, between speech sounds is put forward. The feature values of the compared phones taken from the standard IPA charts, are treated as independent parameters that give rise to corresponding Euclidean distances. As an illustration, the general phone set is mapped to Ukrainian phonemes. The proposed model agrees well with the historical linguistic facts, experimental phonetic data, and with general intuition. The described approach may find its due applications in various fields of linguistics and speech technologies, including historical and typological linguistics, language acquisition, phonetic studies, computational phonology, machine translation, information retrieval, text-to-speech.

On the grammatical restrictions of the use of feminatives in the Ukrainian language.

In: Arghumenty suchasnoji filologhiji. Obraz zhinky: zhinoche, feministsjke, feminne [Arguments of modern philology. An image of a woman: womannish, feminist, feminine]. Ed. By Prof. S. K. Kryvoruchko. Materials of the 2nd International scientific conference, 7-8 April 2022. Kharkiv, 2022

This article is focused on the grammatical and social aspects of the use of feminatives in the Uk... more This article is focused on the grammatical and social aspects of the use of feminatives in the Ukrainian language. It was shown that the unreasonable use of feminatives may result in a number of inconsistencies leading to violation of grammatical and phonological rules and to the degradation of the language expressive ability. Helping little to make a woman more visible, this practice leaves no room to correctly denote non-binary persons and personified objects. It was shown also that the feminatives with the final «-kynja» are questionable from the phonological view. To resolve the grammatical issues resulting from the feminative usage, we propose to recognize the general gender that, similarly to the common one, is a complex gender. In contrast to the common gender, the general gender combines simple genders by means of the logical «OR» instead of the «AND». The corresponding names should be used in place of the masculine nouns in the generalizing function.

DSTU 9112: 2021 / ДСТУ 9112:2021

Cyrillic-Latin transliteration and Latin-Cyrillic retransliteration of Ukrainian texts. Writing rules, 2022

From 01 April 2022, the national standard DSTU 9112: 2021 comes into force: «Cyrillic-Latin trans... more From 01 April 2022, the national standard DSTU 9112: 2021 comes into force: «Cyrillic-Latin transliteration and Latin-Cyrillic retransliteration of Ukrainian texts. Writing rules». The standard was proposed and developed by specialists of the State Scientific and Technical Library of Ukraine and the Technical Committee 144 "Information and Documentation".
This standard applies not only to bibliographic records but also to all Ukrainian texts from the 10th century to the present day, including proper names. So, we have every legal reason to reproduce the Ukrainian letter "ї" not in the Russian manner (Kyiv, Ukraina), but according to the history, phonetics, and grammar of the Ukrainian language: Kyïv, Ukraïna (system A), Kyjiv, Ukrajina (system B). We have the transliteration rules that ensure the complete equivalence of Cyrillic and Latin writings. This standard is highly important given recent EU decisions on Ukraine, as Ukrainian will soon, along with other European languages, take its rightful place in multilingual natural language processing scenarios, including machine translation.
Slava Ukraïni!

Від 01.04.2022 набуває чинності національний стандарт ДСТУ 9112:2021 «Кирилично-латинична транслітерація і латинично-кирилична ретранслітерація українських текстів. Правила написання». Стандарт запропонували й розробили фахівці Державної науково-технічної бібліотеки України та Технічного комітету 144 «Інформація і документація».
Цей стандарт поширюється не тільки на бібліографічні записи, а й на всі українські тексти від X століття до наших днів, включаючи власні назви. Отже, маємо всі законні підстави відтворювати літеру «ї» не за російським зразком (Kyiv, Ukraina), а так, як це відповідає історії, фонетиці та граматиці української мови: Kyïv, Ukraïna (система А), Kyjiv, Ukrajina (система Б). Маємо транслітераційні правила, які забезпечують повну еквівалентність кириличного й латиничного записів. Значення цього стандарту особливо зростає з огляду на недавні рішення Євросоюзу щодо України, адже невдовзі українська мова, нарівні з іншими європейськими мовами, займе гідне місце у сценаріях багатомовного опрацювання природної мови, зокрема машинного перекладу.
Slava Ukraïni!

From Semantic Metrics to Semantic Fields

Proceedings of the 2021 IEEE 16th International Conference on Computer Science and Information Technologies (CSIT), 22–25 September 2021, Lviv, Ukraine., Dec 27, 2021

A new approach to build semantic fields based on the metric method to calculate semantic distance... more A new approach to build semantic fields based on the metric method to calculate semantic distances between lexical units is put forward. The general case of different weights of semes contributing to the word meaning is considered. It is assumed that the word semes are fed from their definitions or explanations available in a sufficiently large and comprehensive dictionary. We repeatedly apply the formula for semantic representation of a given word treating its semes as new words having their own semes, thus forming a lexicographic semantic tree. Due to hypercycles rising in real dictionaries, the number of iterations is finite which allows one to exactly calculate the weights of semes and, henceforth, to build semantic fields. It is demonstrated that the comprehensive componential analysis of the word should be based on the semantic fields. The described approach may be useful in a variety of Natural Language Processing tasks based on supervised learning that require semantic information: computer lexicography, semantic analysis, information search and retrieval, Machine Translation, etc.

Calculation of Phonetic Distances between Speech Sounds (Preprint Version)

Vakulenko, Maksym. 2019. “Calculation of Phonetic Distances between Speech Sounds.” In: Journal of Quantitative Linguistics [WoS, Scopus. 2018 impact factor 0.824]. Published online: October 23 , 2019

A new formalism to numerically measure phonetic differences between speech sounds treating featur... more A new formalism to numerically measure phonetic differences between speech sounds treating feature values of the compared phones as independent parameters that give rise to corresponding Euclidean distances, is put forward. The articulatory and acoustic methods within this formalism were compared, where the corresponding results display good agreement. The more reliable and more universal character of the acoustic approach is provided by robust and precise acoustic parameters used therein. The theoretical model and the findings of this article comply also with the experimental phonetic results. The proposed approach contributes to formalization of the procedure of phone comparison and mapping needed for automatic text and speech processing.

Key words: phonetic distance measurement, speech sounds comparison, phonetic feature values, robust parameters, phone mapping, IPA charts.

Науковий підхід до передачі англійських наукометричних термінів українською мовою.

Вакуленко М. О. Науковий підхід до передачі англійських наукометричних термінів українською мовою. Бібліотека. Наука. Комунікація : актуальні тенденції у цифрову епоху : матеріали Міжнар. наук. конф. (8-10 жовтня 2019 р.) : у 2-х т. Київ, 2019. Т. 1. 466 с. С. 413-417, 2019

A scholarly approach to the development and normalization of the system of Ukrainian scientometri... more A scholarly approach to the development and normalization of the system of Ukrainian scientometric terms as an integral part of the national terminology system is formulated. This approach is based on the application of methods of terminology-science and on the apt term features. Practical recommendations for the translation of some English scientometric terms are given. In particular, it is recommended to translate the English term Hirsch index as the “індекс Хірша”.
Keywords: scientometric terms, apt term features, terminological translation, phonetic analysis, voiceless vocoid, Ukrainian national term system.

У статті сформульовано науковий підхід до розвитку та унормування системи українських наукометричних термінів як невід'ємної частини національної терміносистеми. Цей підхід базується на застосуванні методів науки термінології та на ознаках вдалого терміна. Надано практичні рекомендації щодо перекладу деяких англійських наукометричних термінів. Зокрема, рекомендовано перекладати термін «Hirsch index» як «індекс Хірша».
Ключові слова: наукометричні терміни, ознаки вдалого терміна, термінологічний переклад, фонетичний аналіз, глухий вокоїд, українська національна терміносистема.

Ukrainian Consonant Phones in the IPA Context with Special Reference to /v/ and /gh/. In: Linguistica online 22: 36-61.

The acoustic and articulatory properties of Ukrainian consonant phones were investigated, and a f... more The acoustic and articulatory properties of Ukrainian consonant phones were investigated, and a full set of relevant IPA notations was proposed for these and compiled in a table. Acoustic correspondence of Ukrainian phones to those appearing in European languages was analyzed and discussed. Special attention was paid to the phonemes /v/ (represented in Cyrillic script as "/в/") and /gh/ (ren-dered in Cyrillic script as "/г/") that cause the most difficulties in their description. In particular, our experiments and observations suggest that a standard Ukrainian phoneme /v/ is realised as labiodental fricatives [v] and [v j ] before vowels and also as sonorant bilabial approximants [β̞ β̞ ˛ β̞ ɔ] between a vowel and a consonant, in the initial position before consonants and after a vowel at the end of a word, and sometimes is devoiced to [v̥ ] in the coda after a voiceless consonant. In some utterances after a vowel (before a consonant and in the coda), a strongly rounded bilabial approximant [β̞ ɔ] may approach a non-syllabic semivowel [ṷ]. These conclusions are in good agreement with the consonantal status of the Ukrainian language and with the general tendencies of sound combinations in the world languages. The findings of this research contribute to better understanding of Ukrain-ian and its special features in comparison with other world languages that may have substantial practical use in various phonetic and translation studies, as well as in modern linguistic technologies aimed at artificial intelligence development, machine translation incorporating text-to-speech conversion, automatic speech analysis, recognition and synthesis, and in other areas of applied linguistics.

Міфи і реалії правописного процесу 1928 року: документальні свідчення // Наукові праці Національної бібліотеки України імені В. І. Вернадського (Слов’янські обрії) : збірник наукових праць. Київ, 2019. Вип. 51(10). 320 с. С. 5-21.

Міфи і реалії правописного процесу 1928 року: документальні свідчення, 2019

An attempt to analyze unclear facts and processes that took place during adoption of the Ukrainia... more An attempt to analyze unclear facts and processes that took place during adoption of the Ukrainian spelling of 1928, is made. A previously commonly unknown letter of the State Spelling Commission Presidium member is taken into account. This letter states that Mykola Skrypnyk, Head of the Spelling Commission, repeatedly acted in the unfair manner in order to adopt unjustified and «revolutionary» changes to the Ukrainian spelling: he used to involve stranger persons to vote for the spelling, insisted on the ill-grounded expansion of the use of «ґ» in spite of the Ukrainian language practice, entered into Presidium a person who was not an expert in linguistics, and did not invite a Presidium member to the sessions. The author of this letter expresses displeasure by these actions and estimates the changes to the Ukrainian spelling carried out by Mykola Skrypnyk, as a «revolution». It is conjectured that there are no grounds to treat the Pravopys-1928 as legitimate and scholarly justified.

У статті зроблено спробу проаналізувати неоднозначні факти і процеси, що відбувалися під час прийняття українського правопису 1928 року, з урахуванням невідомого широкому загалу архівного документа – листа члена Президії Державної Правописної Комісії. Виявлено численні порушення регламенту в правописній процедурі, що свідчить про відсутність вагомих підстав уважати Правопис 1928 року легітимним і науково обґрунтованим.
Долю Правопису 1928 року фактично вирішувало не 80 кваліфікованих мовознавців, а лише двоє осіб: нарком освіти Микола Скрипник і товариш Антін Приходько. Перший фігурує в своїх анкетних даних як "професійний революціонер" із Донбасу, а другий – як юрист із Кубані.

Ukrainian vowel phones in the IPA context. In: Govor 35 (2): 189-214.

Acoustic and articulatory properties of Ukrainian vowels are investigated in this study and a ful... more Acoustic and articulatory properties of Ukrainian vowels are investigated in this study and a full set of relevant IPA notations are proposed. The notations are shown in the vowel diagram and the table. The results of the earlier acoustic invariant speech analysis based on special software, auditory and spectrum analysis were used and the results are discussed in the context of general and Ukrainian phonetic laws governing language evolution and acoustic properties of non-stressed vowels in relation to their stressed cognates. Such combined approach resulted in a more detailed vowel inventory than proposed heretofore. The findings of this research contribute to better understanding of Ukrainian language and its special features in comparison with other world languages that may have substantial practical use in various phonetic and translation studies, as well as in modern linguistic technologies aimed at artificial intelligence development, machine translation incorporating text-to-speech conversion, automatic speech analysis, recognition and synthesis, and in other areas of applied linguistics.

Calculation of semantic distances between words: From synonymy to antonymy (AOM)

Journal of Quantitative Linguistics, 2019

A new approach to numerically measure the semantic distances between lexical units (words and col... more A new approach to numerically measure the semantic distances between lexical units (words and collocations) based on the geometric analogies and analytical calculations, is put forward. Having considered the cases of equal and different weights of semes, we obtained exact algebraic formulas describing different levels of the meanings proximity, ranging from absolute synonymy to full antonymy. It was emphasized that absolute synonymy arises when the compared units contain equal numbers of semes that fully coincide and have equal weights in the corresponding pairs. Calculation of the semes weights helps to locate the unit more precisely on the semantic sphere. It was shown that the level of synonymy and antonymy decreases if different semes are accentuated, while the semantic distance between the units without identical semes cannot be influenced by seme boosting. It was observed that depending on the context, a word may wander over this sphere, thus modifying its lexical semantic relations with other units. As the proposed approach contributes to formalization of the units comparison procedure, it is advisable for relevant automatic tools, particularly for WordNets and FrameNet. The obtained results may be useful for various linguistic and associated studies including automatic text analysis and processing, computer lexicography, information search and retrieval, machine translation and other NLP applications that are related to the artificial intelligence problem.

Сучасна українська термінологія: методологія, кодифікація, лексикографічна практика: автореф. дис. на здобуття наук. ступеня док. філол. наук: спец. 10.02.01 / Modern Ukrainian Terminology: Methodology, Codification, Lexicographic Practice. Dissertation Abstract

Київський національний університет імені Тараса Шевченка, Міністерство освіти і науки України, 2023

A new complex approach to the Ukrainian natural terms use and coinage is presented, grounded, and... more A new complex approach to the Ukrainian natural terms use and coinage is presented, grounded, and developed here. It is based both on linguistic and natural sciences routine, effectively combining descriptive and prescriptive aspects of term study. Such an approach creates prerequisites to consider terminology not only as a lexicology subsection, a set of terms or doctrine on them but first of all as an independent discipline about terms rise, development, and functioning.

Словник фізичних термінів-синонімів / М. О. Вакуленко, О. В. Вакуленко. 2017. Київ : ВПЦ “Київський університет”. 191 с.

Наведено, упорядковано й уточнено терміни-синоніми в галузі фізики та суміжних наук, уживані в пи... more Наведено, упорядковано й уточнено терміни-синоніми в галузі фізики та суміжних наук, уживані в писемному та усному фаховому спілкуванні від початку ХХ століття до цього часу. Словник містить близько 5000 рядів фізичних синонімів.
Запропоновано класифікацію термінів, позначуваних віддієслівними іменниками, відповідно до якості дії (разової або короткочасної, тривалої та повторюваної).
Для студентів і викладачів природничих факультетів вищих навчальних закладів, мовознавців, фахівців у галузі фізики та термінології.

Українська термінологія: комплексний лінгвістичний аналіз: [монографія] / М. О. Вакуленко. – Івано-Франківськ : Фоліант, 2015. – 361 с., іл.

У монографії подано, обґрунтовано та розвинено комплексний пiдxiд до вживання і творення українсь... more У монографії подано, обґрунтовано та розвинено комплексний пiдxiд до вживання і творення українських тeрмiнiв, заснований на використанні методів природничих наук і мовознавчих аспектів дослідження. Такий підхід створює передумови для того, щоб розглядати термінологію не лише як розділ лексики, сукупність термінів і вчення про них, а насамперед як науку про утворення, розвиток і функціонування термінів. За допомогою послідовного застосування статистичного та аналітичного методів досліджено теоретичні й практичні термінографічні проблеми, розкрито правописні аспекти відтворення запозичених та іншомовних слів. На основі методу акустичних інваріантів досліджено акустичні характеристики звуків української (а також англійської) мови та знайдено їх інваріантні акустичні параметри. У монографії відображено результати автора, одержані в процесі його наукової та викладацької роботи в Державній академії керівних кадрів освіти, Міжрегіональному інституті вдосконалення вчителів, Київському національному лінгвістичному університеті, Київському національному університеті імені Тараса Шевченка, Техаському університеті в Арлінгтоні (США), Єнському університеті імені Фрідріха Шіллера (Німеччина), Гіссенському університеті імені Юстуса Лібіга (Німеччина), Лейпцізькому університеті (Німеччина), Дрезденському технічному університеті (Німеччина), Університеті Країни басків (Іспанія), Українському мовно-інформаційному фонді Національної академії наук України.

Для науковців, викладачів, аспірантів і студентів філологічних факультетів вищих навчальних закладів, а також для лексикографів, фонетистів, перекладачів і фахівців у галузі прикладної лінгвістики, української термінології і термінографії.

CONTENTS
LIST OF SYMBOLS AND ABBREVIATIONS 8
INTRODUCTION 9
PART 1
THEORETICAL AND METHODOLOGICAL PRINCIPLES OF TERMINOLOGY 18
1.1. Term and terminology: underlying statements 18
1.2. Successful term criteria 30
1.3. General tendencies of development of terminological systems 30
1.4. Main lexical semantic relations in term vocabulary (polysemy, homonymy, paronymy, synonymy) 40
1.5. National and international trends of term fund updating 62
PART 2
SEMANTIC, GRAMMATICAL, AND PHONETIC PROBLEMS OF
UKRAINIAN TERMS CODIFICATION 75
Section 2.1. Semantic and morphological qualities of Ukrainian
term vocabulary 75
2.1.1. Alien borrowings as a linguistic and terminological problem 75
2.1.2. Semantic and stylistic features of use of paronyms and pseudosynonyms in the professional language 83
2.1.3. Scientific approach to terminological use of verbal nouns 112
2.1.4. Principles of representation of active participial forms in the Ukrainian scientific language 125
2.1.5. Possibilities and prospects of national word coinage
in terminology 127
Section 2.2. Acoustic characteristics of Ukrainian speech sounds as a ground
for scientific approach to render the borrowings 135
2.2.1. Phonetics, acoustic analysis, speech recognition 135
2.2.2. Phonetical and phonological language elements 138
2.2.2.1. Basic concepts of phonetics and phonology 138
2.2.2.2. Nonlinearity and non-invariance of speech elements 147
2.2.3. Speech acoustics 152
2.2.3.1. Acoustic theory of speech production 152
2.2.3.2. Speech sounds formants 157
2.2.3.3. Resonators in a human speech tract 159
2.2.3.4. Invariant acoustic speech analysis 162
2.2.3.5. Acoustic characteristics and invariants of
speech sounds 165
2.2.3.6. Interaction between sounds 179
PART 3
PECULIARITIES OF ORTHOGRAPHIC NORM APPLICATION IN SCIENTIFIC LEXICOGRAPHY AND WAYS TO PERFECT
MODERN SPELLING 189
3.1. Modern spelling strategy problems 189
3.2. Problems of modern spelling of terminological
and common vocabulary 191
3.2.1. Genitive case of masculine nouns of second reversal 181
3.2.2. Use of genitive and accusative cases 195
3.2.3. Spelling of compiled adjectives 196
3.2.4. The features of use of Ukrainian prepositions “в” (“у”) and
“до” for denotation of motion aim and direction 198
3.2.5. Stress problem in three-syllable nouns with the prefix “за ” 202
3.3. Denotation of time moments in Ukrainian 206
3.4. Problem of adaptation of alien terms to Ukrainian spelling 211
3.4.1. Study of phonemes /ґ/, /г/, and /х/ 211
3.4.2. “The nine” (“dev’jatka”) rule in the context of Slavic and non Slavic parallels of the Ukrainian language 214
3.4.3. Doubling simplification problems 220
3.4.4. Some “trifles” of spelling perfection 221
3.4.5. Discussion suggests to spelling 222
3.5. Scholar principles to render borrowed and foreign words:
invariant transcription and transliteration 229
3.5.1. Pronunciation and transcription 229
3.5.2. Writing and transliteration 234
CONCLUDING STATEMENTS 255
LIST OF LITERATURE 260
APPENDIX A: Visual representation of speech sounds
(spectrograms, oscillograms) 397
APPENDIX B: Inculcation of the research results
(main documentation) 337
APPENDIX V: Use of the term Χυμεία in Greek 361

Сучасна українська термінологія: методологія, кодифікація, лексикографічна практика. Рукопис. Дис. на здобуття наук. ступеня док. філол. наук: спец. 10.02.01 – українська мова / Modern Ukrainian Terminology: Methodology, Codification, Lexicographic Practice

Київський національний університет імені Тараса Шевченка, Міністерство освіти і науки України, Київ, 2023

The thesis is devoted to the study of descriptive and prescriptive approaches in the modern codif... more The thesis is devoted to the study of descriptive and prescriptive approaches in the modern codification of the Ukrainian scientific term lexicon.
A new complex approach to the Ukrainian natural terms use and coinage is presented, grounded, and developed here. It is based both on linguistic and natural sciences routine, effectively combining descriptive and prescriptive aspects of term study. Such an approach creates prerequisites to consider terminology not only as a lexicology subsection, a set of terms or doctrine on them but first of all as an independent discipline about terms rise, development, and functioning.

The Ukrainian transliteration standard DSTU 9112:2021 “Cyrillic-Latin transliteration and Latin-Cyrillic retransliteration of Ukrainian texts. Writing rules”: on Ukraine’s way to the European Community

Chance for Science, 2022

Smolynsjka - analiz metodrozrobky dijeprykmetnyka

Смолинська Є. С. Коментарі щодо методрозробки про викладання дієприкметників, 1959

Коментарі Є. С. Смолинської на машинописі методрозробки невідомого автора щодо дієприкметників. З... more Коментарі Є. С. Смолинської на машинописі методрозробки невідомого автора щодо дієприкметників.
Зауваження Євгенії Смолинської містять цікаву інформацію щодо її бачення граматики української мови, історії мови, методики викладання

Smolynsjka dijeprykmetnyky 6 tezy

Смолинська. Дієприкметник у 6 класі с. ш. / Тези доповіді. Київ, 1954

У тезах доповіді йдеться про класифікацію дієприкметників, порівняння дієприкметників у східносло... more У тезах доповіді йдеться про класифікацію дієприкметників, порівняння дієприкметників у східнослов'янських мовах, методику викладання цієї складної теми в середній школі. Тема є актуальною й на початку 21 століття, не кажучи про інтерес для істориків мови.

Smolynsjka dijeprykmetnyky rukopys

Смолинська Євгена. Дієприкметники сучасної української мови на уроках граматики в середній школі. Київ, 1954

У статті йдеться про класифікацію дієприкметників, порівняння дієприкметників у східнослов'янськи... more У статті йдеться про класифікацію дієприкметників, порівняння дієприкметників у східнослов'янських мовах, методику викладання цієї складної теми в середній школі. Тема є актуальною й на початку 21 століття, не кажучи про інтерес для істориків мови.

Smolynsjka dijeprykmetnyky

Смолинська Є. Викладання дієприкметників у 6 класі середньої школи. Київ, 1953

У статті йдеться про класифікацію дієприкметників, порівняння дієприкметників у східнослов'янськи... more У статті йдеться про класифікацію дієприкметників, порівняння дієприкметників у східнослов'янських мовах, методику викладання цієї складної теми в середній школі. Тема є актуальною й на початку 21 століття, не кажучи про інтерес для істориків мови.

Smolynsjka - Gholota. Vykladannja dijeprykmetnykiv

Смолинська Є. С., Голота К. Д. Викладання дієприкметників у школі з російською мовою навчання. Київ, 1955

Машинопис статті Є. С. Смолинської та К. Д. Голоти "Викладання дієприкметників у школі з російськ... more Машинопис статті Є. С. Смолинської та К. Д. Голоти "Викладання дієприкметників у школі з російською мовою навчання", очевидно поданої в редакцію журналу "Українська мова в школі".

У статті йдеться про класифікацію дієприкметників, порівняння дієприкметників у східнослов'янських мовах, методику викладання цієї складної теми в середній школі. Тема є актуальною й на початку 21 століття, не кажучи про інтерес для істориків мови.

У редакційному листі від 13.3.1955 сказано:
"Шановна Євгеніє Сергіївно!
На жаль, вашу статтю про дієприкметники в українській мові не будемо брати на розгляд, оскільки інший автор (про якого я Вам говорила) свою статтю на цю тему обіцяє доробити за рецензією.
З повагою
Зав. науково-популярного відділу
/підпис/ (Е. Ржевуцька)"

Синтез дескриптивного та прескриптивного підходів у сучасній кодифікації українського наукового термінолексикону

Автореф. дис. на здобуття наук. ступеня док. філол. наук: спец. 10.02.21 “Структурна, прикладна та математична лінгвістика”. Національний педагогічний університет імені М. П. Драгоманова, Міністерство освіти і науки України. Київ. , 2020

Vakulenko M. O. Synthesis of Descriptive and Prescriptive Approaches in the Modern Codification o... more

Фемінативи: що вони нам дають і що відбирають

Лекція в КПІ 31 жовтня 2024 року, 2024

мові». Порозуміння з усіма ключовими сторонами. Передбачено протистояння між активістами та мовоз... more мові». Порозуміння з усіма ключовими сторонами. Передбачено протистояння між активістами та мовознавцями, яке відображає відмінності між позамовною дійсністю та мовою. Має діяти принцип «мухи окремо, котлети окремо»: • фемінізм окремо, фемінативи окремо; • граматичний рід окремо, стать / гендер окремо. Розкол української мови-допомога ворогам. ICMRSS Vienna, 28 November 2024. Ефір заполонили фемінативи. Журналісти, активісти. Мовознавці: переважно критичне ставлення. Лариса Кислюк: «вживання іменників чоловічого роду як «титульних імен» в офіційно-діловому стилі закріплене мовною традицією і правилами етикету» (2018, 456-457). Алла Архангельська: навіть у сприйнятливій до фемінативів чеській мові існують істотні обмеження їхнього вжитку. Фемінативи як засіб мовної ідентифікації жінок не підкреслюють їхнього соціального статусу, тому в позитивному контексті мовознавці віддають перевагу традиційним «маскулінним» формам (Архангельська 2019, 376). Олександр Тараненко: мовний андроцентризм не можна розглядати як прояв ставлення суспільства до жінки (Тараненко 2020, 28).

Прикладні та екзистенційні аспекти впровадження національного стандарту української латиниці

Доповідь на семінарі НАН України "Інформація. Мова. Інтелект" 30 вересня 2024 року, 2024

Гібридна війна -історичні дослідження. Євроінтеграція України: • багатомовні сценарії опрацювання... more Гібридна війна -історичні дослідження. Євроінтеграція України: • багатомовні сценарії опрацювання природної мови, • включення українських пам'яток у програми збереження історичної спадщини. => ізоморфна (взаємнооднозначна) романізація назв українських пам'яток. Актуально: латинізація українських текстів.

Український ворднет як технологічний засіб інтеграції української мови у світовий мовний простір

Актуальні питання сучасної лінгвістики - Київ, НаУКМА, 25 жовтня 2024 року, 2024

Ворднети (від англ. word і net, букв. словосіть): поняття (концепти)-> ряди лексичних синонімів (... more Ворднети (від англ. word і net, букв. словосіть): поняття (концепти)-> ряди лексичних синонімів (синсети) + означення, приклади вживання, лексико-семантичні відношення. 2 підходи: розширення (англ. expansion) та злиття (англ. merge). Концепти у різних мовах-> міжмовні індекси (англ. interlingual indices). Лексикографічна інформація: ворднети (Відкритий англійський ворднет (https://en-word.net/)-англ. Open English Wordnet) + традиційні словники. 2 Вступ Перша версія української словосіті-Ukrajinet 1.0: злиття, «Тлумачний словник із фізики», «Словник фізичних термінів-синонімів». Один ряд із провідними технологічними державами стосовно семантичних мовних ресурсів для автоматичного опрацювання мови. Подальша робота: додано близько 750 означень понять і 700 прикладів використання, переклад польського ворднету. Означення: «Словник української мови в 20-ти томах», «Тлумачний словник із фізики», Відкритий англійський ворднет. Поточна версія Ukrajinet 1.0: понад 15 тисяч лексичних гасел, понад 12 тисяч синсетів.

Українська фонетика в контексті перекладу: здобутки та міфи

Building bridges, upgrading competence: translation and interpreting in today's connected world, 2024

Фонетичні зауваги щодо практичної транскрипції японських слів українською мовою

КСУ імені Б. Грінченка