Computational Linguistics: Machine Translation

description24 papers

group520 followers

lightbulbAbout this topic

Computational Linguistics: Machine Translation is a subfield of computational linguistics focused on the automatic conversion of text or speech from one language to another using algorithms and computational models, integrating linguistic knowledge and statistical methods to improve translation accuracy and fluency.

lightbulbAbout this topic

Key research themes

1. How do statistical and linguistic models contribute to parameter estimation and alignment accuracy in Machine Translation systems?

This research area explores the mathematical and algorithmic foundations of statistical machine translation (SMT), focusing on how models estimate translation parameters and align words between bilingual sentence pairs. It is foundational because effective word alignment and parameter estimation directly impact translation quality. Understanding and improving these models provide groundwork for advanced MT methodologies.

The Mathematics of Statistical Machine Translation: Parameter Estimation

by Abhijit Das

2016

Key finding: This paper developed a series of five statistical models of the translation process and provided concrete algorithms for estimating their parameters using bilingual sentence pairs. It introduced the formalism of word-by-word... Read more

articleView Paper downloadDownload

Approaches to Machine Translation: A Review

by John Oladosu

2025, FUOYE Journal of Engineering and Technology

Key finding: This paper systematically reviewed SMT as a corpus-based approach, detailing the core machine learning problems: modeling translational equivalence, parameterization, parameter estimation, and decoding. It highlighted key SMT... Read more

articleView Paper downloadDownload

Linguistically motivated Evaluation of the 2022 State-of-the-art Machine Translation Systems for three Language Directions

by Eleftherios Avramidis and

2022, Seventh Conference on Machine Translation

Key finding: Building upon SMT and neural MT models, this study employed linguistically informed test suites for German-English, English-German, and English-Russian to analyze state-of-the-art MT system outputs. It introduced... Read more

articleView Paper downloadDownload

Setting a Methodology for Machine Translation Evaluation

by Widad MUSTAFA EL HADI

2024, issco.unige.ch

Key finding: This research proposed a structured methodology for evaluating MT system outputs focusing on syntactic and lexical fidelity as proxies for intelligibility and accuracy. It incorporated black-box evaluation protocols and... Read more

articleView Paper downloadDownload

keyboard_arrow_downShow more

2. What are effective methodologies for evaluating machine translation quality, and how do linguistic and human-centered approaches compare?

This theme addresses the complex challenge of quantitatively and qualitatively assessing MT system performance. It spans approaches from purely automated, linguistically motivated test suites to semi-automatic and manual human evaluations, emphasizing the balance between scalability, objectivity, and linguistic nuance. These evaluation frameworks are critical for iterative MT development and deployment in real-world multilingual contexts, guiding researchers in measuring progress and identifying weaknesses.

A Methodology for a Semi-Automatic Evaluation of the Lexicons of Machine Translation Systems

by Ahmed Guessoum

2024, Machine Translation

Key finding: This work introduces a semi-automated lexicon evaluation method grounded in domain-specific word sense importance. By weighting lexical items according to their relevance in application contexts, the method quantifies lexicon... Read more

articleView Paper downloadDownload

Looks like google to me: Instructor ability to detect machine translation in L2 Spanish writing

by Luciane Maimone and

2024, F L A

Key finding: This empirical study investigated human instructor ability to detect MT-generated texts in second language (L2) Spanish writing. Results revealed that instructors could reliably distinguish MT outputs from learner-produced... Read more

articleView Paper downloadDownload

The position of machine translation in translation studies: A definitional perspective

by Omri Asscher

2023, Translation Spaces

Key finding: This paper engages critically with how MT is conceptually situated within translation studies, discussing definitional approaches (prescriptive versus descriptive) that frame MT as a translational object. It argues for... Read more

articleView Paper downloadDownload

Computational Linguistics and Natural Language Processing

by Peter Revesz

2024, MDPI

Key finding: Though covering a broad spectrum of computational linguistics topics, this collection includes novel linguistic profiling and text genre classification methodologies that have implications for MT evaluation, such as... Read more

articleView Paper downloadDownload

keyboard_arrow_downShow more

3. How do recent Large Language Models (LLMs) compare to traditional Machine Translation systems in handling contextual meaning, fluency, and idiomatic expressions in multilingual translation tasks?

This theme investigates the capabilities and limitations of state-of-the-art LLMs such as ChatGPT relative to established MT systems like Google Translate. It focuses on their performance in generating contextually accurate, fluent, and culturally aware translations across language pairs involving Arabic, English, and others. Understanding their comparative strengths and deficiencies informs ongoing model improvements and encourages hybrid approaches integrating human expertise.

Large Language Models as Computational Linguistics Tools: A Comparative Analysis of ChatGPT and Google Machine Translations

by Mohammad Awad AlAfnan

2024, Journal of Artificial Intelligence and Technology

Key finding: This comparative study evaluated ChatGPT and Google Translate on Arabic-English and English-Arabic translation of high-profile speeches, assessing metrics including semantic adequacy, meaning preservation, style, and... Read more

articleView Paper downloadDownload

"DeepL translator: The grammatical and syntactic challenges of DeepL in translating a literary piece of work from Russian to English & Greek"

by Larisa Strikou

2025, Aristotle University of Thessaloniki

Key finding: This research evaluated DeepL’s ability to translate complex literary texts, revealing challenges in accurately rendering grammar, syntax, and pragmatic meanings inherent in literary discourse. While MT technology like DeepL... Read more

articleView Paper downloadDownload

The Mathematics of Statistical Machine Translation: Parameter Estimation

by Abhijit Das

2016

Key finding: Via statistically driven parameter estimation and alignment models, this seminal paper laid the groundwork for later neural and LLM approaches. The probabilistic frameworks and alignment algorithms established here enable... Read more

articleView Paper downloadDownload

keyboard_arrow_downShow more

All papers in Computational Linguistics: Machine Translation

Translation in the digital era COMPARATIVE STUDY OF AI AND HUMAN APPROACHES

by Eldar Shahgaldiyev

2025

descriptionView Paper arrow_downwardDownload

Journal of Scientific Development for Studies and Research (JSD) ‫اسات‬ ‫للدر‬ ‫العلمي‬ ‫التطوير‬ ‫ـة‬ ‫مجل‬ ‫والبحوث‬ P

by Journal of Scientific Development for Studies and Research (JSD)

2025, Journal of Scientific Development for Studies and Research (JSD)

The term 'dysarthria' refers to a speech motor execution disorder that arises from damage to the central and/or peripheral nervous system. The condition is linked to various neurological acute pathologies (like stroke or head trauma),... more

descriptionView Paper arrow_downwardDownload

دور الذكاء الاصطناعي في التعليم

by Journal of Scientific Development for Studies and Research (JSD)

2025, Journal of Scientific Development for Studies and Research (JSD)

This research aims to explore the role of artificial intelligence in enhancing the teaching and learning process. It highlights the immense potential of AI to improve assessment processes and provide flexible... more

descriptionView Paper arrow_downwardDownload

An Interdisciplinary Approach to Human-Centered Machine Translation

by Omri Asscher

2025, EMNLP

descriptionView Paper arrow_downwardDownload

الضرر البيئي كأساس لقيام المسؤولية الإدارية في التشريع الجزائري ــــــــــــــــــــــــــــــــــــــــــــــــــــــــــــــــــــــــ ب/أمينة ريحاني

by Journal of Scientific Development for Studies and Research (JSD)

2025, Journal of Scientific Development for Studies and Research (JSD

Traditional medicine: towards a paradigm rooted in popular culture of the Eastern Rif (Oulad Settout tribe in Zaio as a model), a topic that reveals to us cultural and heritage worlds, in which the Eastern Rif of Morocco... more

descriptionView Paper arrow_downwardDownload

مقال+المجلة+الطب+الشعبي+البحث+

by Journal of Scientific Development for Studies and Research (JSD)

2025

descriptionView Paper arrow_downwardDownload

اشكالية مقاربة النص الادبي البحث

by Journal of Scientific Development for Studies and Research (JSD)

2025, Journal of Scientific Development for Studies and Research (JSD

This study aims to reveal the reality of teaching the language sciences component in the secondary stage, and the extent to which this reality contributes to the low level of students in the Arabic language subject, to diagnose the positive aspects with a desire to enrich it, and the negative aspects to enrich interest in it, and to draw attention to it as a problem worthy of successive research in order to eliminate it. And avoid it. The importance of this research paper will enable secondary school Arabic language teachers to improve their professional performance by identifying the problems that hinder the achievement of the intended objectives of the linguistics lesson through a comprehensive review of curricula, methods and educational means. To generalize and achieve this importance, this research paper was launched through a general realistic problem among secondary school students, which is the difficulty of students' comprehension of the linguistics lesson and its negative impact on their results in their linguistic performance. To overcome this problem, the study used the descriptive analytical approach, and the tool was content analysis. The research sample included educational directives and programs for teaching Arabic in secondary school education and analysis of the results of the questionnaires. The results of the study showed that learners' stumble in the language sciences component is due to: the difficulty of the component itself, confinement to old teaching methods that tend to memorize and memorize rules, examples, and evidence, the psychological complex of learners in the component, lack of clarity of the goals envisaged by teaching the component, The Arabic language curriculum in the preparatory secondary school is based on the principle of quantity rather than quality. The form and content of the textbook, the educational experiences it provides and the way they are presented are negatively affected by the learners' performance. The educational experiences provided do not measure the test. Considering the results, the study recommended the necessity of reconsidering the study plan in accordance with the objectives sought from the Arabic language curriculum, and then testing the textbook before implementing it. Classifying the presentation of the scientific material in a way that suits the steps of remembering, and balancing the presentation of the examples presented to include ancient, modern, and contemporary ones in a manner that is appropriate to the age stage of the learners. And building the content of the language sciences component according to the functional and logical approach, as this leads to achieving the desired goals, and teaching the language sciences component within the framework of the activities approach, through the students performing several tasks and activities instead of memorizing the rules and repeating them without awareness of them.
Keywords: Language sciences;pedagogy;didactics;descriptive analytical approach;teaching methodology;Personal project;interventional research

descriptionView Paper arrow_downwardDownload

تحليل الافتراضات المسبقة التداولية في المقالات الصحفية السياسية

by Journal of Scientific Development for Studies and Research (JSD)

2025, Journal of Scientific Development for Studies and Research (JSD

This study investigatesthe underlying presuppositions present in a newspaper article published in The Telegraph, a British daily. The newspaper article is authored by Con Coughlin, the foreign and defense editor... more

descriptionView Paper arrow_downwardDownload

Translation into Arabic A study of the negatives and positives

by Journal of Scientific Development for Studies and Research (JSD)

2025, Journal of Scientific Development for Studies and Research (JSD

The research aims to shed light on the importance of translation in the Arab Renaissance and its development of the Arabic language in the modern era.Translation is considered one of the most important factors of the... more

descriptionView Paper arrow_downwardDownload

Integrating Technology in Translating Laws from and into Arabic: Benefits, Implications and Limitations

by Rafat Y . Alwazna

2025, International Journal for the Semiotics of Law

descriptionView Paper arrow_downwardDownload

The Role of Artificial Intelligence in Preserving the Arabic Language Opportunities and Challenges

by MAHBOUBA BEKOUCHE

2025, الملتقى الوطني:استنطاق مسار الفعل الترجمي في كنف القاموس العربي بين الأصالة و المعاصرة

The Arabic language, known for its complexity and rich cultural significance, faces increasing pressure to adapt in a rapidly evolving technological landscape. This paper investigates the role of artificial intelligence (AI) in... more

descriptionView Paper arrow_downwardDownload

أثر الترجمة الآلية على البنية اللغوية العربية دراسة في ضوء اللسانيات الحاسوبية The Impact of Machine Translation on Arabic Linguistic Structure: A Study in Light of Computational Linguistics

by Dr Mahmoud A Al-Feky

2025, أثر الترجمة الآلية على البنية اللغوية العربية دراسة في ضوء اللسانيات الحاسوبية The Impact of Machine Translation on Arabic Linguistic Structure: A Study in Light of Computational Linguistics

Considering the technological revolution the world is witnessing today, machine translation has become an indispensable tool for knowledge transfer and cross-cultural communication. However, questions arise regarding the impact of these technologies on the Arabic language, particularly from the perspective of computational linguistics, which focuses on studying language through modern technologies. Arabic is considered one of the most complex languages in terms of morphological and syntactic structures, as it relies on a root-based system and precise linguistic constructions. This complexity poses a significant challenge to machine translation systems that depend on algorithms and natural language processing (NLP). Despite the continuous advancements in these systems, they still encounter notable difficulties when handling Arabic, which often affects the quality of the translated texts. One of the key issues machine translation faces with Arabic is the difficulty in maintaining the original syntactic structure or intended meaning, especially in long or compound sentences. Moreover, machine translation sometimes tends to be overly literal, resulting in the loss of subtle nuances that require a deeper understanding of the linguistic and cultural context. Common errors include confusion in verb tenses, inaccurate translation of idiomatic expressions, and neglecting the cultural and social context that language plays in shaping meaning. To clarify these challenges, this study will analyze popular machine translation models such as "Google Translate" and "DeepL" on morphological, syntactic, and semantic levels. The results indicate that these models still require further development to suit the unique nature of the Arabic language. The study will also propose solutions aimed at improving translation accuracy while ensuring cultural and linguistic depth. The significance of this research lies in its exploration of a vital topic that addresses the interaction between technology and the Arabic language. Understanding this interaction will help formulate strategies to preserve the authenticity of the Arabic language while adapting it to meet the demands of the digital age.

descriptionView Paper arrow_downwardDownload

DeepSeek vs. ChatGPT: A Comparative Evaluation of AI Tools in Composition, Business Writing, and Communication Tasks

by Mohammad Awad AlAfnan

2025, Journal of Artificial Intelligence and Technology

This study presents a comparative evaluation of DeepSeek and ChatGPT, two AI-powered text generation models, in composition, business writing, and communication tasks. The article assesses AI-generated content based on clarity, coherence,... more

descriptionView Paper arrow_downwardDownload

Фефелов А.Ф. Текстовые проблемы перевода

by Анатолий Федорович Фефелов

2025, А.Ф.Фефелов. Текстовые проблемы перевода // Вопросы теории и практики перевода: Вестник ИГЛУ. Сер. Лингвистика. – Иркутск: Иркутский гос. лингв. ун-т, 2001. – №6. С. 102-116

L'étude comparative des textes de Saint-Exupéry, Giraudoux, Bazin et de leurs traductions russes prouve pourquoi la véritable unité de traduction ne peut être que le texte, et non la phrase et surtout pas le mot. En dehors du texte, les phrases n'ont qu'un sens «lexico-grammatical», alors que comprendre le texte achevé c'est interpréter son sens socioculturel et, dans le cas d'une oeuvre littéraire, sa poétique.
Pour montrer l'ensemble des changements lexico-syntaxiques qu'a subi le TD, la méthode de partition en chaînes coréférentes fut utilisée. Elle permet d'identifier des zones d'asymétrie des formalismes lexico-syntaxiques des TD et TA tout en préservant pleinement la poétique du premier, ses sens et messages. L'asymétrie de ce type définit le mieux les normes d'une traduction dite textuelle, tout en illustrant la différence entre la logique de la traduction humaine et la «logique» algorithmique de la traduction automatique. Cependant, les écarts du TA sont mal expliqués par les approches de la linguistique textuelle. L'identification des micro- et macroprositions du TA s'avère insuffisante pour garantir une bonne traduction, bien que la linguistique textuelle ait, bien sûr, raison de postuler que les relations d'équivalence ne peuvent exister qu'entre les textes.
Le texte est un milieu sans lequel l'intention ou la vision conceptuelle du traducteur ne peuvent être réalisées comme il faut. Cette hypothèse ne s'inscrit ni dans la logique de la linguistique classique, ni dans la logique de la linguistique du texte. Les critères de traduction adéquate proposés par la linguistique textuelle sont tous purement théoriques. Leurs présence est visible dans un texte bien construit ou bien traduit, mais ils ne donnent pas lieu aux outils pour atteindre la conformité nécessaire.
L'article analyse, entre autres, les causes et les effets du remplacement des verbes par des noms dérivés de la même famille, la transformation du système temporel du texte, les changements du type communicatif des propositions, la pertinence des erreurs lexicales pour porter un jugement sur l'équivalence du TA, etc. L'équivalence linguistique entre les éléments du TD et du TA (leurs mots et constructions syntaxiques) n'existe en fait que comme correspondance de leurs significations lexicales et grammaticales. Pour les deux textes achevés ou deux oeuvres, cette équivalence linguistique est secondaire. Le destinataire (lecteur, éditeur, critique) attend dans le TA la parfaite union de la lettre et de l'esprit.
Mots-clés: traduction, traduction sémantique, traduction interprétative, linguistique du texte, linguistique contrastée, stylistique, Prosper Mérimée, enseignement de la traduction, analyse coréférentielle des TD et TA.

descriptionView Paper arrow_downwardDownload

ТЕКСТОВАЯ И ЛИНГВИСТИЧЕСКАЯ АДЕКВАТНОСТЬ ПЕРЕВОДА И МЕЖСИСТЕМНЫЕ СОПОСТАВЛЕНИЯ

by Анатолий Федорович Фефелов

2025, A.F.Fefelov. Textual vs linguistic adequacy of translation and comparative language studies // Quantitative linguistics and semantics. Collection of scientific works. Novosibirsk: Novosibirsk State Pedagogical University Publishing House, 2001. - Issue 3. pp. 191-199.

Аннотация. L’article analyse les différences fondamentales entre l'adéquation (ou équivalence) linguistique de la traduction (répandue dans son enseignement) et textuelle dominant dans tous les types de traduction professionnelle. Il est prouvé que les traductions de la même phrase en dehors du texte (cf. exercices de traduction) et dans le texte diffèrent presque jamais. Ce fait a une importance significative non seulement pour la pratique de la traduction éducative, les méthodes d'évaluation de la qualité de la traduction dans l'environnement éducatif, mais surtout et avant tout, pour les principes et les méthodes de recherche linguistique contrastée. En particulier, il est montré que le matériel des sources littéraires et de leurs traductions en russe ne convient pas très bien pour identifier les caractéristiques comparatives du vocabulaire et de la grammaire des langues, car la traduction textuelle transforme fortement la structure lexico-grammaticale du texte de départ (TD). La linguistique contrastée devrait recourir au matériel littéraire pour justifier ses conclusions intersystèmiques avec une extrême prudence.
La traduction des phrases hors contexte et la traduction d'un texte composé des mêmes phrases représentent deux types spécifiques d'activité traductrice visant à résoudre des tâches très différentes. Dans le premier cas, nous travaillons par excellence avec les significations lexicales et grammaticales des unités linguistiques de la phrase. Dans le second, nous avons affaire au sens du texte achevée, la personne de l’auteur et son esthétique stylistique. Dans le premier cas, il ne nous viendra jamais à l’esprit le désir de remplacer la phrase narrative par une phrase interrogative ou exclamative. Il n’y a aucune raison de le faire hors de texte. Dans le second cas, de telles substitutions (le terme transformations ne convient guère dans ce cas-là) sont assez fréquentes, et la probabilité de remplacement augmente à mesure que nous passons des types de texte rhétorique neutres (informatifs, scientifiques, commerciaux) aux types de texte rhétorique marqués (belles lettres, par exemple).
Ces deux types de traduction entretiennent un rapport très différent avec les soi-disant «difficultés» de traduction, populaires dans les manuels scolaires. Dans le premier cas, ils sont classés selon leurs caractéristiques grammaticales ou leur asymmetrie lexicale donnant parfois lieu aux lacunes. Dans le second, elles sont toutes de nature stylistique ou esthétique, toutes n’existent que dans un texte achevée et non par elles-mêmes, et varient d’une œuvre à l’autre, d’un auteur à l’autre.

descriptionView Paper arrow_downwardDownload

Challenges Encountered in Translation of Culture-bound and Subject-specific Terminology While Using Google Translate

by Javid S A B I R Babayev

2025, EuroGlobal Journal of Linguistics and Language Education

This study explores the limitations and challenges of using Google Translate as a translation tool, particularly in academic, professional, and literary contexts. While Google Translate provides rapid, accessible translation, various... more

descriptionView Paper arrow_downwardDownload

"DeepL translator: The grammatical and syntactic challenges of DeepL in translating a literary piece of work from Russian to English & Greek"

by Larisa Strikou

2025, Aristotle University of Thessaloniki

This research conducts the experiment of translating literary pieces by utilising one of the most recent as well as advanced tools, DeepL. Throughout the paper, the study elicits entities that discuss MT translation and computational... more

descriptionView Paper arrow_downwardDownload

The Influence of AI on Translation: A Transformative Change in the Language Industry

by Nguyễn Thị Tuyết Hạnh

2025

This research examines the tendency of changes of translation in connection with the development of artificial intelligence (AI). The study took place at an industrial university in Ho Chi Minh City, Vietnam, where one hundred students... more

descriptionView Paper arrow_downwardDownload

The Quality of Google Translate and ChatGPT English to Arabic Translation: The Case of Scientific Text Translation

by Khalil A Nagi

2025

The aim of the study is to investigate the quality of neural machine translation (NMT) and that of large language models (LLMs). The research team uses Google Translate and ChatGPT in the translation of various selected scientific texts.... more

mensions is represented in Figure 1 below. Figure 1. Error Distribution in Google Translate and ChatGPT.

Table 1. Results of annotators’ evaluation. Accuracy: This refers to errors that arise when the

Table 2. Number of Errors in the translated texts in Google Translate and ChatGPT.

descriptionView Paper arrow_downwardDownload

Arabic Text Formality Modification A Review and Future Research Directions

by Shadi Abudalfa

2025

Formality transfer seeks to adjust text formality without altering its core meaning, which carries substantial implications across diverse domains like machine translation, dialogue systems, and social media content creation. This study... more

descriptionView Paper arrow_downwardDownload

Applying Large Language Models in Legal Translation: The State-of-the-Art

by Martina Bajcic

2024, International Journal of Law and Language

While there is no denying that new AI technologies and tools are making a significant impact on translation, specialized translation remains problematic for automation especially in regard to terminology. Precise and consistent... more

Graph1. Papers in SCOPUS by subject area

Graph 2. Papers in WoSCC by subject area There were overlaps in results (identical papers retrieved), however overall, less papers were found in the field of legal translation. In regard to the area of law, within the ana- lysed period only two papers were detected in WoSCC (from the US and the UK). Gener- ally, more results concerned papers dealing with LLMs (than in SCOPUS), albeit not in relation to legal translation or translation. Some focused on using ChatGPT for tourism (Carvalho & Ivanov, 2023), ChatGPT for game jams (Grow & Khosmood, 2023), or tack- led bias of large language models (e.g. Gadiraju et al., 2023). Most papers that men- tioned LLMs such as ChatGPT, dealt with their application in the medical field, and in the context of bioethics. The four most cited papers were published in 2023 and 2024 on

Graph 3. Papers in SCOPUS by country or territory Comparing the countries with author and university affiliation, the biggest percent- age of papers is affiliated with the University of North Carolina at Charlotte (4), Dublin City University (4), The University of Edinburgh (3), etc.

Graph 4. Papers in SCOPUS by documents by affiliation

Graph 5. Papers in SCOPUS by country or territory under Computer Science

Graph 6. Papers in SCOPUS by affiliation under Social Science

Graph 7. Papers in SCOPUS by country under Arts and Humanities

Graph 9. Papers in WoSCC by country or territory

Table1. Groups of screened and analysed papers from WoSCC and SCOPUS

descriptionView Paper arrow_downwardDownload

Some Determinations as to whether or Not Academic Texts Are Produced by Artificial Intelligence

by Cemile Uzun

2024, Interdisciplinary Themes of Sociolinguistic Studies

In recent years, many studies have been carried out on the text generation of artificial intelligence (AI) tools. Some of these studies have analysed the text generation capability of AI tools, and some others have analysed the difference... more

Table 2. It was observed that AI uses terms in a more mechanised way regardless of context. The average number of words in a sentence of the AI tool is between 15 and 20 (Table 2).

Table 4. Some Determinations as to whether or Not Academic Texts Are Produced by Artificial Intelligence DOI: http://dx.doi.org/10.5772/intechopen.1007724

Table 3. The rate of difference in sentence structure in texts written by Human and Al. There were no spelling or punctuation errors in the texts produced by Human. In the texts produced by AI, only the word “anger” was misspelled as “anger”.

Table 5. ENT Ratio of differences in grammar in texts written by Human and Al. en A Cohen’ d value for the difference in paragraph structure between human-generated and Al-generated texts was found to be 0.5. The ratio of effect sizes between the two

Table 7. a adecietianieadl nal The rate of difference in the titles of the texts written by Human and AI. The content of a text written by two groups was analysed in depth and some observations were made. Factors such as the fluency of the style of the texts, the accuracy of the information, the structure of the thought and the support of the information in the texts with various data were analysed (Table 8). Cohen’ d value was found to be 0.4 in the difference between the human-generat

Table 8. The rate of difference regarding the structuring of information in the texts written by Human and Al.

Table 9. The rate - of difference in the originality of texts written by humans and by AI.

Table 10. =———e—es The rate of difference in the interpretation ability of the texts written by human and AI.

Table 12. Some Determinations as to whether or Not Academic Texts Are Produced by Artificial Intelligence DOI: http://dx.doi.org/10.5772/intechopen.1007724

descriptionView Paper arrow_downwardDownload

Revista Ideas Nº1

by Héctor Valencia

2024

Publicacion interna de la Escuela de Lenguas Modernas, de la Facultad de Filosofia, Historia y Letras de la Universidad del Salvador, dedicada integramente al estudio de las lenguas modernas, anual y multilingue. Numero 1 de la Primera... more

descriptionView Paper arrow_downwardDownload

الأخلاقيات الرقمية للموارد البشرية والضوابط الإدارية والتشريعات القانونية لتوظيف الذكاء الاصطناعي في قطاع البحث العلمي

by Dr. Shaimaa Osamaa Mohamed Saleh

2024, Digital Ethics for Human Resources and Administrative Controls and Legal legislation to Employ the Artificial Intelligence in Scientific Research Sector

The digital ethics of human resources in the scientific research sector in light of the artificial intelligence revolution is a vital field that requires careful organization to ensure that its use is directed towards human benefits in order to avoid generating bad purposes or unintended results through the development of administrative and legal controls, which requires the speedy preparation of a code of honor with values and ethical controls for scientific research.
The problem of the research is the lack of an administrative framework for the digital ethics of human resources in scientific research, which led to illegal practices of human resources when they use artificial intelligence, and the administrative controls related to it are unclear as a result of the existence of legal gaps, and the research aims to analyze the impact of digital ethics of human resources and administrative controls And legal legislation on the employment of artificial intelligence in the scientific research sector, and the research has relied on the inductive approach, and its results proved a strong positive relationship between the role of digital ethics and the employment of artificial intelligence in scientific research, as future aspirations are moving towards employing smart technology and the advanced industrial revolution to employ scientific research Achieving the sustainable development goals , its results also resulted in a strong correlation between the role of administrative controls and legal legislation and the employment of artificial intelligence in the scientific research sector, as laws do not respond to the requirements of artificial intelligence and digital technology, despite the potential risks of the growing use of artificial intelligence applications, there are few laws Which is indirectly related toartificial intelligence, and the research reached the development of an analytical framework that contributes to interpreting the role of digital ethics for human resources and the management of administrative controls and legal legislation in employing artificial intelligence in the scientific research sector, and its dimensions were represented in culture, digital leadership and digital training, and the research also recommended the need to move towards developing a global framework for values, principles and procedures necessary for their development.Administrative controls and legislation related to artificial intelligence in accordance with international law.
Keywords:
Digital ethics for human resources - administrative controls - legal legislation - employment of artificial intelligence in the scientific research sector.

descriptionView Paper arrow_downwardDownload

Down the rabbit hole: Machine translation, metaphor, and instructor identity and agency

by Kimberly Vinall

2024

While machine translation (MT) technologies have improved in profile and performance in recent years, there is still much to learn about the broad impact of these technologies on language educators. In this article, we investigate... more

descriptionView Paper arrow_downwardDownload

Использование возможностей искусственного интеллекта при переводе удмуртской литературы на русский язык: первый опыт

by Egor Lebedev

2024

В статье рассматривается первый опыт использования нейронных сетей для перевода художественной литературы с удмуртского языка на русский на примере сборника рассказов Багай Аркаша «Перепеч». Анализируются преимущества и недостатки... more

descriptionView Paper arrow_downwardDownload

Minoan Cryptanalysis: Computational Approaches to Deciphering Linear A and Assessing Its Connections with Language Families from the Mediterranean and the Black Sea Areas (In "Computational Linguistics and Natural Language Processing")

by Francesco Perono Cacciafoco

2024, Computational Linguistics and Natural Language Processing

Nepal, Aaradh, and Francesco Perono Cacciafoco. (2024). Minoan Cryptanalysis: Computational Approaches to Deciphering Linear A and Assessing Its Connections with Language Families from the Mediterranean and the Black Sea Areas. In Revesz,... more

Table 5. Python program results for Hittite. Table 6. Python program results for Proto-Celtic.

Table 7. Python program results for Uralic.

Figure 8. Paraphrasing sections with ChatGPT has a tendency to result in sections shorter than the original. The reduction in section length is most visible for the longer introduction and conclusion sections. For an analysis of lengths of generated fake scientific papers, see Figure 7 in the appendix.

Figure Al. Example 1 of prompting ChatGPT to produce sections of a scientific paper given the paper title.

Table 3. Python program results for Ancient Egyptian. Table 4. Python program results for Luwian.

Figure 6 shows the distribution of Flesch-Kincaid Grade Level [69] and Gunning Fog [70] readability metrics [71] for papers from the different generators and real papers. Flesch-Kincaid measures the technical difficulty of the papers, while Gunning Fog mea- sures the readability of the papers. The comparison confirms our observation that our machine-generated papers are representative of real papers with a slight increase in writing sophistication from SCIgen and GPT-2 to ChatGPT and GPT-3 generators, with Galactica being the median.

Figure 7. The generators exhibit different tendencies for the length of the generated fake scientific papers. (a) shows the length distribution of generated abstracts, (b) shows the same for introductions, and (c) shows conclusion lengths.

Figure 1. The workflow diagram of the proposed approach for sentiment classification. The proposed methodology’s workflow is depicted in Figure 1, illustrating the steps involved. Firstly, unstructured tweets related to ChatGPT are collected from Twitter using the Twitter Tweepy API. These tweets undergo several preprocessing steps to ensure cleanliness and remove noise. Lexicon-based techniques are then utilized to assign labels of oositive, negative, or neutral to the tweets. Feature extraction is performed using the Bag of Words (BoW) technique on the labeled dataset. The data is subsequently split into an 30/20 ratio for training and testing purposes. Following model training, evaluation metrics such as accuracy, precision, recall, and the F1 score are employed to analyze the model’s oerformance. Each component of the proposed methodology for sentiment classification is discussed in greater detail in the subsequent sections.

Figure 2. The architecture for the proposed sentiment classification. Robustly optimized BERT pretraining (RoBERTa) [72] is a transformer-based model used for various NLP tasks. It was developed in 2019. RoBERTa is a modification of the BERT model to overcome the limitations of the BERT model. RoBERTa is trained on 160 billion words, whereas BERT is trained on only 3.3 billion words. RoBERTa is trained on large data sets, is fast to train, and may use large batch sizes. ROBERTa uses a dynamic masking approach, and BERT uses a static approach.

Figure 3. Performance of models using the TextBlob and VADER techniques. The X-axis presents the machine learning models that we utilized in this study, and the Y-axis presents the accuracy score. Figure 3. Performance of models using the TextBlob and VADER techniques. The X-axis presents the Table 5 also shows the results of various models using the VADER technique. Us- ing a VADER lexicon-based technique, SVM performs best with an accuracy of 90.72%. The models SGD and GBM both achieved an 89% accuracy score. The model that performs worse, in this case, is KNN, with a 54.38% accuracy. This model also performs poorly on the TextBlob technique. The only model in machine learning that performs with the highest accuracy is SVM with the linear kernel. The accuracy score of various machine learning models using TextBlob and Vader are compared in Figure 3.

Figure 4. Comparison of LDA-based and BERT-based topic modeling techniques through word clouds: (a) Visualization of tweets using LDA topic modeling, and (b) Visualization of tweets using BERTopic modeling.

Figure 5. Most Prominent Topics extracted from ChatGPT Tweets using BERTopic. Figure 5 depicts the most prominent topics extracted by BERTopic. First, we load the BERT model and associated tokenizers. The tweet data are then preprocessed to extract the embeddings for the BERT model. Then, for dimension reduction or clustering, we used k-means clustering and the principal component analysis (PCA). The BERT model was used to extract the most prominent topics, which were then displayed in a scatter plot.

Figure 6. Words extracted from top ten topics with their frequency using the LDA model.

Figure 7. Visualization of highly discussed positive topics.

Figure 8. Visualization of highly discussed negative topics.

Figure 9. Sentiment ratio in extracted data. In this study, we observed that the majority of sentiment towards chatGPT was positive, indicating a generally favorable perception of the tool. This aligns with the notion that chatGPT has gained significant attention and popularity on various online platforms. The positive sentiment towards chatGPT can be attributed to its advanced language generation capabilities and its ability to engage in human-like conversations. Figure 9 shows the sentiment ratio for chatGPT.

Figure 10. SentimentViz output for chatGPT sentiment. Additionally, we conducted an analysis using an external sentiment analysis tool called SentimentViz [78]. This tool allowed us to visualize people’s perceptions of ChatGPT based on their data. The sentiment analysis results obtained from SentimentViz comple- mented and validated the findings of the proposed approach. Figure 10 presents visual representations of the sentiment expressed by individuals regarding ChatGPT. This visu- alization provides further support for the positive sentiment observed in our study and reinforces the credibility of our results.

Figure 1. Architecture of SPARSAR with main pipeline organized into three levels. processed at first at a syntactic and semantic level and grammatical functions are evaluated Then, the poem is translated into a phonetic form, preserving its visual structure and its subdivision into verses and stanzas. Phonetically translated words are associated with mean duration values taking into account position in the word and stress. At the end of the analysis of the poem, the system can measure the following parameters: mean verse length in terms of msec. and in number of feet. The latter is derived by a verse representation into metrical structure. Another important component of the analysis of rhythm is constituted by the algorithm that measures and evaluates rhyme schemes at the stanza level and then the overall rhyming structure at the poem level. In addition, the system has access to a restricted list of typical pragmatically marked phrases and expressions that are used to convey specific discourse function and speech acts, and need specialized intonational contours.

Figure 2. The eleven most positively marked sonnets: 7, 24, 43, 47, 52, 76, 85, 87, 128, 136, 154. As will appear clearly from the charts below, all the data show a contrasting behaviour which will be attested by correlation values. Where sentiment values increase, the cor- responding values for vowels and consonants decrease. To allow better perusing of the trends we split the sonnets into separate tables according to whether their sentiment values are positive or negative. The first chart contains the eleven sonnets which received the highest positive sentiment values. All the charts are drawn from the tables of data derived from the analysis files in xml format, which will be made available as supplementary data (please see Figure 2).

Figure 3. Chart of the 16 borderline sonnets positively marked for sentiment. In this chart we added the ratio for Abstract/Concrete, which shows a peak for son- net 73. As the chart clearly shows, the line for Sentiment borders 1, as to the remaining variables, Vowels is the one oscillating most after Abstract. Voiced and Consonants are fairly always aligned apart from sonnet 33 and 102. In both sonnets, the number of “Ob- struents” (|b,d,p,t,k,g |) is very low and real consonants are substituted by “Continuants” (|s,sh,th,f,v,h|) both voiced and unvoiced. In the following analysis, for this reason, I will only consider Voicing as the relevant variable for consonants and this will show better agreement in the overall data. Now, we show charts for all negatively marked sonnets using only three variables, starting from Figure 4 below.

Figure 4. Chart of the 42 negatively marked sonnets: 3, 8, 9, 19, 28, 30, 34, 35, 50, 55, 57, 58, 60, 62, 63, 65, 66, 71, 86, 89, 92, 103, 107, 112, 116, 120, 121, 124, 126, 127, 129, 132, 133, 134, 138, 139, 140, 143, 146, 148, 149.

Figure 5. The eleven most positively marked sonnets show the same slightly positive correlatio for Vowels—Voicing but very strong negative correlation between Vowels-Sentiment and slight! negative for Voicing /Sentiment at —0.11423482—colours in this case have no meaning. Figure 5. The eleven most positively marked sonnets show the same slightly positive correlatior Correlation between Vowel and Sentiment is positive but very weak; correlation be- tween the Voicing parameter and Sentiment is again negative and very weak at —0.0065037. Thus, results for the 42 sonnets negatively marked by sentiment show that we have negative correlation between vowels and voicing, and vowels and sentiment, but positive correlation between voicing and sentiment. So, it is just the opposite of what we obtain with positively marked sonnets. And finally, in Figure 5. we show the eleven most positively marked sonnets show the same contrasting results.

Figure 6. The 44 sonnets classified with Sarcasm with the highest level of Judgements—colours in this case have no meaning.

Figure 7. The 50 sonnets classified with Irony, with a lower level of Judgement Negative but higher Affect Negative.

Figure 8. The 60 Sonnets classified by critics as neutral.

Figure 9. Distribution of 89 sonnets manually classified by ATF with no contrast. 23a — RAP Agee PH

Figure 10. Distribution of 65 sonnets classified as Judgements with contrast and their sound data. All correlation measures with Judgements are negative:

Figure 11. Distribution of 65 sonnets classified by ATF as Affect with contrast and their sound data. In Figure 11 below we use again sound data and the second parameter Affect:

Figure 12. Distribution of 65 sonnets classified by ATF as Appraisal with contrast and their sound data.

Comparing the “contrast” criterion with the sentiment-based classification is not pos- sible; however, the “contrast” group of sonnets is included in majority by the “negatively” marked sonnets, with the exception of 16 sonnets which are the following ones: Now the only positive correlations are the ones shown by Affect with Vowels and with Appraisal; the remaining correlations are all negative. The subdivision operated now using our manual classification with ATF seems more consistent than the one made before using the critics’ evaluation. As a first comment, these data confirm our previous evaluation made on the basis of sentiment analysis, i.e., the sonnets are mainly disharmonic due to Shakespeare’s intention to produce ironic effects on the audience. Here below is the list of the 89 sonnets classified by our manual ATF labeling as having no contrast:

Figure 13. Poems considered as deviants evaluated for their degree of sense /sound harmony. WNT OER TAFE MANA EM SETI BE GSS | MEET ences ava | SE REOTOERY ier em I, In addition to the evaluation of positive /negative values, we consider the two parameters we already computed related to Metrical Length and Rhyming Scheme that we add together and use for its 10% added value to compensate for poetic relevant features. On the basis of poetic devices analyzed by SPARSAR, a list of 14 poems is considered as deviant, and they are the following: A Sunrise, The Gunner, The Explorer’s Wife, For My Grandfather, Idyll, Middle Harbour, Politician, To a Poet, The Captain of the Oberon, Palace of Dreams, The Room, Vancouver by Rail, Henry Lawson, and Achilles and the Woman. In Figure 13 we show the first map of sense-sound evaluation where the split of the “deviants” poems appears clearly:

Figure 14. Sixteen poems from different periods of Webb’s poetic production computed for their Sense/Sound Harmony.

Figure 15. Sixteen poems taken mainly from late poetic production computed for their sense /sound harmony. In Figure 16, I will now show a bigger picture containing 50 poems, where we can see again the great majority of them being positioned on the left hand side. The positive side is enriched by “Moonlight” from Early Poems, and “Song of the Brain” from Socrates, and the middle disharmonic list now counts 16 poems.

Figure 16. Fifty poems computed by sense/sound harmony.

Figure 1. Universal Dependency tree for “It gives us the basis for several deductions”. structure of an entire sentence, as visualized in a dependency tree such as the one showr below (Figure 1). The syntactic “path” from the sentence root to each “leaf” token is given by the ‘combination of head id and dependency relationship. The syntactic function of each worc s clearly and specifically defined by these two values. For example, the word basis is the bj of the word gives. obj is the UD label for what is traditionally called the “direct object of a verb (a list of syntax labels along with examples can be found on the UD website ittps: / /universaldependencies.org/en/dep/index.html, accessed 1 January 2024). The vord several is labeled as amod of deductions. amod indicates an adjectival modifier. The worc leductions itself is an nmod of basis. In UD annotation, nmod means “nominal modifier”, < 1oun or noun phrase directly dependent on and specifying another noun (or noun phrase ‘or example, the prepositional phrase in “toys for children”.

Figure 1. This work’s overview. Six methods are used to machine-generate papers, which are then mixed with human-written ones to create our benchmark dataset. Seven models are then tested as baselines to identify the authorship of a given output. Figure 1. This work’s overview. Six methods are used to machine-generate papers, which are then

3.2. Fake Papers Generation AUS This combination of models, ranging from CFG to state-of-the-art LLMs, aims to generate a diverse set of artificially generated scientific papers. Concrete examples of generated papers can be found in Appendix A.

Figure 3. Our co-created test dataset TEST-CC contains 4000 papers with varying shares of real and ChatGPT-paraphrased sections. The co-created component of our dataset mimics papers written by humans and models concurrently, a combination that is likely to appear in practice. That means texts originally written by either a human or an LLM and subsequently extended, paraphrased, or otherwise adjusted by the other. To create such papers at scale, we take a set of 4000 real papers from our TEST dataset (see Table 2) and paraphrase them with ChatGPT [8]. To stay within ChatGPT’s context length limits, we paraphrase each paper section—i.e., abstract, introduction, and conclusion—in a separate prompt. We then construct co-created papers with varying shares of human and machine input by combining original and paraphrased sections as shown in Figure 3.

Figure 4. LLMFE follows a four-step process: (1) Generate features suitable for distinguishing real and fake papers using the LLM based on multiple pairs of one real and one fake paper each. (2) Remove duplicate features through hierarchical clustering on embeddings of the feature descriptions. (3) Score scientific papers along the remaining features using the LLM. (4) Finally, train a Random Forest Classifier to predict the real or fake label based on the feature scores.

Figure A2. Example 2 of prompting ChatGPT to produce sections of a scientific paper given the paper title.

Figure A3. Extract from the hierarchical clustering dendrogram learned during the feature consolida- tion step of LLMFE. The full dendrogram lists all 884 features. The distance threshold was chosen so that 83 clusters were created from the 884 features. Figure A3. Extract from the hierarchical clustering dendrogram learned during the feature consolida-

Figure A5. Explainability insights from our Logistic Regression (LR) and Random Forest (RF)

Figure A12. RoBERTa: Example of SHAP explanation on a ChatGPT generated abstract correctly classified.

Figure A15. RoBERTa: Example of LIME explanation on a GPT-2 generated abstract correctly classified. Figure A14. RoBERTa: Example of LIME explanation on a SClgen generated abstract correctly classified.

Figure A18. Galactica: Example of SHAP explanation on a real paper correctly classified. Appendix C.3. Galactica

Figure A19. Galactica: Example of SHAP explanation on a misclassified real paper.

Figure A20. Galactica: Example of SHAP explanation on a Galactica generated paper correctly classified.

Figure A21. Galactica: Example of SHAP explanation on a misclassified Galactica generated paper.

_ _ de Oo _ YY - ne YY _ For a clear and systematic picture aiming to aid the reader with understanding this Paper, a concept map illustrating a summary of the main topics and their relations discussed in our work is shown in Figure 1. Figure 1. Concept map of the main topics and their relations discussed in the paper.

When we carefully examine the parts of speech distribution across non-fictional texts, we can note that instructional texts had significantly fewer adjectives, adverbs and auxiliary verbs when compared to other non-fictional texts. Also, the concentration of proper nouns in instructional texts is statistically higher than in any other text. Persuasive texts have a statistically significant fewer number of nouns, punctuation and adpositions, but higher values in pronouns and verbs overall. Based on the values of determiners, particle structure, subordinate conjunctions and interjections, we can group non-fictional texts into two groups: discussion—persuasive and explanatory-—instructional. No significant differences were noted in the lexical density across all the subgenres. Therefore, it can be noted that open class and closed class words are equally important in the classification of texts into fictional and non-fictional genres.

Figure 2. Variable importance plot of the RF genre model. NOTE: The x-axis shows the permutation relevance (mean decrease in accuracy) of each feature; the y-axis lists the features of the genre model.

Figure 3. Variable importance plot of the RF sub-genre model. NOTE: The x-axis shows the per- mutation relevance (mean decrease in accuracy) of each feature; the y-axis lists the features of the subgenre model.

Figure 4. SFS optimal features of each feature set.

precise meaning of a polysemous word. Notably, the 1980s witnessed significant progress in WSD research, facilitated by the availability of extensive lexical resources and corpora. Ultimately, WSD entails the task of identifying the accurate sense of a word within its specific contextual framework [3]. WSD is not considered a final objective; instead, it is recognized as an intermediary task with relevance to various applications within the field of NLP. Figure 1 presents the WSD conceptual diagram. In machine translation, WSD is an important step because a number of words in every language have a different translation according to the context of their usage [3-6]. It is an important issue to be considered during language translation. WSD assumes a crucial role in ensuring precise text analysis across a wide range of applications [7,8]. For example, an intelligence-gathering system could distinguish between references to illicit drugs and medicinal drugs through the application of WSD. Research works such as named entity recognition and bioinformatics research can also use WSD. In the realm of information retrieval (IR), the primary concern lies in determining the accurate sense of a polysemous word within a given query before initiating the search for its corresponding answer [9,10]. Enhancing the efficiency and effectiveness of an IR system entails the resolution of ambiguity within a query. Similarly, in sentiment analysis, the elimination of ambiguity is crucial for determining the correct sentiment tags (e.g., negative or positive) associated with a sentence [11,12]. In question-answering (QA) systems, WSD assumes a significant role in identifying the appropriate types of answers that correspond to a given question type [13,14]. Furthermore, WSD is necessary to accurately assign the appropriate part of speech tagging (POS) to a word, as its POS can vary depending on the contextual usage [15,16]. ei gag oes 1 ie eww # ae egy aw a ae 1. =e a = 1

Various approaches and methods used for WSD are classified into two categories, including knowledge-based approaches and ML (Machine Learning) based approaches. In knowledge-driven approaches, external lexical resources such as Wordnet, dictionary, and thesauri are required to perform WSD, and in ML-based techniques, classifiers are trained to carry out the WSD task on sense-annotated corpora. Figure 2 presents the different WSD approaches, and the explanation for each category can be explained further.

Figure 3. Decision Tree Example. A decision tree is a classification method that repeatedly divides the training dataset ind organizes the classification rules in a tree-like structure [26,27]. Every interior node »f the decision tree represents a test performed on an attribute value, and the branches epresent the outcomes of the test. The word sense is determined when a leaf node is eached. An illustration of a decision tree for WSD is depicted in Figure 3. In this example, he sense of the polysemous word “bank” that is active is a noun within the sentence, “I vill be at the bank of the Narmada River in the afternoon.” The tree has been constructed ind traversed to ultimately select the sense “bank/RIVER.” A null value in a leaf node ndicates that there is no sense selection present for that particular attribute value.

Figure 4. Illustrating SVM Classification. An SVM [32] serves the purpose of both classification and regression tasks. This approach is rooted in the concept of identifying a hyperplane that can effectively isolate positive examples from negative ones with the highest possible margin. The edge/margin represents the interspace between the hyperplane and the nearest examples for positive and negative, which are referred to as support vectors. In Figure 4, circle and square represent two different classes, the bold line represents the hyperplane that isolates the two classes while the dashed lines indicate the support vectors closest to positive and negative example. These support vectors play an important role in constructing an SVM classifier. The vectors have an impact on the position and the orientation of the hyperplane, and by removing or adding support vectors, adjustments can be made to the position of the hyperplane. In Figure 4,

Figure 5. Ensemble Methods: Combining the Strengths of Multiple Models. In order to enhance the accuracy of disambiguation, it is common to employ a combi- nation of different classifiers. This combination strategy is called ensemble methods, which combine algorithms of different nature or with different characteristics [37]. Ensemble methods are more powerful than single-supervised techniques as they can overcome the weakness of a single approach. Strategies such as majority voting, the AdaBoost system of Freund and Schapire [38], rank-based combination, and probability mixture can be utilized to combine the different classifiers to improve accuracy. Figure 5 presents the simple approach of the ensemble WSD approach. 2.2.2. Unsupervised Techniques Unsupervised techniques do not make use of sense annotated datasets or external knowledge sources. Instead, they operate under the assumption that senses with similar meanings occur in similar contexts. These techniques aim to determine senses from the text by clustering the word occurrences based on some measure of contextual similarity. This task is known as word sense induction or discrimination. Unsupervised techniques offer significant potential in overcoming the bottleneck of knowledge acquisition, as they do not require manual efforts. Here are some approaches that are used for unsupervised WSD.

Figure 6. Flowchart of WSD Execution Process.

Figure 1. A structure generated by the S3 system in the third iteration. The F, X, and Y symbols are mapped to the draw forward action, symbols [ and | traditionally represent the save and return to the position actions, and the characters + and — command the cursor to turn by an angle of +27.5°.

Figure 2. General Outline of the Grammar Inference Algorithm. 2.6. Image Parsing The process of parsing the image into an input sequence is divided into three steps. First, all of the straight lines are detected in the image. Then, all of these lines are connected, building a model of the structure in the image so in the last step we can generate a sequence that accurately describes this model.

Figure 3. A few line points cast into virtual space with granularity = 2.

Figure 4. General outline of a single iteration of the second phase of the fitness function. nee 2 ae ed vhich gives us a set of vectors [xo Xp wee Xn & Al , where x, and x, is the occurrence ‘ount of terminal symbol in the nth rule and the axiom, respectively. Because the number of ‘ombinations can be large, we can take the simplest ten for the best results. Now we know 1ow many symbols to insert but not where. To avoid exploring all of the possibilities, since he rules’ successors must appear in the target sequence, we can reduce the search space by only using the appropriate subsequences in the target sequence, which is done in step 3. ‘rom the found subsequences, in step 4, the algorithm generates a population for the GA. since the axiom does not appear in the sequence, we only know the number of symbols to »e added but not their positions; therefore, the symbols are randomly inserted. In the last step, a GA finds a system that can recreate the target sequence using the generated initial opulation.

Figure 5. Runtime distribution for single rule system. Figure 6. Runtime distribution for two rule system.

To test the replicability of our results, runtime distribution for the GA has been tested on systems with one and two rewriting rules. Test examples were taken from [14]. As seen in Figure 5, for a single rule system, most algorithm executions ran for a similar amount of time, around 100 ms, with very few stragglers that ran for more than 600 ms. For a system with two rules (Figure 6), we can notice similar behavior. However, here, we can see that a significant amount of runs finished quickly, meaning the initial population already contained a candidate with very high fitness. This lets us conclude that the algorithm has a low tendency to get stuck in areas of search space containing candidates with low fitness.

Figure 8. Fitness progression of our algorithm. Figure 9. Fitness progression of the algorithm from [12].

Figure 7. Hits heatmap of our algorithm.

Figure 10. Hits histograms of the algorithm from [12].

_ Below we show an enhanced image of the first sign from the right in the third row, our drawing of it, and the Old Hungarian n sign written with a mirror symmetry and an 4 sign:

Here we need to be careful to ignore the engravings that depict part of the back and the belly of a deer. The lines to be ignored are shown in black in our drawing. The seventh sign from the right in the third row is an Old Hungarian ft sign:

In the enhanced photo, the Old Hungarian 1 is clearly visible. In addition, there are two parallel lines that belong to the head of one of the engraved deers. These lines do not belong to the Old Hungarian inscription and should be ignored. Unfortunately,

ae Se eee In the fourth line, there are additional missing details in SartkoZauly’s drawing. The first sign from the left is missing its top half, the diamond sign misses on side, and in the second word, which is written with smaller signs, the third sign from the right misses a small horizontal crossing line segment. These can also be verified by a careful observation of the original photo of the Altai inscription. In addition, the following ligature was also overlooked: 3.2. Transliteration and Translation of the Altai Inscription

Table 1. Cont. There are certain peculiarities in the Latin transliteration that we made in order to ob- tain meaningful words. In particular, we believe that the scribe was not using the standard Old Hungarian signs but mixed up some of the similar looking signs. In particular, the scribe mixed up the Old Hungarian letters for r and z, which are the following, respectively:

Figure 2. An alternative drawing of inscription. Here the red lines are those that seem extra to the letters that are apparently needed for a meaningful reading of the inscription.

3. Data Sources and Data Curation KarZaubaj SartkoZauly’s drawing had some minor inaccuracies. He included a photo- graph in his work. A new drawing based on that photo is shown in Figure 1. The drawing shows that some parts of the inscription are unclear because of the drawings of the deer and some cracks in the rock.

Figure 1. The author’s redrawing of the inscription based on the photograph in SartkoZauly [4].

Figure 2. An enhanced drawing of the inscription with red highlighting of those elements that undisputedly belong to the inscription. The six sign groups are also labeled (a-f).

Figure 3. Two interpretations of sign group (d) in the middle of the photograph. The first interpretation of sign group (d) leads to the following sign sequence:

Figure 4. A feature analysis of the Old Hungarian Runic signs: 1 indicates that the sign in the row contains the feature in the column; —1 indicates that it does not contain the feature. This analysis uses the Altai Mountain version of the Z sign.

Among the above, the G-L pair has a similarity of 12, the R-Z and the Z-CS pairs have similarities 11 and 13, respectively, and the D-I pair has a similarity of 12. Hence, these frequently mixed up pairs also have high similarity scores according to the similarity matrix in Figure 5. Hence, the strong agreement between the mathematical model and the teacher’s experience shows that the G-L and R-Z pair mix-ups in the Old Hungarian Runic inscription in Figure 1 were likely due to an accident. 0 _ At my request, Klara Friedrich, a prominent researcher and teacher of the Old Hun- garian Runic script, verified that, in her decades of experience, it is common to mix up the following letters:

Figure 6. The Dulo clan’s tamga (a), Kayi tamga (b), and Peter Kun’s tamga (c). Picture credits: Wikipedia https: / /en.wikipedia.org /wiki/Dulo (accessed on 16 May 2022) and https: / /en.wikipedia org/wiki/Tamga (accessed on 16 May 2022).

Figure 7. Dr. Peter Kun’s email that verifies that he wrote the inscription in June 2000. This original email contains some minor misspellings. For example, the names of ethnic groups are written in lowercase letters, which is the common way of writing ethnic names in Hungarian. Figure 7. Dr. Peter Kun’s email that verifies that he wrote the inscription in June 2000. This original

Figure 8. Confusability of Peter Kun’s tamga (left) and Old Hungarian signs (right).

Figure 9. A valid decipherment needs to get three things correct: signs, syntax, and semantics. The above Venn diagram places four proposals for sign group (d) on the basis of correctness according to these three criteria.

Table 1. Summary of related work. As a result, this paper proposes a transformer-based BERT model that leverages self-attention mechanisms, which have demonstrated remarkable efficacy in the context of machine learning and deep learning. The proposed model addresses the problems mentioned in the literature review. They have the ability to comprehend the correlation between consecutive items that are widely separated. The transformers achieved an exceptional performance. Additionally, the performance of the proposed method was

Table 2. Dataset statistics after splitting. The most important step in natural language processing (NLP) is the pre-processing ge. It enables us to remove any unnecessary information from our data so that we can oceed to the following processing stage. The Natural Language Toolkit (NLTK), which vides modules, is an open-source Python toolkit that can be used to perform operations ch as tokenization, stemming, classification, etc. The first step in preprocessing is to nvert all textual data to lowercase. Conversion is an essential step in sentiment classifica- n, as the machine considers “ChatGPT” and “chatgpt” as individual words. The dataset ntains text in upper, lower, and sentence case, which the model takes separately, which ects the classification performance as well and makes the data more complex if we do not nvert it all into lowercase. The second step is to remove numbers from the text because 2y do not provide meaningful information and are useless in the decision-making process. e removal of numerical data enhances the quality of the data [44]. The third step is to nove punctuation such as [?,@,#,/,&,%] to increase the quality of the dataset and the rformance of the models. The fourth step is to remove HTML and URL tags that also ovide no important information. The URLs in the text data are meaningless because 2y expand the dataset and require extra computation. It has no impact on the machine ning performance. The fifth step is to remove stopwords like ‘an’, ‘the’, ‘are’, ‘was’, s’, ‘they’, etc., from the tweets during preprocessing. The model’s accuracy improves, d the training process is faster, with only relevant information [44]. Additionally, the re- oval of stopwords allows for a more thorough analysis, which is advantageous for a 1ited dataset [45]. The last step is to perform stemming and lemmatization. The effective- ss of machine learning is slightly influenced by the stemming and lemmatization steps. ter performing all important preprocessing steps, the sample tweets are presented in ble 3.

Table 4. Hyperparameters and their tuned values for experiments.

Table 5. Results of machine learning models using VADER and TextBlob techniques.

Table 6. Results of deep learning models using the TextBlob technique.

Table 7. Results of deep learning models using the VADER technique.

Table 8. Performance of transformer-based models using the TextBlob technique. Table 9. Performance of transformer-based models using the VADER technique.

Table 10 shows the correct and wrong predictions by deep learning and BERT models ising the TextBlob. Results are given only for the TextBlob technique, as the models perform vell using the TextBlob technique. Out of 4000 predictions, the RNN made 3614 correct oredictions and 386 wrong predictions. The LSTM made 3718 correct predictions while 282 predictions are wrong. The BiLSTM has 3725 correct and 275 wrong predictions. The GRU shows 3693 correct predictions, compared to 307 wrong ones. Out of 4160 pre- dictions, the XLNet made 3576 correct and 584 wrong predictions. On the other hand, he RoBERTa made 3897 correct and 263 wrong predictions. The BERT made 4015 correct oredictions whereas 146 predictions are wrong. The results demonstrate that the BERT nodel performed better than the machine learning and deep learning models. Only with 2835 correct and 1165 wrong predictions, the only CNN model performed poorly.

Table 10. Correct and wrong predictions by various models using the TextBlob technique.

Table 10. Cont. 4.4. Results of K-Fold Cross-Validation

Table 11. K-fold cross-Validation results using TextBlob and VADER approaches.

Table 12. Statistical test comparison with the proposed model.

Table 13. Comparison of proposed approach with state-of-the-art existing studies. 4.8. Validation of Proposed Approach on Additional Dataset

Table 14. Experimental results on the SemEvel2013 dataset. 4.9. Statistical Significance Test

Table 15. Statistical significance t-test. The t-test can be interpreted as if the output p-value is greater than the alpha value (0.05), it indicates that the H, is accepted and there is no statistical significance. Moreover, if the p-value is less than the alpha value, it indicates that Ho is rejected and H, is accepted which means that there is statistical significance between the compared results. We perform a t-test on results using Textblob and compare all models’ performances. In all scenarios, the proposed approach rejects the Hy and accepted the Hz, which means that the proposed approach is statistically significant in comparison with other approaches.

Table 1. Distribution of sounds of end-of-line rhyming words divided into four phonological classes.

Table 2. Subdivision of the sonnets by number of classes. There is one sonnet with only one class and it is sonnet 146; then, there are 13 sonnets with 2 classes of sounds: 8, 9, 64, 71, 79, 81, 87, 90, 92, 96, 124, and 149. These sonnets contain rhyming pairs with low and middle sounds, except for three sonnets: sonnet 71 which contains high-back and middle sounds; sonnet 9 which contains high-front and low sounds; and sonnet 96 containing high-front and middle sounds. The themes developed in these sonnets fit perfectly into the rhyming sound class chosen. Let us consider sonnet VIII which is all devoted to music and string instruments which require more than one string to produce their sound, thus suggesting the need to find a companion and get married. Consider the line “the true concord of well tunéd sounds,” where hints to the need that sounds should be “well” tuned. Sonnet 81 celebrates the poet and his verse which shall survive when death will come. Sonnet 92 is in fact pessimistic in the possibility that love will last “for the term of life” and no betrayal will ensue. As to sonnet 146, it is a mixture of two seemingly different themes: a criticism of extravagant display or rich clothing of wealth by writers of the time, or perhaps his mistress and trying to convince her to change her ways for eternal salvation. Some critics regard this as the most profoundly religious or meditative sonnet. But, the feeling of the lover renouncing something brings back his mistress and the feeling of being powerless against her chastity, so that religious life becomes a desirable aim. In this sense, death can also be depicted as desirable. It is important to notice the overall strategy of choice of sound in relation to meaning,

Table 3. Total count for vowel, final consonants and sonorant sounds organized into classes for all Shakespearean sonnets. Eventually, we come up with 61 more frequent heads with occurrences up to four and a total of 778 repeated vowel and consonant line-ending sounds. We now consider the remaining 288 rhyming pairs organized into “head” and “dependent”, i.e., the preceding end of the line’s rhyming word and the one in the corresponding alternate /adjacent end of line.

Table 4. (a) Distribution of stressed rhyming vowels in five phases. (b) Weighted values of the distribution of stressed rhyming vowels in five Phases.

Table 5. (a) Distribution of stressed diphthongs in the sonnets divided in 5 phases. (b) Weighted valued of the distribution of stressed diphthongs in the sonnets in 5 phases. Both Phases 1 and 4 show a decrease of middle vs. low diphthongs, while the remain- ing three phases behave in the opposite manner: more middle than low diphthongs. The total distribution indicates Phase 3 as the highest number of diphthongs and Phase 4 as the lowest, just the opposite of the previous distribution. General totals show a distribution of middle vs. low diphthongs which is strongly in favour of middle ones. This is just the opposite of what we found in previous counts, and in part then compensates with the lack of high diphthongs.

Table 6. Sound image of the sonnets. 3.2. Rhyming and Rhythm: The Sonnets and Poetic Devices 3.2.1. Contractions vs. Rhyme Schemes

Table 7. Number of rhyme violations x five phases. We call these (pseudo) rhyming violations because current reciters available on Youtube do not dare use the old pronunciation required and produce a rhyming vio- lation by using Modern English pronunciation. One of these reciters is the famous actor John Gilgoud, who when reading Sonnet 66, correctly pronounces DESERT with its original meaning, but then in Sonnet 116 produces three violations when rhyming pairs required transformations that were clearly mandatory in Early Modern English, and they are | love | to be pronounced with the vowel of |remove! in lines 2/4, |come! to be pronounced with the vowel of |doom| in lines 10/12, and | loved! to be pronounced with the vowel of | proved | in the couplet. How do we know that these words should be pronounced in that manner and not in the opposite way—say | remove]! as |love!, |doom! as |come! and | proved! as | loved |, as is being asserted by Ben Crystal son of David? There are three criteria that determine the way in which words should rhyme: the first one is the rhyming constraints which were so stringent at the time owing to the fact that poetry was only recited and not read on books. Okay, then, there are rhyming constraints but how do they work, in which direction? The direction is determined by two factors: the first one is determined by universal phonological principles, as for instance the one the governs phonological variations of vowel sounds—in the vowel shift of verbs or nouns due to morphological changes—which systematically changed “low” and “mid” features into “high” features and not vice versa [32]. The other factor is simply lexical: i.e., not all words will be subject to a transformation in that period. As a result, some words had double pronunciation. This was extensively documented in books and articles published at the time and written by famous poets like Ben Jonson and a great number of grammarians of the XVI and XVII century. All this information is made available by the famous historical phonologist Wilhelm Vietor of the XIX century in a book published at first in 1889 (2 (we use 1909 Vol 2. edition that can be freely visualized at: https:/ /books.google.it/books?id=rh™EQAWAAQBAJ&printsec= frontcover&hl=it&source=gbs_ge_summary_ré&cad=0#v=onepageé&q&f=false accessed on 6 July 2023), by the title “A Shakespeare Phonology” which we have adopted as our refer- ence. Variants are then lexically determined. Some words involved in the transformation are listed below using ARPAbet as the phonetic alphabet in the excerpt taken from the lexicon. As can be easily noticed, variants are related also to stress position, but also to consonant sounds. T exvicon 1

Table 8. Rhyme repetition rates in three Elizabethan poets. Table 9. Rhyme repetition word class-frequency distribution for Shakespeare’s sonnets.

Now, let us consider the distribution of rhyming words into the corpus of the sonnets. As to general frequency data, the Sonnets contain a number of tokens equal to 18,283 with 3085 types, so-called Vocabulary Richness that is used to measure the ability of a writer to use different words in a corpus, corresponds to 16.87%, a high value for that time when compared with other poets. Also, the number of Hapax and Rare Words (indicating the union of Hapax, Dis and TrisLegomena) corresponds to average values for other poets, respectively to 56%, the first type, and 79%, the second one. If we look at similar data for

Table 10. Quantitative data for six appraisal classes for sonnets with highest contrast. We report for each word frequency type in column 1—there is only one head word thee) with frequency 28—the corresponding number of tokens in Table 9, followed by he sum of tokens, the incremental sum and the corresponding percentage with respect o total corpus. As can be noticed from the last column, where incremental percent of hyme-pair words corpus coverage is reported, the total of rare words, i.e., type rhyme-pair vith frequency of occurrence lower than 4, is 62.59%, a fairly low value if compared to he measure evaluated on simple type/token ratios. If we look at most important English oets, as documented in a previous paper , we can see that the average value for Rare Nords is 77.88%. However, we are here dealing with rhyming words and the comparison nay not be so relevant.

Table 11. Quantitative data for six appraisal classes for sonnets with lowest contrast.

Table 12. Quantitative data for six appraisal classes for sonnets with no contrast. a a I I I a The experiment with ATF classes matching critics’ evaluation has been fairly successful, but how do these classes gauge with the Sound—-Sense harmony? In order to check this, we transferred the data related to vowels and consonants and matched them with ratios of the three main ATF categories: Appreciation Positive/Negative, Affect Positive/Negative, and Judgement Positive/Negative. As in previous computation, all data below 1 will be interpreted as a case of superior Negative Polarity and the opposite when data are above 1. To allow a better view of the overall data, we split them into sonnets with contrast to the first group that we show in Figure 9, and sonnets with no contrast to the second group, that we show in Figure 10. This time, however, we used our classification and abandoned the critics’ one.

Table 1. “Shallow” annotation output by UDPipe. Sentence: “It gives us the basis for several deductions” (Doyle, The Hound of the Baskervilles, 1901). Table 2. “Deep” annotation output by UDPipe. Sentence: “It gives us the basis for several deductions (Doyle, The Hound of the Baskervilles, 1901).

Table 3. “Shallow” annotation by UDPipe. Sentence: “There, however, stood only a single bowl” (Spyri, Heidi, 1880).

Table 4. “Deep” annotation by UDPipe. Sentence: “There, however, stood only a single bowl” (Spyri, Heidi, 1880). For each token, the analysis gives the form as it appears in the text and its lemma. This information is not used in the method described here since our goal is to examine the discriminative power of morphosyntactic features. In addition, as noted above, general vo- cabulary may be largely dependent on genre or subject matter and may confound analysis. It is worth noting that the elimination of word forms and lemmas from consideration simpli- fies preprocessing and, to some degree, compensates for the time required to extract input features from the parsed text. Minimal clean-up of the .txt file is required; chapter titles and the like can be left in the document without affecting the results of the classification.

Table 5. Number of input features by number of type-value components in each feature have selected among them based on frequency alone. For each combination length, only those type—value pairs which occur in approximately 5% of the tokens in the corpus have been included as input variables for classification. The process of populating feature type: with their values is computationally slow for combinations of more than two elements so we have used a smaller sample corpus for each language. Thus, the 5% cut-off i: an approximation. A separate set of variables has been identified in this way for eact language. Because UDPipe produces different types of morphological annotation, anc because syntactic annotation, although it largely consists of the same relationship labels has different frequency distributions in various languages, the same selection procedure with the same 5% cut-off results in a different quantity of features for each language. Detail: are given in Table 5.

Table 6. Results of classification by individual novel (45 classes). Clearly, the works in each corpus are sharply distinguishable at the morphosyntactic level. Unfortunately, there is little published research to which these results may usefully be compared. Generally, recent stylometric research has a quasi-forensic tendency, focused on the ability to “prove” authorship of particular texts. In such cases, there is no reason to examine the discriminability of the individual works of an author. In contrast, our interest is in the descriptive value of stylometric measures as applied to works as well as authors Our assumption in this study is that input features that both discriminate texts clearly and are understandable in terms of traditional stylistics may serve as the basis of valuable stylometric descriptions. Our results indicate that discriminability is high even with the relatively small 500-word samples; this success can be taken as an indication that a good deal of stylistic information is in fact conveyed by the features that we have proposed We will examine some of the most important of these distinguishing features in the next section. It is worth mentioning here that the same procedure (albeit with different input features for each corpus) works quite well for each language tested. In fact, it is apparent from the 500-word samples that the morphosyntactic signal is somewhat stronger the more morphologically complex the language is. This complexity is reflected in the number of features as reported in Table 5: a sharper distinction seems to exist between works in Polish which has 1137 total input features, compared to between works in English (653 total features).

Table 7. Results of leave-one-out classification by author (15 classes). The sharp decrease in classification accuracy is striking. Presumably, an explanation is to be found in the greatly increased difficulty of the problem. The results of the most closely comparable previous studies point to the same conclusion. Maciej Eder has published three important studies on authorship attribution [19,26,27] in which the corpora are similar to our own. The accuracy of Eder’s experiments is consistent with our results. For example, Eder (2010) classifies samples of various sizes drawn from 63 English novels; for samples of around 1000 words, accuracy falls between 40% and 50%. A more precise comparison is unfortunately not possible. All three of Eder’s works present their results in graphs rather than tables. Thus, only rough estimates for the accuracy of a given sample size are possible. Most of Eder’s data are based on the most frequent words. For a corpus of 66 German novels, samples ranging from 500 to 2000 words seem to yield accuracy scores from 30% to 60%. Evidently, the low accuracy of our authorship attribution tests (as compared to novel- by-novel classification) is not anomalous. Furthermore, it does not seem likely that the combination of input features and classifier that was quite good at identifying individual novels would become uninformative about the authorship of those same works. The field

Table 8. Selection of input features “preferred” in Oliver Twist. A few examples will help to illustrate the phenomena underlying these values. The first feature is grammatically transparent. This sentence from Oliver Twist has two examples: “They talked of hope and comfort”. The two bold-faced nouns are annotated with feature #1; obviously singular, they are preceded by their dependency parent, talked. Although this dependency—a noun upon a verb—is the most frequent structure annotated with feature

Table 9. Input features “avoided” in Oliver Twist. Feature #1B represents dependencies of the infinitive form of the verb. The English infinitive is morphologically the same as the dictionary lemma. It primarily occurs in one of two configurations. An infinitive can be “introduced” by the particle to, as in the following examples: “I have come out myself to take him there”; and “... the parish would like him to learn a right pleasant trade ...”. It is apparent that to plus the infinitive has a wide range of syntactic functions. In the first example, to take expresses the purpose for which the action of the main verb was undertaken. The infinitive phrase can be deleted from the sentence without making it ungrammatical. In contrast, the syntax (and semantics) of like in In addition to input features that are strongly “preferred” in Oliver Twist, there are others that are sharply “avoided”. We will look only at three of the most important, as given in Table 9.

Table 10. Summary of standard deviations of input features for selected authors.

Table 11. Selected input features preferred or avoided by the class “Dickens”.

Table 12. Selected input features where frequency variability weakens the “Dickens” signal. BH = Bleak House, GE = Great Expectations and OT = Oliver Twist.

Table 1. Data sources included in our dataset and their respective sizes. 3.1. Real Papers Collection

Table 2. Overview of the datasets used to train and evaluate the classifiers. Each column represents the number of papers used per source. Concerning real papers, unless indicated, we use samples extracted with parsing 1 (see Section 3.1).

Table 3. Experiment results reported with accuracy metric. Out-of-domain experiments, i.e., evalu- ation on unseen generators, are highlighted in blue. Highest values per test set are highlighted in bold. (*) ChatGPT-IO and LLMFE accuracies have been evaluated on randomly sampled subsets of 100 scientific papers per test set due to API limits.

Table A1. Hyperparameters used to generate each paper section in the Galactica model. Each row corresponds to a decoding of a section based on the previous input sections. Here we used parameters of the MODEL.GENERATE function provided by Huggingface [74].

Table A2. Experiment results for the different bag-of-words classifiers reported with accuracy metric. Out-of-domain experiments are highlighted in blue. The highest values per test set are highlighted in bold. Appendix B.2. GPT-3

Table A3. Experiment results for different ChatGPT prompting styles reported with accuracy metric. Out-of-domain experiments are highlighted in blue. Highest values per test set are highlighted in bold. (*) ChatGPT accuracies have been evaluated on randomly sampled subsets of 100 scientific papers per test set and prompting style due to API limits. Table A3. Experiment results for different ChatGPT prompting styles reported with accuracy metric.

Table 1. Overview of the linguistic markers for depression extracted in the selected papers. Source: Own work. Table 2. Overview of the linguistic markers for dementia extracted in the selected papers. Source: Own work.

Table 3. Overview of the linguistic markers for hallucinations from people extracted in the selected papers. Source: Own work. Table 4. Overview of the linguistic markers for artificial hallucinations extracted in the selected papers. Source: Own work.

Table 1. Features extracted from Profiling—UD. There have been increasingly large collections of data compiled across the internet. With advancements in technologies, these datasets are annotated and automatically anal- ysed for multiple purposes [22]. However, linguistic profiling of texts is usually carried out for multiple different projects with a variety of end goals in mind. Language verification, author identification and verification, and text classification are a few to highlight here. Our focus is to identify specific linguistic features of a given text that influence the text classification into genres and specific subgenres. A brief review of the studies which have focused on linguistic profiling of fictional and non-fictional texts points to the study by [11], where they tried to estimate the readability of Italian fictional prose based on the linguistic profiling of the texts. Even though their study shows promising results, from a fictional prose point of view the dataset considered in the study is devoid of the fictional texts or does not cover most of the subgenres of the fictional type. Therefore, it is very important to conduct studies that consider multiple fictional subgenres that are popularly noted in the literature and compare their linguistic composition with the non-fictional text type. In the study by [11], the four major categories considered were literature further divided into children and adult literature, journalism (newspaper), educational writing (educational materials for primary school and high school) and scientific prose. When we look at the datasets which are utilized across literature for the task of classification or readability or

Table 2. Summary of the dataset of the study. Hence, we built a dataset which consists of both fictional and non-fictional texts with a special focus on carrying out a detailed linguistic analysis. Table 2 highlights the number of text samples (shown in brackets) considered in each subgenre grouped across fictional and non-fictional genres. The selected texts were divided into chapters, and it was made sure that the overall size of each of the texts would be around 100-2000 words. Preprocessing of the selected text was carried out to remove licensing information, unnecessary spaces and punctuation.

Table 3. Summary of the raw textual features across genres. Table 4. Summary of the lexical variety features across genres.

Similarly, we looked at the parts of speech distribution in the various subgenres. Table 3 highlights the individual values of the distribution of parts of speech across various subgenres. When the values are compared across fiction and non-fictional texts, it can be noted that fictional texts have a lower number of adjectives but a higher number of adverbs, adpositions, pronouns and punctuation when compared to non-fictional texts. Whereas non-fictional texts have two times higher values of auxiliary verbs and nouns with slightly elevated values in numbers compared to fictional texts. No significant differences were noted in the values of coordinating and subordinating conjunctions, determiners, interjections, symbols and pronouns across fictional and non-fictional texts. Overall, the lexical density of fictional and non-fictional texts remained the same. Table 5 highlights the parts of speech distribution across all subgenres.

Table 6. Feature details after dimensionality reduction. 4.2. Constructing RF Models

Table 7. Subgenre selection of top features. Table 8. Genre selection of top features.

Table 9. Accuracy of the model with feature selection.

Table 1. Comparative Analysis of Knowledge-Based, Supervised, Unsupervised, and Semi- Supervised Techniques.

3. WSD Execution Process WSD is the task of determining an ambiguous word’s suitable sense based on context. WSD has seen a variety of methods. The majority of methods are based on different statistical methods. A few methods use corpora that have been sense-tagged, while others use unsupervised learning. The flowchart in Figure 6 shows the steps that are performed for WSD.

Unsupervised techniques are cost-effective, and they use unlabeled data. Thus, they can be used for languages that lack sense-tagged datasets. However, they may struggle with sense overlapping and lack deep semantic interpretation, leading to less precise disam- biguation compared to supervised methods. Data sparsity can also limit their effectiveness, requiring substantial data for satisfactory performance. Evaluating their performance can be challenging without a definitive gold standard for comparison. Combining unsu- pervised techniques with supervised or knowledge-based approaches can address their limitations and enhance overall WSD performance. me 2 oe ee eT OT TE ED eh ic eee el 2 Bw he ee kt beet See Ue. biguation compared to supervised methods. Data sparsity can also limit their effectiveness,

Table 3. Data Sources available for Hindi WSD.

1 Standard deviation. Table 1. Results of comparison with the algorithm from [14]. 3.2. More Complex Systems

Table 1. Elementary features with their corresponding weights. In Revesz [8], the weight of all features is 1. However, in this study, a different set of weights for each feature is used. The weight of each feature is the inverse of its frequency of occurrence across all symbols in Linear A. In other words, a feature that exists in most symbols will have a lower weight compared to a feature that only exists in some. This means that sharing a rarely occurring feature is given more importance than sharing a commonly occurring one.

Table 2. Feature-based similarity scores for a sub-set of symbol pairs.

6. Discussion anguages, suggesting that the result is coincidental rather than indicative of concrete links. The limited number of matches could be due to the phonetic values used for the comparison. The feature-based similarity measure, with the parameters utilized in this paper, was only successful in producing 43 matches for comparing Linear A with other anguages. In contrast, since Linear A and B potentially share 92 similar signs, naturally the phonetic grid based on Linear B includes more signs. There are several reasons for the derived phonetic grid being small. Firstly, it could simply indicate a lack of concrete inks between the scripts. Secondly, while the feature-based similarity measure allows for an analysis of different writing systems, it is not without its limitations. The method depends highly on the elementary feature set, and since we only had a few features, it is plausible to assume that certain important features may have been missed during the analysis. Additionally, a small feature set also increases the probability of finding multiple matches for any symbol with the same similarity scores, and breaking the tie becomes a challenging decision. In Revesz [8], for instance, the tie is broken by choosing the symbol that is earlier in the standard ordering of symbols.

Table 1. The Altai inscription and its row-by-row transliterations into Old Hungarian and Latin.

Table 1. The Old Hungarian Runic script with its Hungarian transliteration.

Table 2. The Altai Mountain inscription with incorrect signs highlighted in brown. It is apparent to Hungarian language speakers that some words do not make sense, although they are close to common Hungarian words. For example, in sign group (f), the intended name PETER can be easily recognized instead of the nonsense string PETEZ. This suggests that the scribe made a spelling mistake. In particular, the scribe wrote the Old Hungarian Z sign instead of the Old Hungarian R sign. Since Old Hungarian inscriptions are written from right to left, we first convert the sign groups into a left-to-right order as shown in Table 2. Next, we also attempted a transliteration to find the meaning of the words.

Table 3. The Altai Mountain inscription after replacing incorrect signs with intended ones. The mix-up of the above pairs of Old Hungarian signs is a natural consequence of their similar look. Nevertheless, it is possible to ask why exactly these signs are mixed up in the inscription. To answer that question, we can apply a mathematically based approach to sign similarities. This approach was developed in an earlier paper that compared the Minoan Linear A, the Carian, and the Old Hungarian script [7]. The approach starts by identifying which sign has which of the following thirteen features:

descriptionView Paper arrow_downwardDownload

Large Language Models as Computational Linguistics Tools: A Comparative Analysis of ChatGPT and Google Machine Translations

by Mohammad Awad AlAfnan

2024, Journal of Artificial Intelligence and Technology

This study investigates the effectiveness of Large Language Models (LLMs), specifically ChatGPT, in machine translation and compares them with traditional tools like Google Translate. The research focused on translating speeches by King Abdullah II of Jordan, delivered in Arabic and English at significant international events in 2023. The study evaluated the translations based on meaning, functional and textual adequacy, target language mechanics, style, register, and idiomaticity. The analysis revealed that Google Translate's Arabic-English translations were deficient, with contextual accuracy and meaning issues necessitating major revisions. The English-Arabic translations by Google Translate also required significant edits due to literal translation practices and inadequacies in several areas. Contrariwise, ChatGPT's Arabic-English translations were rated as acceptable, needing only minor edits, and offered more natural-sounding translations. The English-Arabic translations by ChatGPT, while better than Google Translate, still showed some deficiencies but were deemed acceptable with minor adjustments. The study underscores the irreplaceability of human translators in ensuring accurate and contextually rich translations. However, the study also highlights the potential of LLMs, such as ChatGPT, to significantly enhance the translation process. Developers are encouraged to enhance LLMs' contextual understanding and natural language processing capabilities. This involves expanding training datasets to include diverse and context-rich examples and improving the models' ability to handle different registers, styles, and idiomatic expressions. The study strongly advocates for a collaborative model in the translation industry that integrates machine translation with human expertise to enhance efficiency while maintaining quality. These insights are crucial for driving advancements in developing and applying machine translation tools, ensuring they complement rather than replace human translators.

Fig. 1. Translation evaluation rubric (adopted from Ustaszewski (2014)).

SL: source language; TL-MT: target language-machine translation; TL-HT: target language-human translation revisions before publication. This is the case as the translated text is a King’s speech that needs to be accurate and precise to avoid misunderstanding. meaning, tone, and style appropriately to the target audience. For textual adequacy, Google Translate ensures that the translation preserves the original text’s meaning, nuances, and organization as closely as possible. It involves maintaining the source text’s coherence, cohesion, and stylistic features in the target language. A textually adequate translation accurately represents the source text without adding or omitting significant information. The only possible functional and textual adequacy issue is related to some needed linguistic changes, as example 2 shows. As example 2 shows, ‘dre! dy yell Leal cdg) 44) peru! du pel! 4Sleal) Aircon! cle > was translated as ‘for hosting the Kingdom of Saudi Arabia for this Arab-Islamic summit’. This textual inadequacy can be edited as ‘for hosting this Arab-Islamic summit in the Kingdom of Saudi Arabia’ .

Table I. Style, register, and idiomaticity in MT Arabic—English translation national identity (4¢!/); it is about principles and values (és), In example 4, the translation of the ‘because that is who we are’ was also literally word-for-word translated to “4sle Gail sa lke GY’, This should have been translated as ‘tis 58 arwe ae dlits ll GY”, The ideal translation in Arabic means that ‘as this (turning our back to refugees) is against our values and principles’. Muhammad’. In addition, ‘e/4=’ was translated as ‘clash’ by Google Translate, whereas it was translated as ‘conflict’ by the human translator. ‘ex’ was translated as ‘gather’ by Google Translate, whereas it was translated as ‘convene’ by the human translation as the translator realized that it was a summit. In addition, ‘-/+is!’ was translated as ‘extension’ by Google Translate, whereas the human translator translated it as ‘continuation’ as the translator knows that the speech is about a war. Moreover, ‘3%’ was translated as ‘hotbed’ by Google. In contrast, it was translated as ‘source’ by the human translators, which reflects a higher register and an idiomatic expression by the human translator to match the register of the original speech (AlAfnan, 2018).

SL: source language; TL-MT: target language-machine translation; TL-HT: target language-human translation SL: source language; TL-MT: target language-machine translation; TL-HT: target language-human translation

Table Il. Style, register, and idiomaticity in MT English-Arabic translation

SL: source language; TL-MT: target language-machine translation; TL-HT: target language-human translation

Table Ill. Style, register, and idiomaticity in LLM Arabic—English translation In relation to functional and textual adequacy, the LLM- translated text seems and sounds more natural than the text translated by Google Translate. As example 8 _ shows, ‘Ape! Ay yal) dail odgt 40 peru!) Ly pel) 4Sledl! déleiu! le’ was trans- ated as ‘for hosting this Arab-Islamic summit in the Kingdom of Saudi Arabia’. This translation differs from the Google Translate- provided translation (for hosting the Kingdom of Saudi Arabia for this Arab-Islamic summit). It is precisely the same as the translation provided by the human translation. However, it is noticed that ChatGPT follows the English language punctuation style. In Ara- bic, if there is a list, the letter ‘_” which means ‘and’ is used, not a comma, to separate the items in the list. In the ChatGPT translation, commas are used, and no ‘s’ is added following the English anguage punctuation style. LLM translation provides the target text with accurate spelling and grammar for target language mechanics (i.e., grammar, punc- tuation, and spelling). However, it uses English language punctu- ation techniques, as mentioned earlier. Using commas to separate lists without using the word ‘.»’, which means ‘and’ in English is wrong as it does not follow the mechanics of the target language (Arabic). However, in terms of style, register, and idiomaticity, the text is produced more accurately than MT. As Table III shows, ‘taro lituw’ was translated as ‘Prophet Muhammad’, which is

Table IV. Style, register, and idiomaticity in LLM English-Arabic translation contrasting them with traditional machine translation tools, partic- ularly Google Translate. The study investigated both Arabic and English as source and target languages. Translations generated by ChatGPT and Google Translate were compared against the official translations of speeches of King Abdullah II of Jordan. The Arabic speech was delivered on 11 November, 2023, at the joint Arab- Islamic Extraordinary Summit on Gaza in Riyadh, while the English speech was delivered on 13 December, 2023, at the Global Refugee Forum in Geneva. register. The context of the speech is the Global Refugee Forum in Geneva. The attendees are heads of state and international dele- gates. The style of the original speech is formal and professional. In the machine translation, ‘as we speak’ was literally translated as ‘Gaati Gais’, which gives the impression of an informal context. The ideal translation shall be ‘e«isi 4’ to maintain the style and the formality (register) of the speech. In addition, we also have idiomaticity inaccuracy, as we see in the translation of ‘responsibility-sharing’, which was translated literally to “Aud sguvall grall cle 4S jléx’, The ideal and idiomatic translation shall be ‘Aus jucall Loai® aly gual! oad 4 J) LE? is idiomatic in Arabic, whereas ‘41! 5 jal! (stad) le 48 Li’ is a totally inaccurate translation that is not used in the Arabic language. However, unlike Google Translate, LLM translation provided a better translation for ‘let us make this forum count’. It translated this sentence as ‘guild ($8ay (gatiall |e Uzail’, which is acceptable as it provides an accurate translation. This translation means, ‘Let us make this forum achieve results’.

descriptionView Paper arrow_downwardDownload

Direkcionalnost u usmenom prevođenju

by Pia Oreskovic

2024, Zbornik radova Međimurskog veleučilišta u Čakovcu

Može li se prevoditi na materinski i na strani jezik jednakom lakoćom i kvalitetom? Ili bismo trebali preferirati određeni smjer prevođenja, a drugi čak i izbjegavati? Prevodilačka znanost, po tom pitanju, ima različita stajališta. Dok... more

descriptionView Paper arrow_downwardDownload

The Metaphorical Representation of Time in The Mind of The Multilingual: Speakers of English as a Third Language

by Achraf Ben Hmidou

2024, Achraf Ben Hmidou

This study examines the conceptualization of time in the mind of multilingual speakers. It targets specifically English as an L3 and It is an attempt to discover the complexities in the multifaceted and abstract levels of conceptualizing... more

descriptionView Paper arrow_downwardDownload

English-Russian Voice-Over Translation as a Means of Teaching a Foreign Language at a University

by Irina Martynenko

2024, Вестник Университета имени О.Е. Кутафина

В статье анализируется и оценивается англо-русский закадровый перевод видеоматериалов на базе нейросетей одной из ведущих российских IT-компаний. Авторы рассматривают данный технический сегмент на предмет возможного использования в... more

descriptionView Paper arrow_downwardDownload

II Colóquio Internacional de Línguas Estrangeiras: livro de resumos

by Elisabete Mendes Silva

2024

Contém os resumos do II Colóquio Internacional de Línguas Estrangeiras: livro de resumos, realizado na Escola Superior de Educação do Instituto Politécnico de Bragança nos dia 12 e 13 de Outubro de 201

descriptionView Paper arrow_downwardDownload

Estudios de lingüística aplicada III

by Francesca Romero Forteza

2024

Queda prohibida la reproducción, distribución, comercialización, transformación y, en general, cualquier otra forma de explotación, por cualquier procedimiento, de la totalidad o de cualquier parte de esta obra sin autorización expresa y... more

descriptionView Paper arrow_downwardDownload

Looks like google to me: Instructor ability to detect machine translation in L2 Spanish writing

by Luciane Maimone and

2024, F L A

This article reports the results of an empirical study designed to determine the degree to which college instructors of Spanish can distinguish between machine translation (MT) and non‐MT writing samples produced by second language (L2)... more

descriptionView Paper arrow_downwardDownload

Educational Research in Universal Sciences

by Shahnoza Almamatova

2024, ИЗУЧЕНИЕ УЗБЕКСКОЙ ФРАЗЕОЛОГИИ КАК НАУКА

Фраземы является устойчивыми сочетаниями, изучается в качетсве раздела фразеологии отдельной области языкознания. Устойчивые словосочетания в отличие от свободных сочетаний, и что фиксированные словосочетания являются строительным... more

descriptionView Paper arrow_downwardDownload

СЕМАНТИЧЕСКАЯ ДЕРИВАЦИЯ КАК УНИВЕРСАЛЬНЫЙПРИНЦИП ФУНКЦИОНИРОВАНИЯСОВРЕМЕННОГО РУССКОГО ЯЗЫКА

by Эльдар Хасанов

2024

Статья посвящается вопросам семантической деривации и её типам: смещению, метонимии и метафоризации. Описанию подвергается и сам термин деривация как универсальный принцип функционирования языка. Принцип деривации в языке не является... more

descriptionView Paper arrow_downwardDownload

ПРОБЛЕМЫ СТАНОВЛЕНИЯ ДЕРИВАТОЛОГИИ ЕЕ МЕТОДОЛОГИЯ И ИССЛЕДОВАНИЕ В КОММУНИКАТИВНОМ АСПЕКТЕ

by Эльдар Хасанов

2024

The article is devoted to topical issues of the methodological foundations and the methodology of the study of derivational processes manifested in modern Russian. The introduction describes the historical background of the development... more

descriptionView Paper arrow_downwardDownload

Иностранные языки в Узбекистане

by Эльдар Хасанов

2024, ПРОБЛЕМЫ СТАНОВЛЕНИЯ ДЕРИВАТОЛОГИИ, ЕЁ МЕТОДОЛОГИЯ И ИССЛЕДОВАНИЕ В КОММУНИКАТИВНОМ АСПЕКТЕ

Статья посвящена актуальным вопросам методологических основ и методике исследования деривационных процессов, проявляющихся в современном русском языке. Во введении описываются исторические предпосылки развития и становления дериватологии,... more

descriptionView Paper arrow_downwardDownload

"Ўзбекистонда хорижий тиллар" илмий-методик электрон журнал www.journal.fledu.uz №2(21)2018 СЕМАНТИЧЕСКАЯ ДЕРИВАЦИЯ КАК УНИВЕРСАЛЬНЫЙ ПРИНЦИП ФУНКЦИОНИРОВАНИЯ СОВРЕМЕННОГО РУССКОГО ЯЗЫКА

by Эльдар Хасанов

2024

descriptionView Paper arrow_downwardDownload

The Potential of Google Search for Studies in Cognitive Corpus Linguistics

by Inna Petrova

2024

Аннотация Статья посвящена использованию поисковой системы Google в качестве аналога корпу са текстов при проведении когнитивных исследований языка. Цель статьи-определить значимость статистических данных, доступных в результате... more

descriptionView Paper arrow_downwardDownload

The Potential of Google Search for Studies in Cognitive Corpus Linguistics

by Inna Petrova

2024, Theoretical and Applied Linguistics

The paper investigates the possibility to employ the Google search system as an analogue of the corpus of texts for potential use in further cognitive research of a language. The purpose of the article is to elucidate the significance of... more

descriptionView Paper arrow_downwardDownload

The Role of Eco-criticism in Urdu Literature

by Hamza Habib

2023, The Role of Eco-criticism in Urdu Literature

The term "Eco-criticism" was initially used by writer William Rueckert (1926-2006) in his 1978 essay "Literature and Ecology: An Experiment in Ecocriticism." The late 20th-century eco-criticism movement was characterized by a love and... more

descriptionView Paper arrow_downwardDownload

Revista Ideas Nº1

by hector valencia

2023

descriptionView Paper arrow_downwardDownload

NONLINEAR DIFFUSION IN SUPERCONDUCTORS

by Abduvasiyev Sardor

2023, NONLINEAR DIFFUSION IN SUPERCONDUCTORS

O`ta o`tkazvchanlik

descriptionView Paper arrow_downwardDownload

Quilting Art History in America

by Olima A B D I V A L I Y E V N A Kholmurodova

2023, International scientific-practical conference THE 2nd INTERNATIONAL CONFERENCE ON XXI CENTURY SKILLS IN LANGUAGE TEACHING AND LEARNING

In this article, the authors provide extensive information on the history
and developmental stages of the art of American quilting. At the same time, important ideas
about the importance of quilting art were noted.

descriptionView Paper arrow_downwardDownload

Fine-grained evaluation of German-English Machine Translation based on a Test Suite

by Eleftherios Avramidis

2023

We present an analysis of 16 state-of-the-art MT systems on German-English based on a linguistically-motivated test suite. The test suite has been devised manually by a team of language professionals in order to cover a broad variety of... more

descriptionView Paper arrow_downwardDownload

МОРФОЛОГИЧЕСКИЙ СПОСОБ ОБРАЗОВАНИЯ МАТЕМАТИЧЕСКИХ ТЕРМИНОВ В УЗБЕКСКОМ ЯЗЫКЕ

by Ozoda Ruzmetova

2023, Foreign Languages in Uzbekistan

descriptionView Paper arrow_downwardDownload

Multilingual education in Morocco and the question of cultural identity: Toward implementing a critical thinking approach in high school English textbooks

by Abdellah Elboubekri

2023, Educational Research Review

Intercultural pedagogies theorists and cultural studies scholars have no controversies over the fact that language is the appropriate realm for the formation, contestation and negotiation of identities. As a matter of fact, language... more

8. Suggest an effective way of teaching English language in Moroccan schools? 40 (T) generally recommend the communicative approach; 32 (T) suggest an eclectic method; 3 (T) go for the competency based approach; 3 (T) proposes teaching through drama, music and fun. As for the other informants they provide no clear pedagogical method for effective teaching. their evaluation with respect to their retention of the English language proficiency they acquired in high school. Table 2 demon- strates their views. DATA ANALYSIS

descriptionView Paper arrow_downwardDownload

Methods of teaching English idioms

by Umid Jumanazarov

2023

The article is devoted to the methods of teaching English idioms. If we teach idioms along with the other simple word lists. We believe, that the English Teacher has to help the learners get to know how to use the Dictionary very... more

descriptionView Paper arrow_downwardDownload

CLIL: graphic organisers and concept maps for noun identification within bilingual primary education natural science subject textbooks

by José Luis Gómez Ramos

2023, International Journal of Bilingual Education and Bilingualism

Though concept maps and graphic organisers are useful tools for bilinguals to organise the information being managed and learned, its systematic use is not widened and decisive in CLIL domains. Apart from helping students to acquire... more

descriptionView Paper arrow_downwardDownload

Computational Linguistics: Machine Translation

Key research themes

1. How do statistical and linguistic models contribute to parameter estimation and alignment accuracy in Machine Translation systems?

2. What are effective methodologies for evaluating machine translation quality, and how do linguistic and human-centered approaches compare?

3. How do recent Large Language Models (LLMs) compare to traditional Machine Translation systems in handling contextual meaning, fluency, and idiomatic expressions in multilingual translation tasks?

Related Topics

All papers in Computational Linguistics: Machine Translation