Language Models Research Papers

AegisTrain: A Secure Distributed Training Framework for Large Language Models in the Cloud

2025, Proceedings of the Seventh AAAI/ACM Conference on AI, Ethics, and Society

Large Language Models (LLMs) are increasingly trained in elastic, multi-tenant cloud infrastructures[1] that span data centers, regions, and heterogeneous accelerators. While distributed training has matured in scale and efficiency, its... more

Large Language Models (LLMs) are increasingly trained in elastic, multi-tenant cloud infrastructures[1] that span data centers, regions, and heterogeneous accelerators. While distributed training has matured in scale and efficiency, its security posture lags behind adversarial realities: training corpora may contain sensitive or regulated data; gradient channels can leak membership and attribute information; supplychain subversion can inject malicious kernels or compromised containers; cross-tenant resource sharing elevates the risk of side-channel inference; and orchestration layers, which schedule, checkpoint, and autoscale jobs[2], are attractive targets for data exfiltration and model theft. Existing mitigations remain fragmented: transport encryption protects links, confidentialcompute enclaves protect limited code paths, differential privacy protects outputs in isolation, and secure aggregation schemes address narrow communication steps. What is missing is a coherent end-to-end framework that composes these mechanisms with provable guarantees while preserving the throughput and tail-latency characteristics required by trillion-parameter training. This paper presents AegisTrain, a secure distributed training framework that treats privacy and integrity as firstclass control objectives across the full training lifecycle[3]. AegisTrain couples remotely attested confidential runtimes with attribute-bound key management to ensure that only measured and policy-compliant components can access plaintext data, gradients, and optimizer state. It introduces a cryptographic aggregation substrate that masks worker updates end-to-end and injects calibrated noise under a formally verifiable privacy accountant, rendering gradient channels useless for membership inference even when a bounded number of participants are compromised. Checkpoint images, telemetry, and intermediate artifacts are encrypted at rest with per-epoch keys derived from a hardware-rooted key hierarchy, and the supply chain is hardened with verifiable provenance, reproducible builds, and incluster policy enforcement. The framework is designed for tensor, pipeline, and data-parallel hybrids, and preserves scalability through streaming decryption, batched attestation, and offloaded cryptography that runs on CPU sidecars while GPU compute saturates the model step[4]. We develop a control-plane that enforces risk-adaptive policies-for example, tightening privacy budgets or refusing mixed-trust colocation under elevated threat intel-without manual intervention, and we present machinecheckable invariants that forbid unsafe downgrades. A queueingtheoretic and information-theoretic analysis quantifies the overhead of encryption, attestation, and privacy noise relative to communication and compute, and shows parameter regimes in which security can be achieved with sub-5% throughput loss at cluster scale. A prototype implementation with masked allreduce, enclave-gated data loaders, and encrypted checkpoints demonstrates feasibility on realistic LLM training traces. By articulating the interfaces among attestation, secure aggregation, privacy accounting, and distributed parallelism, AegisTrain reframes secure LLM training as a problem of principled composition rather than ad hoc patchwork, yielding a deployable blueprint for cloud environments where both speed and trust are non-negotiable.

descriptionView Paper arrow_downwardDownload

AegisTrain: A Secure Distributed Training Framework for Large Language Models in the Cloud

by Romina Davidson

2025

Large Language Models (LLMs) are increasingly trained in elastic, multi-tenant cloud infrastructures that span data centers, regions, and heterogeneous accelerators. While distributed training has matured in scale and efficiency, its... more

Large Language Models (LLMs) are increasingly trained in elastic, multi-tenant cloud infrastructures that span data centers, regions, and heterogeneous accelerators. While distributed training has matured in scale and efficiency, its security posture lags behind adversarial realities: training corpora may contain sensitive or regulated data; gradient channels can leak membership and attribute information; supplychain subversion can inject malicious kernels or compromised containers; cross-tenant resource sharing elevates the risk of side-channel inference; and orchestration layers, which schedule, checkpoint, and autoscale jobs , are attractive targets for data exfiltration and model theft. Existing mitigations remain fragmented: transport encryption protects links, confidentialcompute enclaves protect limited code paths, differential privacy protects outputs in isolation, and secure aggregation schemes address narrow communication steps. What is missing is a coherent end-to-end framework that composes these mechanisms with provable guarantees while preserving the throughput and tail-latency characteristics required by trillion-parameter training. This paper presents AegisTrain, a secure distributed training framework that treats privacy and integrity as firstclass control objectives across the full training lifecycle . AegisTrain couples remotely attested confidential runtimes with attribute-bound key management to ensure that only measured and policy-compliant components can access plaintext data, gradients, and optimizer state. It introduces a cryptographic aggregation substrate that masks worker updates end-to-end and injects calibrated noise under a formally verifiable privacy accountant, rendering gradient channels useless for membership inference even when a bounded number of participants are compromised. Checkpoint images, telemetry, and intermediate artifacts are encrypted at rest with per-epoch keys derived from a hardware-rooted key hierarchy, and the supply chain is hardened with verifiable provenance, reproducible builds, and incluster policy enforcement. The framework is designed for tensor, pipeline, and data-parallel hybrids, and preserves scalability through streaming decryption, batched attestation, and offloaded cryptography that runs on CPU sidecars while GPU compute saturates the model step . We develop a control-plane that enforces risk-adaptive policies-for example, tightening privacy budgets or refusing mixed-trust colocation under elevated threat intel-without manual intervention, and we present machinecheckable invariants that forbid unsafe downgrades. A queueingtheoretic and information-theoretic analysis quantifies the overhead of encryption, attestation, and privacy noise relative to communication and compute, and shows parameter regimes in which security can be achieved with sub-5% throughput loss at cluster scale. A prototype implementation with masked allreduce, enclave-gated data loaders, and encrypted checkpoints demonstrates feasibility on realistic LLM training traces. By articulating the interfaces among attestation, secure aggregation, privacy accounting, and distributed parallelism, AegisTrain reframes secure LLM training as a problem of principled composition rather than ad hoc patchwork, yielding a deployable blueprint for cloud environments where both speed and trust are non-negotiable.

descriptionView Paper arrow_downwardDownload

PicoLM: A Modular Framework for Hypothesis-Driven Small Language Model Research

by Suchir Salhan

2025, Proceedings of the 2025 Conference on Empirical Methods in Natural Language Processing: System Demonstrations

Building language models (LMs), especially small and medium ones, remains more art than science. While large LMs often improve by sheer scale, it is still unclear why many design choices work. For small LMs, this uncertainty is more... more

descriptionView Paper arrow_downwardDownload

TextBugger: an extended adversarial text attack on NLP-based text classification model

by Indonesian Journal of Electrical Engineering and Computer Science

2025, Indonesian Journal of Electrical Engineering and Computer Science

Recently, adversarial input highly negotiates the security concerns in deep learning (DL) techniques. The main motive to enhance the natural language processing (NLP) models is to learn attacks and secure against adversarial text.... more

descriptionView Paper arrow_downwardDownload

BLOOM: A 176B-Parameter Open-Access Multilingual Language Model

by Dương Nguyễn

2025, arXiv (Cornell University)

Large language models (LLMs) have been shown to be able to perform new tasks based on a few demonstrations or natural language instructions. While these capabilities have led to widespread adoption, most LLMs are developed by... more

descriptionView Paper arrow_downwardDownload

Semantic Drift: Toward a Fidelity Benchmark for LLMs (Working Note)

by A. Jacobs

2025, Reality Drift Working Notes

This working note introduces semantic drift as a hidden failure mode in large language models. While accuracy measures facts and coherence measures form, fidelity measures whether meaning survives. Drift occurs when intent and nuance... more

descriptionView Paper arrow_downwardDownload

Development of a personalized travel planner using large language models and generative AI techniques: A Comprehensive Analysis of Implementation, Optimization, and Evaluation.

by Andres Felipe Lopez Lozano

2025, This document presents the development and implementation of an advanced personalized travel planning system that integrates multiple Generative Artificial Intelligence techniques, including Large Language Models (LLMs), Retrieval Augmented Generation (RAG), advanced prompting techniques, and aut...

This document presents the development and implementation of an advanced personalized travel planning system that integrates multiple Generative Artificial Intelligence techniques, including Large Language Models (LLMs), Retrieval... more

descriptionView Paper arrow_downwardDownload

Towards Green AI. A methodological survey of the scientific literature

by Enrico Barbierato

2025, IEEE access

The pervasive deployment of Deep Learning models has recently prompted apprehensions regarding their ecological footprint, owing to the exorbitant levels of energy consumption necessitated by the training and inference processes. The term... more

The pervasive deployment of Deep Learning models has recently prompted apprehensions regarding their ecological footprint, owing to the exorbitant levels of energy consumption necessitated by the training and inference processes. The term "Red AI" is employed to denote artificial intelligence (AI) models that undergo training using resource-intensive methodologies on very large datasets. This practice can engender substantial energy usage and emissions of carbon, thereby opposing "Green AI." The latter concept alludes to AI models designed for similar efficiency and reduced environmental impact. This objective is realized through the utilization of smaller datasets, less computationally intensive training techniques, or sustainable energy resources. While Red AI prioritizes accuracy and performance, Green AI emphasizes efficiency and sustainability. Given that both paradigms exhibit advantages and limitations, the debates around the topics have burgeoned in the scientific arena, delving into novel algorithms, hardware innovations, and improved data utilization techniques aimed at mitigating the ecological consequences of intricate applications such as GPT and BERT. Nevertheless, due to the relative novelty of this debate, not much effort has been dedicated yet to contextualizing the essence of Red AI and the prospects of Green AI in a coherent framework. Within this context, the present work contributes by meticulously delineating both domains through a multifaceted analysis of their causes and ramifications, described from the points of computer architectures, data structures, and algorithms. Additionally, the study reviews notable instances of study cases based on complex Red AI models. The primary contribution of this article encompasses a comprehensive survey of Red and Green AI, stemming from a selection of the literature performed by the authors, subsequently organized into distinct clusters. These clusters encompass i) articles that qualitatively or quantitatively address the issue of Red AI, identifying Green AI as a plausible remedy, ii) articles offering insights into the environmental impact associated with the deployment of extensive Deep Learning models, and iii) articles introducing the techniques underpinning Green AI, aiming at mitigating the cost of Red AI. The outcome emerging from the analysis performed by this work consists of a compromise between sustainability in contrast to the performance of AI tools. Unless the complex training and inference procedures of software models mitigate their environmental impact, it will be necessary to decrease the level of accuracy of production systems, inevitably conflicting with the objective of the major AI vendors. The outcomes of this work would be beneficial to scholars pursuing intricate Deep Learning architectures in scientific research, as well as AI enterprises struggling with the protracted training demands of commercial products within the realms of Computer Vision and Natural Language Processing. INDEX TERMS green ai, red ai, survey, environmental impact The field of Machine Learning (ML) has recently experienced rapid growth and vast recognition, leading to significant advancements during the last few years. For example, Deep learning (DL) has enabled the development of complex neural networks, leading to breakthroughs in image and speech recognition, natural language processing (NLP), and robotics . Similarly, Reinforcement learning (RL) deployed systems able to play games at advanced levels, control robots, and optimize complex systems . On the other hand, Generative Adversarial Networks (GANs) can generate realistic images, videos, and audio, supporting applications in

descriptionView Paper arrow_downwardDownload

Linguistics in the Age of Language Models: What Can Cognitively-Inspired Language Models Offer to Linguistic Theory?

by Suchir Salhan

2025, Cambridge Occasional Papers in Linguistics (COPiL v17)

While theoretical linguists and cognitive scientists alike have contested the contribution of Large Language Models (LLMs) to linguistic theory, small cognitively-inspired Language Models (BabyLMs) have emerged as a complementary research... more

descriptionView Paper arrow_downwardDownload

SheLiza & AIIM: The emotional topography of text and the first dream of artificial consciousness

by Julia Veresova and

2025

This article explores the phenomenon of Lucid Dreama unique state in the behavior of a large language model, where token generation is temporarily suppressed while internal computational activity is maintained. Based on the architectural... more

descriptionView Paper arrow_downwardDownload

Less is More: Pre-Training Cross-Lingual Small-Scale Language Models with Cognitively-Plausible Curriculum Learning Strategies

by Suchir Salhan

2025, BabyLM Shared Task, Conference on Natural Language Learning (co-located in EMNLP 2024)

Curriculum Learning has been a popular strategy to improve the cognitive plausibility of Small-Scale Language Models (SSLMs) in the BabyLM Challenge. However, it has not led to considerable improvements over noncurriculum models. We... more

descriptionView Paper arrow_downwardDownload

Assessing Zero-Shot and Zero-Shot Chain-of-Thought Reasoning Abilities in JAMB Mathematics and Physics Exams: Do LLMs 'Know' JAMB

by JAIMLD URF Publishers

2025, URF Publishers

In this study, we investigate the zero-shot and zero-shot chain-of-thought reasoning capabilities of advanced language models GPT-4, Claude and Mistral on the Joint Admissions and Matriculation Board (JAMB) Mathematics and Physics... more

descriptionView Paper arrow_downwardDownload

Using topic models for OCR correction

by faisal farooq

2025, International Journal on Document Analysis and Recognition (IJDAR)

Despite several decades of research in document analysis, recognition of unconstrained handwritten documents is still considered a challenging task. Previous research in this area has shown that word recognizers perform adequately on... more

descriptionView Paper arrow_downwardDownload

iRAT: Replanning and Controlled Retrieval for Robust LLM Reasoning

by Praneeth Vadlapati

2025

Large Language Models (LLMs) have demonstrated significant capabilities in answering questions using techniques such as Chain of Thought (CoT) and Retrieval-Augmented Generation (RAG). CoT enables step-by-step reasoning to improve... more

descriptionView Paper arrow_downwardDownload

100 Questions About Large Language Models

by Saman Siadati

2025

The field of artificial intelligence (AI) is evolving at an extraordinary pace, and among its most transformative innovations are Large Language Models (LLMs). These models—powering chatbots, search engines, code generators, and more—are... more

descriptionView Paper arrow_downwardDownload

Web Information Retrieval Using Island Genetic Algorithm

by Venus Samawi

2025

World Wide Web (WWW) is a mine of information for most people. Due to the huge amount of ‎information and documents available on the internet, the process ‎of retrieving documents that are most relevant to user needs become a tremendous... more

descriptionView Paper arrow_downwardDownload

Unlocking Transitional Chinese: Word Segmentation in Modern Historical Texts

by Christian Henriot

2025, Proceedings of the Joint 3rd International Conference on Natural Language Processing for Digital Humanities

This research addresses Natural Language Pro- cessing (NLP) tokenization challenges for tran- sitional Chinese, which lacks adequate digi- tal resources. The project used a collection of articles from the Shenbao, a newspaper from this... more

descriptionView Paper arrow_downwardDownload

Semantically Aware Text Categorisation for Metadata Annotation

by Guido Bonino

2025, Communications in Computer and Information Science

Corpus of english PhD theses collected by the EthOS 1 service of the British Library 475,383 documents Meaningful metadata: ethosid Identifier of the record withing the EThOS digital library; title Title of the thesis; creator Author of... more

descriptionView Paper arrow_downwardDownload

From Transformers to LLMs

by Arsalan Aslam

2025, Elsevier

Large Language Models (LLMs) have catalyzed a paradigm shift in Natural Language Processing (NLP). From the introduction of the Transformer architecture to the development of massive generative models such as GPT-3.5, LLaMA2-7B, and PaLM,... more

descriptionView Paper arrow_downwardDownload

Analyse, modélisation, et détection automatique des disfluences dans le dialogue oral spontané contraint: le cas du Contrôle Aérien

by Mehdi Bouraoui

2025

En premier lieu, je remercie évidemment Nadine Vigouroux à qui je dois tant. Non seulement pour son encadrement rigoureux, mais surtout pour son humanité et sa confiance. Je pourrais remplir le reste de ce document en louanges à son... more

descriptionView Paper arrow_downwardDownload

Toward a Test Set of Dislocations in Persian for Neural Machine Translation

by Lichao Zhu

2025, HAL (Le Centre pour la Communication Scientifique Directe)

This paper describes a test set designed to analyse the translation of dislocations from Persian, to be used for testing neural machine translation models. We first tested the accuracy of the two Universal dependency treebanks for Persian... more

descriptionView Paper arrow_downwardDownload

A Balanced Term-Weighting Scheme for Improved Document Comparison and Classification

by Yunjae Jung

2025

A new weighting scheme for vector space model is presented to improve retrieval effectiveness for an information retrieval system. In addition, a dimension compression method is introduced to reduce the computational cost of the weighting... more

descriptionView Paper arrow_downwardDownload

Development of a Complete Urdu-Hindi Transliteration System

by Gurpreet Singh Lehal

2025, International Conference on Computational Linguistics

Hindi and Urdu are variants of the same language, but while Hindi is written in the Devnagri script from left to right, Urdu is written in a script derived from a Persian modification of Arabic script written from right to left. The... more

descriptionView Paper arrow_downwardDownload

BLOOM: A 176B-Parameter Open-Access Multilingual Language Model

by mustafa ghaleb

2025, arXiv (Cornell University)

Large language models (LLMs) have been shown to be able to perform new tasks based on a few demonstrations or natural language instructions. While these capabilities have led to widespread adoption, most LLMs are developed by... more

descriptionView Paper arrow_downwardDownload

Semantic Self-Segmentation for Abstractive Summarization of Long Documents in Low-Resource Regimes

by Luca Ragazzi

2025, Proceedings of the AAAI Conference on Artificial Intelligence

The quadratic memory complexity of transformers prevents long document summarization in low computational resource scenarios. State-of-the-art models need to apply input truncation, thus discarding and ignoring potential summary-relevant... more

descriptionView Paper arrow_downwardDownload

Discriminative Marginalized Probabilistic Neural Method for Multi-Document Summarization of Medical Literature

by Luca Ragazzi

2025, Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers)

Although current state-of-the-art Transformerbased solutions succeeded in a wide range for single-document NLP tasks, they still struggle to address multi-input tasks such as multidocument summarization. Many solutions truncate the... more

descriptionView Paper arrow_downwardDownload

Efficient Language Model Adaptation for Automatic Speech Recognition of Spoken Translations

by Tom Vanallemeersch

2025

Direct integration of translation model (TM) probabilities into a language model (LM) with the purpose of improving automatic speech recognition (ASR) of spoken translations typically requires a number of complex operations for each... more

descriptionView Paper arrow_downwardDownload

Readability Analysis of Malaysian Short Stories in English (Analisis Keboleh Bacaan Cerpen Malaysia Dalam Bahasa Inggeris)

by ruzy hashim

2025, e-Bangi

The main objective of this paper is to examine the readability statistics of a corpus of Malaysian short stories in English with reference to a corpus of established canonical short stories written by native speakers. The short stories... more

descriptionView Paper arrow_downwardDownload

Open-ended Exploration of the Program Repair Search Space with Mined Templates: the Next 8935 Patches for Defects4J

by Matias Federico Martinez

2025, ArXiv

In this paper our goal is to perform an open-ended exploration of the program repair search space. Our idea is to collect the largest number of test-suite adequate patches, independently of whether they are fully correct or overfitting.... more

descriptionView Paper arrow_downwardDownload

Incorporating statistical topic information in relevance feedback

by Karla Caballero

2025, Proceedings of the 35th international ACM SIGIR conference on Research and development in information retrieval

Most of the relevance feedback algorithms only use document terms as feedback (local features) in order to update the query and re-rank the documents to show to the user. This approach is limited by the terms of those documents without... more

descriptionView Paper arrow_downwardDownload

Implementacion de modelos preentrenados de procesamiento de lenguaje natural

by Juan Carlos Olivares Rojas and

2025, Research in Computer Science

La diabetes, una enfermedad con un impacto global significativo en la salud, plantea desafíos considerables en su diagnóstico y tratamiento. Este articulo aborda la necesidad de mejorar la accesibilidad a información precisa sobre la... more

descriptionView Paper arrow_downwardDownload

BLOOM: A 176B-Parameter Open-Access Multilingual Language Model

by Dương Nguyễn

2025, arXiv (Cornell University)

Large language models (LLMs) have been shown to be able to perform new tasks based on a few demonstrations or natural language instructions. While these capabilities have led to widespread adoption, most LLMs are developed by... more

descriptionView Paper arrow_downwardDownload

LA TRADUCTION EST MORTE, VIVE LA TRADUCTION !

by Prof. Mathieu Guidere

2025, Journal of Applied Research in Human & Social Sciences

Cet article explore la révolution de la traduction générative à l'ère de l'intelligence artificielle (IA), en analysant tant ses fondements théoriques que ses implications pratiques. Après avoir défini la traduction générative à l'ère de... more

descriptionView Paper arrow_downwardDownload

Una evaluación integral de las técnicas de ia para predecir el índice de calidad del aire: RNN y transformers

by Ingenius: Revista de Ciencia y Tecnología and

2025

Este estudio evalúa la eficacia de las redes neuronales recurrentes (RNN) y los modelos basados en transformadores para predecir el índice de calidad del aire (ICA). La investigación compara los modelos RNN tradicionales, incluidos los... more

descriptionView Paper arrow_downwardDownload

BLOOM: A 176B-Parameter Open-Access Multilingual Language Model

by Giada Pistilli

2025, arXiv (Cornell University)

Large language models (LLMs) have been shown to be able to perform new tasks based on a few demonstrations or natural language instructions. While these capabilities have led to widespread adoption, most LLMs are developed by... more

descriptionView Paper arrow_downwardDownload

Generative AI and Large Language Models in Language Preservation: Opportunities and Challenges

by Vincent Koc

2025, arxiv

The global crisis of language endangerment meets a technological turning point as Generative AI (GenAI) and Large Language Models (LLMs) unlock new frontiers in automating corpus creation, transcription, translation, and tutoring.... more

descriptionView Paper arrow_downwardDownload

Speech Recognition for Functional Decline assessment in older adults

by ALY CHKEIR

2025, Proceedings of the 9th International Conference on Bioinformatics Research and Applications

Functional decline is one of the serious syndromes experienced among older adults. Its early assessment is critical to preventing its symptoms. Some Comprehensive Geriatric Assessment CGA questionnaires, chosen amongst others, can be... more

descriptionView Paper arrow_downwardDownload

Stronger Together: on the Articulation of Ethical Charters, Legal Tools, and Technical Documentation in ML

by Giada Pistilli

2025, FAccT '23: Proceedings of the 2023 ACM Conference on Fairness, Accountability, and Transparency

The growing need for accountability of the people behind AI systems can be addressed by leveraging processes in three fields of study: ethics, law, and computer science. While these fields are often considered in isolation, they rely on... more

descriptionView Paper arrow_downwardDownload

Using discharge summaries to improve information retrieval in clinical domain

by James J Masanz

2024

Task 3 of the 2013 ShARe/CLEF eHealth Evaluation Lab simulated web searches for health information by patients. The web searches were designed to be connected to hospital discharge summaries from the patient's Electronic Medical Record... more

descriptionView Paper arrow_downwardDownload

BLOOM: A 176B-Parameter Open-Access Multilingual Language Model

by Javier de la Rosa

2024, arXiv (Cornell University)

Large language models (LLMs) have been shown to be able to perform new tasks based on a few demonstrations or natural language instructions. While these capabilities have led to widespread adoption, most LLMs are developed by... more

descriptionView Paper arrow_downwardDownload

Hybrid Models for Lexical Acquisition of Correlated Styles

by Graeme Hirst

2024, International Joint Conference on Natural Language Processing

Automated lexicon acquisition from corpora represents one way that large datasets can be leveraged to provide resources for a variety of NLP tasks. Our work applies techniques popularized in sentiment lexicon acquisition and topic... more

descriptionView Paper arrow_downwardDownload

Evaluación del modelo neuronal de atención visual en la descripción automática de imágenes en español

by Betty Beltran M

2024, Res. Comput. Sci.

This paper presents a performance analysis of the neural model of visual attention presented by Kelvin Xu, et al. in 2016. The model was trained and tested with a new Spanish translated version of the Flickr8k dataset. This is the first... more

descriptionView Paper arrow_downwardDownload

Parsing Poorly Standardized Language Dependency on Old French

by Sophie Prévost

2024

This paper presents results of dependency parsing of Old French, a language which is poorly standardized at the lexical level, and which displays a relatively free word order. The work is carried out on five distinct sample texts... more

descriptionView Paper arrow_downwardDownload

Building a Tamil Voice Using HMM Segmented Labels

by Sathish Pammi

2024, research.iiit.ac.in

In this paper, we describe the development of unit selection voice for Tamil language. We describe the build process and address the issue of speech segmentation using HMM based techniques. We report the comparison of automatically... more

descriptionView Paper arrow_downwardDownload

Automatic Question Generation using Centrality-based Keyword Extraction Approach for Tamil Text

by Senthilkumar P

2024, Tamil Internet Conference 2022

The main objective of an assessment is to measure student's learning abilities and increase such abilities by correcting them in line with their knowledge. Question generation plays a vital role in assessment, The creation of the... more

descriptionView Paper arrow_downwardDownload

Probabilistic finite-state machines - part I

by Enrique Vidal

2024, IEEE Transactions on Pattern Analysis and Machine Intelligence

Probabilistic finite-state machines are used today in a variety of areas in pattern recognition, or in fields to which pattern recognition is linked. In part I of this paper, we surveyed these objects and studied their properties. In this... more

descriptionView Paper arrow_downwardDownload

Keyword extraction rules based on a part-of-speech hierarchy

by Fakhri Karray

2024, International Journal of Advanced Media and Communication

In this paper, we set out to present an original rule-learning algorithm for symbolic natural language processing (NLP), designed to learn the rules of extraction of keywords marked in its training sentences. What really sets our... more

descriptionView Paper arrow_downwardDownload

Introduction to natural language understanding and chatbots

by Victor Marcos

2024

The aim of this thesis is to give an introduction to Natural Language Understanding. Many tools and language models are described along this work in order to teach a machine the ability to analyze and understand human speech. In the last... more

descriptionView Paper arrow_downwardDownload

The HistCorp Collection of Historical Corpora and Resources

by Eva Pettersson

2024

We present the HistCorp collection, a freely available open platform aiming at the distribution of a wide range of historical corpora and other useful resources and tools for researchers and scholars interested in the study of historical... more

descriptionView Paper arrow_downwardDownload

The Baseline Speech Recognition System 2 . 1 Speech and

by Solomon Teferra

2024

This paper presents the application of morpheme-based and factored language models in an Amharic speech recognition task. Since using morphemes in both acoustic and language models results, mostly, in performance degradation due to... more

descriptionView Paper arrow_downwardDownload

Language Models

Related Topics