Key research themes
1. How can Hidden Markov Models be applied to Named Entity Recognition across languages and domains?
This theme focuses on the use of Hidden Markov Models (HMMs) as a statistical sequence labeling method for identifying and classifying named entities (NEs) in text. The research addresses the applicability of HMMs to different languages, including low-resource and Indian languages, and the challenges these pose, such as the absence of capitalization cues, ambiguity, and resource scarcity. It also evaluates the effectiveness of HMMs against other machine learning and rule-based methods and explores their integration with chunking and feature engineering to improve performance.
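The core mechanism these papers share is Viterbi decoding over BIO tag sequences. The following minimal sketch illustrates it with a toy tag set and hand-picked transition and emission probabilities; the parameters, words, and tags are illustrative assumptions, not values from any surveyed study, and a real system would estimate them from a labeled corpus with smoothing.

```python
# Minimal sketch of HMM-based NER tagging via Viterbi decoding.
# All probabilities and the example sentence are illustrative only.
from math import log

TAGS = ["O", "B-PER", "I-PER"]

start_p = {"O": 0.8, "B-PER": 0.2, "I-PER": 0.0}
trans_p = {
    "O":     {"O": 0.8, "B-PER": 0.2, "I-PER": 0.0},
    "B-PER": {"O": 0.4, "B-PER": 0.1, "I-PER": 0.5},
    "I-PER": {"O": 0.5, "B-PER": 0.1, "I-PER": 0.4},
}
emit_p = {
    "O":     {"said": 0.5, "today": 0.5},
    "B-PER": {"rahul": 0.6, "sharma": 0.4},
    "I-PER": {"sharma": 0.7, "rahul": 0.3},
}

def viterbi(words, eps=1e-8):
    """Return the most probable BIO tag sequence for `words`."""
    def lp(p):  # log-probability with a small floor for unseen events
        return log(p if p > 0 else eps)

    # Initialisation with the first word.
    V = [{t: lp(start_p[t]) + lp(emit_p[t].get(words[0], eps)) for t in TAGS}]
    back = [{}]

    # Recursion: best predecessor for each tag at each position.
    for i in range(1, len(words)):
        V.append({})
        back.append({})
        for t in TAGS:
            best_prev, best_score = max(
                ((p, V[i - 1][p] + lp(trans_p[p][t])) for p in TAGS),
                key=lambda x: x[1],
            )
            V[i][t] = best_score + lp(emit_p[t].get(words[i], eps))
            back[i][t] = best_prev

    # Backtrace from the best final tag.
    last = max(V[-1], key=V[-1].get)
    path = [last]
    for i in range(len(words) - 1, 0, -1):
        path.append(back[i][path[-1]])
    return list(reversed(path))

print(viterbi(["rahul", "sharma", "said", "today"]))
# Under these toy parameters: ['B-PER', 'I-PER', 'O', 'O']
```

The absence of capitalization in many Indian-language scripts is visible here: the model must rely entirely on lexical emission probabilities and tag transitions rather than surface cues, which is why feature engineering and chunking are often layered on top.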
2. What are the benefits and limitations of leveraging linguistic parsing and syntactic structure for Named Entity Recognition?
This research area investigates the use of syntactic parsing techniques, both constituency and dependency parsing, to improve the identification and delimitation of named entities. It explores how structural information can guide or augment sequence labeling models to resolve ambiguities and segment complex entities more accurately, with a focus on recent advances in parsing technology and their integration into NER pipelines. The discussion surveys different parsing-informed approaches and their empirical benefits.
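One simple way parse structure can aid delimitation is by snapping predicted entity spans to syntactic phrase boundaries. The sketch below illustrates this idea using spaCy's dependency-parse-derived noun chunks; it assumes the en_core_web_sm model is installed, and the function name, input tag sequence, and example sentence are hypothetical stand-ins for any sequence labeler's output, not a method proposed in the cited work.

```python
# Sketch of parsing-informed boundary correction for NER predictions.
# Assumes spaCy and the en_core_web_sm model are available.
import spacy

nlp = spacy.load("en_core_web_sm")

def snap_to_noun_chunks(text, bio_tags):
    """Widen predicted entity spans to align with noun-chunk boundaries
    from the dependency parse (a simple syntax-guided delimitation)."""
    doc = nlp(text)
    assert len(doc) == len(bio_tags), "tags must align with spaCy tokens"

    # Collect predicted (start, end, label) spans from the BIO sequence.
    spans, start = [], None
    for i, tag in enumerate(list(bio_tags) + ["O"]):
        if tag.startswith("B-"):
            if start is not None:
                spans.append((start, i, label))
            start, label = i, tag[2:]
        elif not tag.startswith("I-") and start is not None:
            spans.append((start, i, label))
            start = None

    chunks = [(c.start, c.end) for c in doc.noun_chunks]

    corrected = []
    for s, e, lab in spans:
        # If a noun chunk overlaps the prediction, widen to cover the chunk
        # (single pass; enough for illustration).
        for cs, ce in chunks:
            if cs < e and s < ce:
                s, e = min(s, cs), max(e, ce)
        corrected.append((doc[s:e].text, lab))
    return corrected

# A labeler that tagged only "Reserve" as B-ORG would be widened to the
# covering noun chunk, e.g. "The Reserve Bank" (exact chunking depends
# on the parser model).
tags = ["O", "B-ORG", "O", "O", "O", "O", "O", "O"]
print(snap_to_noun_chunks("The Reserve Bank of India raised rates today", tags))
```

The same pattern extends to constituency parses, where candidate spans are restricted to NP constituents rather than base noun chunks.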
3. How can domain- and language-specific corpora and annotation methodologies enhance Named Entity Recognition for low-resource and specialized languages?
This theme covers the development of annotated datasets and domain-adapted NER models for low-resource languages (e.g., Bhojpuri, Maithili, Magahi, Odia) and specialized domains (agriculture, biomedicine, historical culture). It emphasizes corpus creation methodologies, automatic and semi-automatic annotation tools, lexicon generation, and domain-specific feature engineering. The research underlines the critical role of tailored datasets and linguistic insights for effective NER in underrepresented languages and specialized fields.
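A common semi-automatic annotation step in such corpus-building efforts is gazetteer-driven pre-annotation, where a lexicon of known entities produces draft BIO tags that annotators then correct. The sketch below shows a longest-match version of this idea; the lexicon entries, entity types, and example sentence are hypothetical illustrations rather than material from any of the corpora discussed above.

```python
# Sketch of gazetteer-based pre-annotation for bootstrapping a
# low-resource NER corpus. Lexicon and sentence are illustrative only.

# Hypothetical lexicon: tokenised surface form -> entity type.
GAZETTEER = {
    ("पटना",): "LOC",
    ("भिखारी", "ठाकुर"): "PER",
    ("बिहार", "सरकार"): "ORG",
}
MAX_LEN = max(len(k) for k in GAZETTEER)

def pre_annotate(tokens):
    """Longest-match lookup producing draft BIO tags for human post-editing."""
    tags = ["O"] * len(tokens)
    i = 0
    while i < len(tokens):
        # Try the longest possible match first, then shorter ones.
        for n in range(min(MAX_LEN, len(tokens) - i), 0, -1):
            label = GAZETTEER.get(tuple(tokens[i:i + n]))
            if label:
                tags[i] = "B-" + label
                for j in range(i + 1, i + n):
                    tags[j] = "I-" + label
                i += n
                break
        else:
            i += 1
    return tags

sentence = ["भिखारी", "ठाकुर", "पटना", "में", "रहलें"]
print(list(zip(sentence, pre_annotate(sentence))))
# [('भिखारी', 'B-PER'), ('ठाकुर', 'I-PER'), ('पटना', 'B-LOC'), ('में', 'O'), ('रहलें', 'O')]
```

Draft tags produced this way are typically reviewed by native speakers, and the corrected corpus in turn feeds lexicon expansion and supervised model training.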