Academia.eduAcademia.edu

Personal Name Disambiguation

description11 papers
group3 followers
lightbulbAbout this topic
Personal Name Disambiguation is the process of identifying and distinguishing between individuals with the same or similar names in datasets, ensuring accurate attribution of information, publications, or contributions to the correct person. This is crucial in fields such as bibliometrics, information retrieval, and social network analysis.
lightbulbAbout this topic
Personal Name Disambiguation is the process of identifying and distinguishing between individuals with the same or similar names in datasets, ensuring accurate attribution of information, publications, or contributions to the correct person. This is crucial in fields such as bibliometrics, information retrieval, and social network analysis.

Key research themes

1. How can semantic and co-author network features improve automated author name disambiguation in large bibliographic databases?

This research theme investigates computational methods, especially machine learning and deep learning techniques, to resolve ambiguities in author names by leveraging semantic representations of publication metadata and co-authorship relationships. Addressing the challenges of homonymy and synonymy in author names is critical for the accuracy of digital libraries, bibliometric studies, and academic information retrieval. The use of neural network models combining textual semantic embeddings with co-author network information offers promising approaches to link ambiguous author names to their respective real-world individuals in large-scale datasets such as DBLP.

Key finding: This paper proposes 'WhoIs', a novel neural network model trained on DBLP bibliographic data that combines character-level embeddings of co-author names (Char2Vec) with contextual semantic embeddings of paper titles and... Read more
Key finding: The authors develop an enhanced unsupervised person name disambiguation approach integrating reduction and supervised query strategies over web-extracted online profiles, achieving approximately 67% accuracy and supporting... Read more
Key finding: This work presents an unsupervised clustering method for personal name disambiguation based on automatically extracted biographical features such as birth date, nationality, and occupation from text, illustrating that... Read more
Key finding: This paper introduces a plugin-based framework enabling systematic comparison and combination of name disambiguation methods using a shared dataset (WePS-2). Empirical results support that using full text representations... Read more

2. What socio-cultural and linguistic factors influence personal naming conventions and their implications for identity across diverse societies?

Personal naming practices reflect and construct social identities including ethnicity, gender, religion, social class, and cultural affiliation. This theme encompasses investigations into the semantic, sociolinguistic, and historical analyses of naming conventions from various global contexts, highlighting naming as a dynamic mediator of identity, social roles, and cultural continuity or change. Research in this area also addresses the perception of names, the social meanings encoded within names, and the impact of naming on experiences of discrimination or inclusion.

Key finding: Using survey data from Belgium, the study evidences that people primarily distinguish Belgian from non-Belgian names rather than identifying specific ethnic-national origins, and that names also encode perceptions of gender,... Read more
Key finding: This qualitative study on the Datooga pastoralist society reveals that personal names are semantically linked to socio-cultural phenomena such as birth circumstances, traditions, gender roles, and social norms, indicating... Read more
by Rizwan Ahmad and 
1 more
Key finding: Through corpus analysis of 2,000 names in North India, this study identifies four major typologies of Muslim personal names and notes significant etymological roots in Arabic and Persian languages. It documents generational... Read more
Key finding: This analytic sociolinguistic study explores American English naming practices, identifying diverse sources of personal names including religious, occupational, and popular names, and discusses how naming operates as a... Read more
Key finding: Research based on 18th-century Hungarian census tax data demonstrates that systematic analysis of personal names can robustly reflect language borders and ethnic group distributions historically. The study discusses... Read more

3. How do personal names function ontologically and symbolically in individual identity and social memory across cultural and historical contexts?

This theme investigates the philosophical, anthropological, and cultural significance of personal names, focusing on their role in constituting personal identity, mediating social recognition, and connecting individuals to historical lineage or collective memory. Studies explore how names embody power dynamics, spiritual beliefs, and legal or bureaucratic realities, and how naming intersections with societal structures impact self-perception and interpersonal relations.

Key finding: Through collaborative autoethnography coupled with DNA analysis, this paper reveals how surnames, specifically anglicised Irish and Jewish names Fitzpatrick and Keesing, are complex socio-political constructs shaped by... Read more
Key finding: Ethnographic data from Macau, Brazil, and Portugal reveal that speakers attribute a notion of 'truth' to personal names, understood as ontological markers that mediate individuals' subjecthood, unitariness, and... Read more
Key finding: This study details the vital role of names in ancient Egyptian culture, where personal names are inseparable from identity, social status, and even spiritual existence. It highlights how names functioned legally and ritually... Read more
Key finding: This article argues the intimate connection between a person’s given name and their sense of self and identity, emphasizing how names influence personality development and function as crucial narrative tools in literature for... Read more
Key finding: This paper discusses the complexities surrounding the accuracy and reliability of personal name data, emphasizing the necessity of balancing automated analytical processes with manual verification, and incorporating domain... Read more

All papers in Personal Name Disambiguation

by Qin Lu
Web Person Disambiguation (WPD) is often done through clustering of web documents to identify the different namesakes for a given name. This paper presents a clustering algorithm using key phrases as the basic feature. However, key... more
An alumni database is a valuable information source for the development of a university. However, alumni databases tend to be incomplete. It is always possible for phone numbers and home or e-mail addresses to change. In this study, the... more
Finding information about people using search engines is one of the most common activities on the Web. However, search engines usually return a long list of Web pages, which may be relevant to many namesakes, especially given the... more
Considering the difficulties of distributing hardcopy surveys and the options provided by technology advances (especially the Internet), some universities focus their surveys on electronic versions. The process could be done... more
An alumni database is a valuable information source for the development of a university. However, alumni databases tend to be incomplete. It is always possible for phone numbers and home or e-mail addresses to change. In this study, the... more
En este trabajo presentamos un sistema no supervisado para agrupar los resultados proporcionados por un motor de búsqueda cuando la consulta corresponde a un nombre de persona compartido por diferentes individuos. Las páginas web se... more
En este trabajo presentamos un sistema no supervisado para agrupar los resultados proporcionados por un motor de búsqueda cuando la consulta corresponde a un nombre de persona compartido por diferentes individuos. Las páginas web se... more
En este trabajo presentamos un sistema no supervisado para agrupar los resultados proporcionados por un motor de búsqueda cuando la consulta corresponde a un nombre de persona compartido por diferentes individuos. Las páginas web se... more
In this paper, we propose a Markov CLustering (MCL) based text mining approach for namesake disambiguation on the Web. The novelty of the proposed technique lies in modeling the collection of webpages using a weighted graph structure and... more
We describe about the system description of the PSNUS team for the SemEval-2007 Web People Search Task. The system is based on the clustering of the web pages by using a variety of features extracted and generated from the data provided.... more
A person may have multiple personal name aliases and that same thing might available be on the web. Identifying aliases of a name is useful in information retrieval, investigating about things, knowledge management, sentiment analysis,... more
We describe about the system description of the PSNUS team for the SemEval-2007 Web People Search Task. The system is based on the clustering of the web pages by using a variety of features extracted and generated from the data provided.... more
A person may have multiple personal name aliases on the web. Identifying aliases of a name is useful in information retrieval and knowledge management, sentiment analysis, relation extraction and name disambiguation. The objective of... more
— A person may have multiple personal name aliases on the web. Identifying aliases of a name is useful in information retrieval and knowledge management, sentiment analysis, relation extraction and name disambiguation. The objective of... more
Person name disambiguation on the Web (PNDW) consists of grouping the Web pages retrieved by a search engine when a person's name is queried according to the individuals they refer to. This problem is of interest to the research community... more
The user has requested enhancement of the downloaded file. All in-text references underlined in blue are added to the original document and are linked to publications on ResearchGate, letting you access and read them immediately.
People name search often returns a lot of Web pages con- taining the strings of personal names. Due to namesake, extracting target person attributes (such as birthday, occu- pation, affiliation, nationality, contact information, etc.) is... more
This paper describes a simple clustering approach to person name disambiguation of retrieved documents. The methods are based on standard IR concepts and do not require any task-specific features. We compare different term-weighting and... more
This paper describes a simple clustering approach to person name disambiguation of retrieved documents. The methods are based on standard IR concepts and do not require any task-specific features. We compare different term-weighting and... more
Searching for information about people in search engines is a common and straightforward task that is often hampered by name ambiguities. While users are interested in information about a single person, results pages usually comprise many... more
It's common that different individuals share the same name, which makes it time-consuming to search information of a particular individual on the web. Name disambiguation study is necessary to help users find the person of interest more... more
Searching for information about people in search engines is a common and straightforward task that is often hampered by name ambiguities. While users are interested in information about a single person, results pages usually comprise many... more
abstract A search engine query for a person's name often brings up web pages corresponding to several people who share the same name. The Web People Search (WePS) problem involves organizing such search results for an ambiguous name query... more
Abstract We describe about the system description of the PSNUS team for the SemEval-2007 Web People Search Task. The system is based on the clustering of the web pages by using a variety of features extracted and generated from the data... more
Download research papers for free!