Academia.eduAcademia.edu

Linguistic Computing

description9 papers
group23 followers
lightbulbAbout this topic
Linguistic Computing is an interdisciplinary field that combines linguistics and computer science to develop algorithms and software for processing, analyzing, and generating human language. It encompasses areas such as natural language processing, computational linguistics, and language technology, focusing on enabling machines to understand and interact with human language effectively.
lightbulbAbout this topic
Linguistic Computing is an interdisciplinary field that combines linguistics and computer science to develop algorithms and software for processing, analyzing, and generating human language. It encompasses areas such as natural language processing, computational linguistics, and language technology, focusing on enabling machines to understand and interact with human language effectively.
Chapter 1: 1.1 The word and its associative field 1.2 Syntagmatic and paradigmatic relations 1.3 (a) Lexical fields in the total vocabulary 1.3 (b) Example of a lexical field Chapter 6: 6.1 The vocabulary of English according to the OED... more
"The Revival of Cornish: An Dasserghyans Kernewek" by John J. Parry (1889–1954). With unpublished comments, corrections and updates by "Caradar" (A. S. D. Smith, 1883–1950), letter dated May 27, 1946. Cornish, a Celtic language of... more
this is the case, orthographic practice has a lot in common with Romance languages in general. Where Cornish has borrowed from English, Cornish spelling frequently resembles that found in the works of Chaucer. But there are differences... more
The aims of this study were to figure out the number of homographs in Acehnese and English languages and the examples of homographs. Qualitative approach was used to conduct the study. Both Acehnese dictionary and Oxford dictionary were... more
There are estimated to be upwards of 6,000 languages in the world today, although a disturbingly high proportion of these are under threat of extinction. All these languages have their histories. And then there are those languages that... more
This paper looks at how Spanish-mente adverbs are shown in DAELE, an electronic dictionary for advanced-level students of Spanish, currently being developed at the Universitat Pompeu Fabra (Barcelona). Since a learners' dictionary is a... more
A word-based morphological analyzer and a dictionary for recognizing inflected forms of French words have been built by adapting the UDICI" system. We describe the adaptations, emphasizing mechanisms developed to handle French verbs. This... more
The present study is an attempt at assessing the level of consistency in the orthographic systems of selected sixteenth and seventeenth-century printers and at tracing the influence that normative writings could have potentially exerted... more
The paper deals with that all-too-familiar situation where the lexicographer must make a decision whether to include a particular item in the dictionary or not, and if so then in what form. Some borderline cases are investigated where... more
Cornish is the vernacular language of Cornwall, the most SouthWestern part of Great Britain. It is widely believed the language died out in the eighteenth century with the death of Dolly Pentreath, the so-called last speaker of the... more
Morphology consists of inflection and word formation. In foreign language teaching it occurs mainly in the form of inflectional paradigms. While this is certainly an important part of mastering a foreign language, an adequate use of... more
Proceedings of PACLIC 19, the 19th Asia-Pacific Conference on Language, Information and Computation. ... Vowel Sound Disambiguation for Intelligible Korean Speech Synthesis ... Ho-Joon Lee Computer Science Division EECS department, KAIST... more
The aim of this paper is to research the word class adjective in one sequence of the ESP: Business English, more precisely English business magazines online. It is an empirical study on the corpus taken from a variety of business... more
This article introduces Corpus PalaeoHibernicum (CorPH), a corpus currently consisting of 78 texts in Early Irish (c. 7th–10th cent.) created by the ERC-funded Chronologicon Hibernicum (ChronHib) project by bringing together pre-existing... more
Many lexical databases are modelled simply as digital version of paper dictionaries. However, for many purposes the demands on a lexical database are different from those on a dictionary database. Therefore, the MorDebe database system... more
This article describes the design of a computational system for the development and maintenance of inflected lexica, developed as part of the Open Source Lexical Information Network (OLSIN). The system is built as a tool for... more
This paper explains the roles of the lexicon and the lexicographer to the nature of words.
This article introduces Corpus PalaeoHibernicum (CorPH), a corpus currently consisting of 78 texts in Early Irish (c. 7th–10th cent.) created by the ERC-funded Chronologicon Hibernicum (ChronHib) project by bringing together pre-existing... more
The Welsh language, as a lesser-used language with English as an immediate neighbour, has inevitably borrowed much of its vocabulary from that language (or its precursors) as well as inheriting a considerable vocabulary from Latin via... more
The present study is an attempt at assessing the level of consistency in the orthographic systems of selected sixteenth and seventeenth-century printers and at tracing the influence that normative writings could have potentially exerted... more
Large Lexical Data Bases are one of the earliest applications of NLP. The initial stage of their rise, with the admiration for the automation of lexicographic work itself, came to an end long ago. In the following stages LexicalData Bases... more
Large Lexical Data Bases are one of the earliest applications of NLP. The initial stage of their rise, with the admiration for the automation of lexicographic work itself, came to an end long ago. In the following stages LexicalData Bases... more
Languages other than English have received little attention as far as the application of natural language processing techniques to text composition is concerned. The present paper describes briefly work under development aiming at the... more
Languages other than English have received little attention as far as the application of natural language processing techniques to text composition is concerned. The present paper describes briefly work under development aiming at the... more
The problem addressed in this thesis concerns the accuracy of Māori language vocabulary counts, e.g Boyce (2006), where Māori was found to use a very small vocabulary in comparison with e.g. English. As Boyce (2006, ii) acknowledges, this... more
Lexicography should Ьѳ based on the dual dependency of nearly every dictionary entry; word dependence and morpheme de­ pendence. The overt or Implied assumption that lexicography deals only wlth words and their combinations is. therefore,... more
The aims of this study were to figure out the number of homographs in Acehnese and English languages and the examples of homographs. Qualitative approach was used to conduct the study. Both Acehnese dictionary and Oxford dictionary were... more
Languages other than English have received little attention as far as the application of natural language processing techniques to text composition is concerned. The present paper describes briefly work under development aiming at the... more
This paper presents a proposal for a recognition model for the appraisal value of sentences. It is based on splitting the text into independent sentences (full stops) and then analysing the appraisal elements contained in each sentence... more
An examination of the evidence that a medieval Cornish Bible written by John Trevisa once existed
, to a Polish father and English mother, Sarah Frances Field Sommerville, and brought up speaking English, Polish and French. As an Anglican clergyman, writer and historian he contributed to the Cornish Revival in the early twentieth... more
The derivational morphology in learners' English narrative compositions was the main focus of this research. This research aims to unravel the different inflectional suffixes used in the selected poems of Luis G. Dato driven from the... more
This paper focuses on a diachronic study of compound adjectives found in the Old and Middle English texts of the Helsinki Corpus. The compound adjectives of both periods are analysed, and further classified into types on the basis of the... more
In some languages, spaces and punctuation marks are used to delimit word boundaries. This is the case with Cornish. However there is considerable inconsistency of segmentation to be found within the Corpus of Cornish. The individual texts... more
Large Lexical Data Bases are one of the earliest applications of NLP. The initial stage of their rise, with the admiration for the automation of lexicographic work itself, came to an end long ago. In the following stages LexicalData Bases... more
Large Lexical Data Bases are one of the earliest applications of NLP. The initial stage of their rise, with the admiration for the automation of lexicographic work itself, came to an end long ago. In the following stages LexicalData Bases... more
To combine corpus data with dictionary data has two advantages: (i) It embeds the vocabulary of the corpus texts within the overall system of the language, and it semantically disambiguates the texts. (ii) The corpus data enrich the... more
This paper describes the implementation of Screffva, a computer system written in Prolog that employs a parallel corpus for the automatic generation of bilingual dictionary entries. Screffva provides a lemmatised interface between a... more
In this paper, I present data from three corpora of written Uyghur showing that the conventionally voiceless letter h, which occurs in words of Arab-Persian etymology, sometimes patterns as voiced in stem-final environments where it is a... more
We discuss two types of asymmetry between wordforms and their (morphological) characteristics, namely (morphological) variants and homographs. We introduce a concept of multiple lemma that allows for unique identification of wordform... more
Thanks to ❙ Anthony P. Cowie, Raphael Gefen, Doron Rubinstein, Merav Kernerman Miriam Shlesinger, Nili Sadeh, Lionel Kernerman K DICTIONARIES LTD Nahum 10 Tel Aviv 63503 Israel ❙ tel 972-3-5468102 ❙ fax 972-3-5468103 ❙... more
A GENERATIVE GRAMMAR APPROACH FOR THE MORPHOLOGIC AND MORPHOSYNTACTIC ANALYSIS OF ITALIAN Marina Russo ... singular plural passa-porto (pass-port) passa-porti porta-cenere (ash-tray) porta-cenere cava-tappi (cork-screw) cava-tappi rule 1... more
The paper reports ongoing work for the implementation of a system for automatic translation from English-to-Veneto and viceversa. The system does not have parallel texts to work on because of the almost inexistence of such manual... more
Cornish and Welsh are closely related Celtic languages and this paper provides a brief description of a recent project to publish an online bilingual English/Cornish dictionary, the Gerlyver Kernewek, based on similar work previously... more
Download research papers for free!