Genre detection is the computational process of identifying and classifying the genre of a text, audio, or visual content based on its characteristics and patterns. It employs algorithms and machine learning techniques to analyze features such as language, structure, and style to categorize content into predefined genres.
lightbulbAbout this topic
Genre detection is the computational process of identifying and classifying the genre of a text, audio, or visual content based on its characteristics and patterns. It employs algorithms and machine learning techniques to analyze features such as language, structure, and style to categorize content into predefined genres.
Word adjacency networks constructed from written works reflect differences in the structure of prose and poetry. We present a method to disambiguate prose and poetry by analyzing network parameters of word adjacency networks, such as the... more
Word adjacency networks constructed from written works reflect differences in the structure of prose and poetry. We present a method to disambiguate prose and poetry by analyzing network parameters of word adjacency networks, such as the clustering coefficient, average path length and average degree. We determine the relevant parameters for disambiguation using linear discriminant analysis (LDA) and the effect size criterion. The accuracy of the method is 74.9 ± 2.9% for the training set and 73.7 ± 6.4% for the test set which are greater than the acceptable classifier requirement of 67.3%. This approach is also useful in locating text boundaries within a single article which falls within a window size where the significant change in clustering coefficient is observed. Results indicate that an optimal window size of 75 words can detect the text boundaries.
In this paper, we report on a preliminary study we carried out for identifying patterns that characterize the genre type of Greek texts. In the course of our study, we address four distinct genre types, we record their observable... more
In this paper, we report on a preliminary study we carried out for identifying patterns that characterize the genre type of Greek texts. In the course of our study, we address four distinct genre types, we record their observable stylistic elements and we indicate their exploitation for automatic genre-based document classification. The findings of our study demonstrate that texts contain lexical features with discriminative power as far as genre is concerned, however modeling those features so that they can be explored by computer-based applications is still in early stages.