Academia.eduAcademia.edu

Semi-Automatic Classification

description15 papers
group116 followers
lightbulbAbout this topic
Semi-Automatic Classification refers to a methodological approach in data analysis where human expertise and automated algorithms collaboratively classify data. This technique leverages machine learning and user input to enhance accuracy and efficiency in categorizing complex datasets, often used in fields such as remote sensing, image analysis, and natural language processing.
lightbulbAbout this topic
Semi-Automatic Classification refers to a methodological approach in data analysis where human expertise and automated algorithms collaboratively classify data. This technique leverages machine learning and user input to enhance accuracy and efficiency in categorizing complex datasets, often used in fields such as remote sensing, image analysis, and natural language processing.

Key research themes

1. How can semi-supervised learning algorithms effectively leverage both labeled and unlabeled data to improve classification accuracy?

This research theme focuses on the development and evaluation of semi-supervised learning (SSL) methods that integrate small amounts of labeled data with large pools of unlabeled data to enhance classification performance. The challenge lies in designing algorithms that utilize unlabeled data beneficially without degrading accuracy, especially given that labeled data are costly or hard to procure. Various algorithmic frameworks including self-training, co-training, graph-based methods, and boosting-based approaches are explored to optimize learning when labels are scarce, addressing scenarios across text, image, and multidomain data classification.

Key finding: This paper categorizes semi-supervised regression (SSR) methods, highlighting the relative scarcity compared to semi-supervised classification, and emphasizes the importance of utilizing unlabeled data alongside labeled... Read more
Key finding: The study introduces adaptive confidence parameter strategies (FlexCon-G, FlexCon, FlexCon-C) for automatic selection of unlabeled instances in self-training and co-training algorithms. These adaptive methods outperform the... Read more
Key finding: This review synthesizes semi-supervised learning techniques, detailing classification and clustering approaches where unlabeled data improves model generalization with fewer labeled examples. It contrasts generative and... Read more
Key finding: The paper presents MSSBoost, a multiclass semi-supervised boosting algorithm combining pairwise similarity information and classifier predictions to jointly optimize labeled and unlabeled data classification. By embedding a... Read more
Key finding: This survey categorizes and contrasts semi-supervised learning algorithms, including self-training, generative models, co-training, and multiview learning. It emphasizes key assumptions such as smoothness, cluster, and... Read more

2. What methodologies enable effective integration of pairwise constraints, soft label estimation, and discriminant embeddings for enhancing semi-supervised classification performance?

This theme investigates advanced semi-supervised classification techniques that go beyond conventional label propagation by incorporating pairwise constraints, soft labeling strategies, and discriminant feature embedding. These methods aim to encode relational information and data structure more explicitly, fostering better class separability and improved decision boundaries in both binary and multiclass contexts. Such integration is crucial for domains where label information is scarce but relational or intrinsic geometric information among data points is available or can be inferred.

Key finding: This work introduces PCSVM, a margin-based semi-supervised classification algorithm that incorporates pairwise constraints indicating similarity or dissimilarity between data points into the SVM framework. Experiments show... Read more
Key finding: The paper proposes a novel framework (JSLDE) that jointly estimates soft label matrices and discriminant linear embeddings via optimizing margin-based criteria enhanced with graph smoothness regularization. This simultaneous... Read more
Key finding: Combining boosting with pairwise similarity information and classifier predictions, MSSBoost introduces a novel multiclass loss function minimizing inconsistency between similarity-induced pseudo-labels and classifier... Read more
Key finding: This study proposes a one-vs-each (OVE) associative classification approach focusing on rule relevancy for multiclass imbalanced datasets, where rules must be relevant and irrelevant per class individually, resolving... Read more

3. How can feature selection and evaluation metrics be adapted or redefined to capitalize on unlabeled data and better assess semi-supervised classification models?

This research area explores semi-supervised feature selection frameworks and advanced performance evaluation metrics specially designed for semi-supervised learning scenarios. Considering limited labeled data, enhancing feature selection by incorporating unlabeled examples can improve classifier generalization and robustness. Simultaneously, assessing model quality in imbalanced or multi-class contexts necessitates more interpretable and customizable metrics that reflect class distributions, skew-sensitivity, and training progress effectively, thus facilitating trustworthy semi-supervised classification evaluation.

Key finding: This paper introduces a wrapper-type forward feature selection method that utilizes both labeled and unlabeled data by extending training sets with predicted labels on unlabeled instances. Experimental results demonstrate... Read more
Key finding: Contingency Space is proposed as a bounded semimetric framework mapping single-value classification metrics into visual surfaces enabling comprehensive comparison and analysis. It addresses key limitations of traditional... Read more
Key finding: The study compares various post-processing techniques to estimate reliable multiclass posterior probabilities from SVM outputs. It finds that combining 'one against all' coupling with optimized softmax functions yields the... Read more

All papers in Semi-Automatic Classification

Automated and semi-automated image classi cations have made their way into archaeological applications, but early attempts have been strongly criticized. is study examines semi-automated detection methods of archaeological evidence... more
Slag buildup in the Pouring ladle has been a continuous challenge. Buildup also occurred in the Mg-treated metal in transfer ladles. However, the major problem area was significant buildup downstream fixed pouring ladle at CCM area, huge... more
Palaeoenvironmental data indicate that the climate of south-western Madagascar has changed repeatedly over the past millennium. Combined with sociopolitical challenges such as warfare and slave raiding, communities continually had to... more
The successful analysis of LiDAR data for archaeological research requires an evaluation of effects of different vegetation types and the use of adequate visualization techniques for the identification of archaeological features. The... more
Complete, accurate and up-to-date topographic data is of considerable importance as it is widely required by different government agencies, non-governmental organisations, the private sector as well as the general public for urban... more
Illegal archaeological excavations, generally denoted as looting, is one of the most important damage factors to cultural heritage, as it upsets the human occupation stratigraphy of sites of archaeological interest. Looting identification... more
The successful analysis of LiDAR data for archaeological research requires an evaluation of effects of different vegetation types and the use of adequate visualization techniques for the identification of archaeological features. The... more
The significant economic and health impacts of the COVID-19 pandemic have forced archaeologists to consider the concept of resilience in the present day, as it relates to their profession, students, research projects, cultural heritage,... more
Despite decades of archaeological research, roughly 75% of Madagascar's land area remains archaeologically unexplored and the oldest sites on the island are difficult to locate, as they contain the ephemeral remains of mobile... more
This chapter describes semi-automatic methods for detailed mapping of some types of archaeological structure from airborne laser scanning (ALS) data in Norway. The archaeological structures include point or circular features like: (1)... more
Illegal archaeological excavations, generally denoted as looting, is one of the most important damage factors to cultural heritage, as it upsets the human occupation stratigraphy of sites of archaeological interest. Looting identification... more
Bottom pour of certain metal alloys is used regularly in casting industries. Good quality castings are produces as optimization of parameters like pour height, metal flow and temperature range of pouring etc. can easily be achieved. Metal... more
This paper describes a new method for the automatic detection of pit structures in airborne laser scanning data collected with at least five emitted pulses per square metre. Oppland County, Norway, has a large number of ancient iron... more
Illegal archaeological excavations, generally denoted as looting, is one of the most important damage factors to cultural heritage, as it upsets the human occupation stratigraphy of sites of archaeological interest. Looting identification... more
The successful analysis of LiDAR data for archaeological research requires an evaluation of effects of different vegetation types and the use of adequate visualization techniques for the identification of archaeological features. The... more
Automatic classification of documents has become an important research area due to the exponential growth of digital content and because manual or semi-automatic organization is not effective. On one hand, manual and semi-automatic... more
This article presents results of a case study within a project that seeks to develop heavily automated analysis of digital topographic data to extract archaeological information and to expedite large area mapping. Drawing on developments... more
Africa represents a vast region where remote sensing technologies have been largely uneven in their archaeological applications. With impending climaterelated risks such as increased coastal erosion and rising sea levels, coupled with... more
Indian iron foundries are modernizing their operations for last several years. This has resulted in installation of new green sand high pressure molding lines and also box less vertical / horizontal molding lines. In order to meet molten... more
Remote sensing technology has become a standard tool for archaeological prospecting. Yet the ethical guidelines associated with the use of these technologies are not well established and are even less-often discussed in published... more
Recently, artificial intelligence (AI) has become increasingly important in many archaeological fields, as testified by the growing number of publications, dedicated workshops, and sessions at international conferences (Schneider et alii... more
Transformer Winding Machine winds the transformer when number parameters are given to the machine as input. Transformers are used in Voltage Regulator, Voltage Stabilizer, Power Supply, Welding Machine, etc. The different types of... more
One persistent archaeological challenge is the generation of systematic documentation for the extant archaeological record at the scale of landscapes. Often our information for landscapes is the result of haphazard and patchy surveys that... more
Africa represents a vast region where remote sensing technologies have been largely uneven in their archaeological applications. With impending climate related risks such as increased coastal erosion and rising sea levels, coupled with... more
Despite decades of archaeological research, roughly 75% of Madagascar's land area remains archaeologically unexplored and the oldest sites on the island are difficult to locate, as they contain the ephemeral remains of mobile... more
Computer-aided methods for the automatic detection of archaeological objects are needed to cope with the ever-growing set of largely digital and easily available remotely sensed data. In this paper, a promising new technique for the... more
One persistent archaeological challenge is the generation of systematic documentation for the extant archaeological record at the scale of landscapes. Often our information for landscapes is the result of haphazard and patchy surveys that... more
Artificial mounds and rings are morphologically heterogeneous. While they share many compositional traits, their outlines on landscapes widely vary across geographic regions. As such, automated identification of these deposits requires... more
This paper describes a new method for the automatic detection of heap structures in airborne laser scanning data, and reports on a work in progress. The heaps could be ancient grave mounds, dating to 1500-2000 years ago. Such grave mounds... more
Semi-automated approaches to archaeological feature detection can be invaluable aids to the investigation of high resolution, large area archaeological prospection datasets. In order to obtain stable and reliable classification results,... more
The project, LiDAR based semi-automatic pattern recognition within an archaeological landscape, is focused on adapting and creating semi-automatic procedures for handling and processing LiDAR data within cultural heritage monument... more
Automatic classification of documents has become an important research area due to the exponential growth of digital content and because manual or semi-automatic organization is not effective. On one hand, manual and semi-automatic... more
Automatic classification has become an important research area due to the rapid increase of digital information. Evidently, manual classification of documents is a tough work due to occurrences of vocabulary ambiguities of classification... more
This paper describes an approach to the classification of nominal compounds based on their subcategorisation. German compound noun predicates, such as Grundproblem, Beweislast and Schlussfolgerung subcategorizing for a subordinate clause... more
Download research papers for free!