Rule Induction

description954 papers

group41 followers

lightbulbAbout this topic

Rule induction is a machine learning technique that involves the extraction of useful if-then rules from data. It aims to create a model that can predict outcomes based on input features by identifying patterns and relationships within the dataset.

lightbulbAbout this topic

Key research themes

1. How do search strategies and heuristics influence rule learning performance and over-searching in inductive rule induction?

This research theme investigates the impact of different search strategies—hill-climbing, beam search, and exhaustive search—and rule evaluation heuristics on the performance and characteristics of rule induction algorithms. It addresses the over-searching phenomenon, where increasing search effort may deteriorate learning performance, by examining the interplay between search mechanisms and heuristics. Understanding this interaction is critical for optimizing rule learning algorithms to balance theory size, predictive accuracy, and rule generality.

A Re-evaluation of the Over-Searching Phenomenon in Inductive Rule Learning

by Frederik Janssen

2016

Key finding: This study demonstrated that the traditionally observed over-searching phenomenon in inductive rule learning depends significantly on the choice of heuristic evaluation function. Exhaustive search tends to find longer but... Read more

articleView Paper downloadDownload

On Trading Off Consistency and Coverage in Inductive Rule Learning

by Frederik Janssen

2016

Key finding: This paper analyzed key rule learning heuristics—m-estimate, F-measure, and Klösgen measures—characterizing how each parametrically manages the trade-off between rule consistency (accuracy on covered examples) and coverage... Read more

articleView Paper downloadDownload

RuleKit: A comprehensive suite for rule-based learning

by Marek Sikora

2023, Knowledge-Based Systems

Key finding: RuleKit exemplifies a flexible, scalable sequential covering rule induction system that supports extensive customization of rule quality measures (over 40), including user-guided induction and multi-threaded execution.... Read more

articleView Paper downloadDownload

keyboard_arrow_downShow more

2. What methodologies enable effective rule extraction from complex black-box models, particularly support vector machines, enhancing interpretability without compromising performance?

A key challenge in machine learning is extracting comprehensible symbolic rules from high-performance but opaque models like support vector machines (SVMs). This theme explores learning-based and decompositional approaches for rule extraction that convert SVM decision boundaries into human-readable rules, facilitating trust, explanation, and validation especially in high-stakes domains such as medicine. The theme includes evaluation of techniques that treat SVMs as black boxes and generate rule sets approximating SVM predictions while maintaining accuracy.

Learning-based Rule-Extraction from Support Vector Machines

by Nahla Barakat

2021

Key finding: This work presented a novel learning-based method for extracting symbolic classification rules from SVMs by treating the SVM as a black box to generate labeled examples, which are then used to train rule-based learners like... Read more

articleView Paper downloadDownload

A Rule-Learning Approach for Detecting Faults in Highly Configurable Software Systems from Uniform Random Samples

by Victoria Ruiz

2022, Proceedings of the Annual Hawaii International Conference on System Sciences

Key finding: This study applied rule induction algorithms (e.g., AQ, CN2, RIPPER) to detect faults from test results on uniform random samples of software configurations. Evaluations on large-scale datasets demonstrate that rule learning... Read more

articleView Paper downloadDownload

Using Prior Knowledge in Rule Induction

by DungDuc Nguyen

2022

Key finding: By integrating prior knowledge as existing rule sets and user constraints into the rule induction process, this work proposed a two-step approach of generating rule seeds and specializing them to obtain more accurate rules.... Read more

articleView Paper downloadDownload

keyboard_arrow_downShow more

3. How can constructive induction and complex condition formulation extend the expressivity and predictive capacity of rule induction algorithms?

Traditional rule induction algorithms typically generate rules with simple logical conditions, which may limit their ability to capture complex relationships in data. This theme investigates methodologies for constructive induction—creating new features or complex rule conditions such as M-of-N combinations—and how these enhance the descriptive and predictive capabilities of rule learning. The research also addresses practical aspects such as heuristic control and knowledge-driven user guidance to manage combinatorial explosion and improve model interpretability.

Multistrategy Constructive Induction

by Eric Bloedorn

2021

Key finding: This paper proposed a multistrategy constructive induction framework combining data-driven and hypothesis-driven inference alongside expanding and contracting operations in representation space. The approach simultaneously... Read more

articleView Paper downloadDownload

A Practical Approach for Knowledge-Driven Constructive Induction

by Jose Augusto

2023, Citeseer

Key finding: The proposed methodology incorporates expert knowledge to guide constructive induction by suggesting new composite features that augment original datasets. By iteratively augmenting data with user-defined features and... Read more

articleView Paper downloadDownload

Classification, Regression, and Survival Rule Induction with Complex and M-of-N Elementary Conditions

by Cezary Maszczyk

2025, Machine learning and knowledge extraction

Key finding: This study introduced an extension to sequential covering rule induction algorithms allowing complex and M-of-N conditions in rule premises by analyzing frequent sets of elementary conditions. The approach effectively induced... Read more

articleView Paper downloadDownload

keyboard_arrow_downShow more

All papers in Rule Induction

Logistic-based patient grouping for multi-disciplinary treatment

by Laura Maruster

2025, Artificial Intelligence in Medicine

Present-day healthcare witnesses a growing demand for coordination of patient care. Coordination is needed especially in those cases in which hospitals have structured healthcare into specialtyoriented units, while a substantial portion of patient care is not limited to single units. From a logistic point of view, this multi-disciplinary patient care creates a tension between controlling the hospital's units, and the need for a control of the patient flow between units. A possible solution is the creation of new units in which different specialties work together for specific groups of patients. A first step in this solution is to identify the salient patient groups in need of multi-disciplinary care. Grouping techniques seem to offer a solution. However, most grouping approaches in medicine are driven by a search for pathophysiological homogeneity. In this paper, we present an alternative logistic-driven grouping approach. The starting point of our approach is a database with medical cases for 3603 patients with peripheral arterial vascular (PAV) diseases. For these medical cases, six basic logistic variables (such as the number of visits to different specialist) are selected. Using these logistic variables, clustering techniques are used to group the medical cases in logistically homogeneous groups. In our approach, the quality of the resulting grouping is not measured by statistical significance, but by (i) the usefulness of the grouping for the creation of new multi-disciplinary units; (ii) how well patients can be selected for treatment in the new units. Given a priori knowledge of a patient (e.g. age, diagnosis), machine learning techniques are employed to induce rules that can be used for the selection of the patients eligible for treatment in the new units. In the paper, we describe the results of the aboveproposed methodology for patients with PAV diseases. Two groupings and the accompanied classification rule sets are presented. One grouping is based on all the logistic variables, and another Artificial Intelligence in Medicine 26 (2002) 87-107

descriptionView Paper arrow_downwardDownload

Principled design of evolutionary learning sytems for large scale data mining

by Maria Clara Gomez Gaviria

2025

Currently, the data mining and machine learning fields are facing new challenges because of the amount of information that is collected and needs processing. Many sophisticated learning approaches cannot simply cope with large and complex domains, because of the unmanageable execution times or the loss of prediction and generality capacities that occurs when the domains become more complex. Therefore, to cope with the volumes of information of the current realworld problems there is a need to push forward the boundaries of sophisticated data mining techniques. This thesis is focused on improving the efficiency of Evolutionary Learning systems in large scale domains. Specifically the objective of this thesis is improving the efficiency of the Bioinforma tic Hierarchical Evolutionary Learning (BioHEL) system, a system designed with the purpose of handling large domains. This is a classifier system that uses an Iterative Rule Learning approach to generate a set of rules one by one using consecutive Genetic Algorithms. This system have shown to be very competitive so far in large and complex domains. In particular, BioHEL has obtained very important results when solving protein structure prediction problems and has won related merits, such as being placed among the best algorithms for this purpose at the Critical Assessment of Techniques for Protein Structure Prediction (CASP) in 2008 and 2010, and winning the bronze medal at the HUMIES Awards for Human-competitive results in 2007. However, there is still a need to analyse this system in a principled way to determine how the current mechanisms work together to solve larger domains and determine the aspects of the system that can be improved towards this aim. To fulfil the objective of this thesis, the work is divided in two parts. In the first part of the thesis exhaustive experimentation was carried out to determine ways in which the system could be improved. From this exhaustive analysis three main weaknesses are pointed out: a) the problem-dependancy of parameters in BioHEL's fitness function, which results in having a system difficult to set up and which requires an extensive preliminary experimentation to determine the adequate values for these parameters; b) the execution time of the learning process, which at the moment does not use any parallelisation techniques and depends on the size of the training sets; and c) the lack of global supervision over the generated solutions which comes from the usage of the Iterative Rule Learning paradigm and produces larger rule sets in which there is no guarantee of minimality or maximal generality. The second part of the thesis is focused on tackling each one of the weaknesses abovementioned to have a system capable of handling larger domains. First a heuristic approach to v set parameters within BioHEL's fitness function is developed. Second a new parallel evaluation process that runs on General Purpose Graphic Processing Units was developed. Finally, post-processing operators to tackle the generality and cardinality of the generated solutions are proposed. By means of these enhancements we managed to improve the BioHEL system to reduce both the learning and the preliminary experimentation time, increase the generality of the final solutions and make the system more accessible for end-users. Moreover, as the techniques discussed in this thesis can be easily extended to other Evolutionary Learning systems we consider them important additions to the research in this field towards tackling large scale domains. vi

descriptionView Paper arrow_downwardDownload

Applications of machine learning and rule induction

by Arya Nanda

2025, Communications of the ACM

M achine learning is the study of computational methods for improving performance by mechanizing the acquisition of knowledge from experience. Expert performance requires much domain-specific knowledge, and knowledge engineering has... more

descriptionView Paper arrow_downwardDownload

The Selection of Optimal Data Mining Method for Small-Sized Hotels

by Verka Jovanovic

2025, Proceedings of the International Scientific Conference - Synthesis 2015

Small-sized hotels that prevail in the tourist destination of Serbia rarely use any kind of property management or intelligence systems. The issue that pervades throughout this paper is related to the ways in which they can benefit from... more

descriptionView Paper arrow_downwardDownload

Multi-feature Error Detection in Spoken Dialogue Systems

by M. Swerts

2025, Computational Linguistics in the Netherlands 2001

Copyright and moral rights for the publications made accessible in the public portal are retained by the authors and/or other copyright owners and it is a condition of accessing publications that users recognise and abide by the legal... more

descriptionView Paper arrow_downwardDownload

Detecting problematic turns in human-machine interactions

by M. Swerts

2025, Proceedings of the 39th Annual Meeting on Association for Computational Linguistics - ACL '01

We address the issue of on-line detection of communication problems in spoken dialogue systems. The usefulness is investigated of the sequence of system question types and the word graphs corresponding to the respective user utterances.... more

descriptionView Paper arrow_downwardDownload

Improving machine-learned detection of miscommunications in human-machine dialogues through informed data splitting

by M. Swerts

2025, Proceedings of the ESSLLI Workshop on Machine Learning Approaches in Computational Linguistics

In this paper we study two types of machine learning techniques, rule-induction and memorybased learning, for error detection in spoken dialogue systems. The learners are trained and tested on two tasks: predicting whether the current... more

descriptionView Paper arrow_downwardDownload

Concept acquisition in example-based grammar authoring

by Ye-yi Wang

2025, 2003 IEEE International Conference on Acoustics, Speech, and Signal Processing, 2003. Proceedings. (ICASSP '03).

To facilitate the development of speech enabled applications and services, we have been working on an example-based semantic grammar authoring tool. Previous studies have shown that the tool has not only significantly reduced the grammar... more

descriptionView Paper arrow_downwardDownload

NTL Detection in Electric Distribution Systems Using the Maximal Overlap Discrete Wavelet-Packet Transform and Random Undersampling Boosting

by GERARDO MARTINEZ FIGUEROA

2025, IEEE Transactions on Power Systems

The illegal use of electricity, defective meters, and a malfunctioning infrastructure are major causes of Non-Technical Losses (NTLs) in electric distribution systems. Although the use of supervised machine learning techniques to detect... more

descriptionView Paper arrow_downwardDownload

UC Merced Proceedings of the Annual Meeting of the Cognitive Science Society Title Grammar Induction Profits from Representative Stimulus Sampling Publication Date Grammar Induction Profits from Representative Stimulus Sampling

by Fenna Poletiek

2025

Sensitivity to distributional characteristics of sequential linguistic and nonlinguistic stimuli, have been shown to play a role in learning the underlying structure of these stimuli. A growing body of experimental and computational... more

descriptionView Paper arrow_downwardDownload

Building Explanations for Fuzzy Decision Trees with the ExpliClas Software

by Raúl Salgado Vilas

2025, 2020 IEEE International Conference on Fuzzy Systems (FUZZ-IEEE)

Fairness, Accountability, Transparency and Explainability have become strong requirements in most practical applications of Artificial Intelligence (AI). Fuzzy sets and systems are recognized world-wide because of their outstanding... more

descriptionView Paper arrow_downwardDownload

Evolutionary selection of hyperrectangles in nested generalized exemplar learning

by Salvador garcia

2025, Applied Soft Computing

The nested generalized exemplar theory accomplishes learning by storing objects in Euclidean n-space, as hyperrectangles. Classification of new data is performed by computing their distance to the nearest "generalized exemplar" or... more

descriptionView Paper arrow_downwardDownload

Spherical model for Minimalist Machine Learning paradigm in handling complex databases

by Raul Jimenez Cruz and

2025

This paper presents the development of the N-Spherical Minimalist Machine Learning (MML) classifier, an innovative model within the Minimalist Machine Learning paradigm. Using N-spherical coordinates and concepts from metaheuristics and... more

descriptionView Paper arrow_downwardDownload

A visual interactive framework for attribute discretization

by Ramana Venkata

2025, Proc. Third Conf. Knowledge and …

Discretization is the process of dividing a continuous-valued base attribute into discrete intervals, which highlight distinct patterns in the behavior of a re-lated goal attribute. In this paper, we present an in-tegrated visual... more

descriptionView Paper arrow_downwardDownload

Effective Rule Induction from Molecular Structures Represented by Labeled Graphs

by Tamas Horvath

2025

Acyclic conjunctive queries form a polyno-mially evaluable fragment of definite non-recursive first-order Horn clauses. Labeled graphs, a special class of relational struc-tures, provide a natural way for represent-ing chemical compounds.... more

descriptionView Paper arrow_downwardDownload

When Looks Are Everything

by Heidi Kloos

2025, Psychological Science

The goal of this research was to examine mechanisms underlying early induction-specifically, the relation between induction and categorization. Some researchers argue that even early in development, induction is based on... more

descriptionView Paper arrow_downwardDownload

A computational framework for interpreting clusters through inductive learning: a case study

by Gisele Halembeck

2025

In supervised learning the inductive algorithm seeks to develop a conceptual description, or prescriptive model, from examples or objects that have been pre-classified. On the other hand, in unsupervised learning, or clustering, the task... more

descriptionView Paper arrow_downwardDownload

A computational framework for interpreting clusters through inductive learning: a case study

by Gisele Halembeck

2025

descriptionView Paper arrow_downwardDownload

Effects of query and database sizes on classification of news stories using memory based reasoning

by brij masand

2025, Proceedings of the 1993 Spring Symposium on Case- …

In this paper we explore the effects of query and database size on news story classification performance. Memory Based Reasoning (MBR) (a k-nearest neighbor method) used as the classification method. There are 360 different possible... more

descriptionView Paper arrow_downwardDownload

Mathematical Analysis of Information Systems through Technology

by En-Bing Lin

2025

We live in a world submerged with more information than ever before. We express information or data mathematically and it is growing faster than ever. If the data is imperfect, out of context or otherwise contaminated, it can lead to... more

descriptionView Paper arrow_downwardDownload

Concurrent Validation in the Treatment of Uncertainty in a Expert System

by Lucimar Maria Fossatti De Carvalho

2025

There are inherent open problems arising when developing and running Intelligent Environmental Decision Support Systems (IEDSS). During daily operation of IEDSS several open challenge problems appear. The uncertainty of data being... more

descriptionView Paper arrow_downwardDownload

A System Dynamics Model Approach for Simulating Hyper-inflammation in Different COVID-19 Patient Scenarios

by David Nettleton

2025, Proceedings of the 11th International Conference on Simulation and Modeling Methodologies, Technologies and Applications

The exceptionally high virulence of COVID-19 and the patients' precondition seem to constitute primary factors in how pro-inflammatory cytokines production evolves during the course of an infection. We present a System Dynamics Model approach for simulating the patient reaction using two key control parameters (i) virulence, which can be "moderate" or "high" and (ii) patient precondition, which can be "healthy", "not so healthy" or "serious preconditions". In particular, we study the behaviour of Inflammatory (M1) Alveolar Macrophages, IL6 and Active Adaptive Immune system as indicators of the immune system response, together with the COVID viral load over time. The results show that it is possible to build an initial model of the system to explore the behaviour of the key attributes involved in the patient condition, virulence and response. The model suggests aspects that need further study so that it can then assist in choosing the correct immunomodulatory treatment, for instance the regime of application of an Interleukin 6 (IL-6) inhibitor (tocilizumab) that corresponds to the projected immune status of the patients. We introduce machine learning techniques to corroborate aspects of the model and propose that a dynamic model and machine learning techniques could provide a decision support tool to ICU physicians. Infectious pandemic corona-virus disease (COVID-19), caused by severe acute respiratory disease Corona-virus 2 syndrome (SARS-CoV-2) is rapidly spreading worldwide . In the case of COVID-19, a worsening has been observed from 7 to 8 days. However, this only occurs in some patients; and because of an over-reaction of the immune system (Pedersen and Ho, 2020). During this pandemic, the challenge is to diagnose those patients who are not getting worse; and thus free up space for those who need intensive care when they develop respiratory failure due to acute respiratory distress syndrome; the main cause of mortality. In a recent retrospective, multi-center study of 150 confirmed cases of COVID-19 in Wuhan, China, the authors suggest that mortality could be due to hyper-inflammatory sepsis . a

descriptionView Paper arrow_downwardDownload

Sequential Cover Rule Induction with PA3

by Pedro de Almeida

2025

Algorithms for induction of concept descriptions from examples are important tools in the fields of machine learning and knowledge discovery in databases. This paper presents an induction algorithm, named PA3, that learns a set of ordered... more

descriptionView Paper arrow_downwardDownload

Direct Domain Knowledge Inclusion in the PA3 Rule Induction Algorithm

by Pedro de Almeida

2025, Lecture Notes in Computer Science

Inclusion of domain knowledge in a process of knowledge discovery in databases is a complex but very important part of successful knowledge discovery solutions. In real-life data mining development, non-structured domain knowledge involvement in the data preparation phase and in the final interpretation/evaluation phase tends to dominate. This paper presents an experiment of direct domain knowledge integration in the algorithm that will search for interesting patterns in the data. In the context of stock market prediction work, a recent rule induction algorithm, PA3, was adapted to include domain theories directly in the internal rule development. Tests performed over several Portuguese stocks show a significant increase in prediction performance over the same process using the standard version of PA3. We believe that a similar methodology can be applied to other symbolic induction algorithms and in other working domains to improve the efficiency of prediction (or classification) in knowledge-intensive data mining tasks. DK can be included in the data mining phase through direct integration (implicit or explicit) in the data mining algorithm, or through an associated knowledge base. In the first case, specific changes to the core data mining algorithm must be performed, in order to directly represent the involved domain knowledge through a biasing of the search. In the latter case, a very tight coupling between the domain theory description in the knowledge base and the bias representation language accepted by the learner is need, eventually involving an intermediate knowledge "translator" . Anyway, both of these forms of DK integration tend to need software specifically adapted for each application case, since different kinds of domain knowledge usually involve different representations, and most data mining algorithms (and commercial data mining programs) don't allow the integration any form of DK not contained in the data. Direct integration of DK in data mining software generally intends to direct and focus the pattern search that takes place at that KDD step. This can raise another potential limitation of this technique: If badly directed, the focused search can miss some of the potentially interesting patterns that an unbiased search could find in the data . However, in spite of the limitations and potential problems, we believe that, in some cases, careful DK integration in the data mining step of a KDD process can produce significant improvements in the overall efficiency of the process. This paper presents an experiment that integrates two domain theories directly in a rule induction data mining algorithm. The domain is short-term stock market prediction, and the two theories bias the algorithm, during rule search, against a specific class of rules, and towards another. The theories are tested over five data sets that correspond to multivariate information based on daily quotes of five of the most significant stocks in the Portuguese BVLP stock exchange. The base rule induction algorithm used, PA3 , is a recent general-purpose sequential cover algorithm that combines general-to-specific and specific-to-general search to develop each rule.

descriptionView Paper arrow_downwardDownload

Shuttling Between Depictive Models and Abstract Rules: Induction and Fallback

by John Black

2025, Cognitive Science

A productive way to think about imagistic mental models of physical systems is as though they were sources of quasi‐empirical evidence. People depict or imagine events at those points in time when they would experiment with the world if... more

descriptionView Paper arrow_downwardDownload

Constructing Hierarchical Rule Systems

by Michael Berthold

2025, Lecture Notes in Computer Science

Rule systems have failed to attract much interest in large data analysis problems because they tend to be too simplistic to be useful or consist of too many rules for human interpretation. We present a method that constructs a... more

descriptionView Paper arrow_downwardDownload

Constructing Hierarchical Rule Systems

by Michael Berthold

2025, Lecture Notes in Computer Science

descriptionView Paper arrow_downwardDownload

Classification, Regression, and Survival Rule Induction with Complex and M-of-N Elementary Conditions

by Cezary Maszczyk

2025, Machine learning and knowledge extraction

Most rule induction algorithms generate rules with simple logical conditions based on equality or inequality relations. This feature limits their ability to discover complex dependencies that may exist in data. This article presents an... more

descriptionView Paper arrow_downwardDownload

Analyzing Employee Attrition Using Decision Tree Algorithms

by sesan adeyemo

2025

Employee turnover is a serious concern in knowledge based organizations. When employees leave an organization, they carry with them invaluable tacit knowledge which is often the source of competitive advantage for the business. In order... more

descriptionView Paper arrow_downwardDownload

A Genetic Programming Approach Applied to Feature Selection from Medical Data

by Ernesto Costa

2025, Practical Applications of Computational Biology and Bioinformatics, 12th International Conference

Genetic programming represents a flexible and powerful evolutionary technique in machine learning. The use of genetic programming for rule induction has generated interesting results in classification problems. This paper proposes an... more

descriptionView Paper arrow_downwardDownload

Special Issue on Innovative techniques and applications of artificial intelligence

by Daniel Neagu

2025, Knowledge-Based Systems

This special issue of Knowledge Based Systems comprises expanded versions of the best papers submitted to the conference AI-2010, the thirtieth SGAI International Conference on Artificial Intelligence, which was held in Cambridge, England... more

descriptionView Paper arrow_downwardDownload

Estimating the credibility of examples in automatic document classification

by Marcos Goncalves

2024

Classification algorithms usually assume that any example in the training set should contribute equally to the classification model being generated. However, this is not always the case. This paper shows that the contribution of an... more

descriptionView Paper arrow_downwardDownload

Model updating after interventions paradoxically introduces bias Supplementary Materials

by Samuel Emerson

2024

James Liley Samuel R. Emerson Bilal A. Mateen Catalina A. Vallejos Louis J. M. Aslett Sebastian J. Vollmer 1 Alan Turing Institute, London, UK; 2 MRC Human Genetics Unit, Univ. of Edinburgh, UK; 3 Department of Mathematical Sciences,... more

descriptionView Paper arrow_downwardDownload

Genetic rule induction at an intermediate level

by Ton Weijters

2024, Knowledge-Based Systems

Lists of if±then rules (i.e. ordered rule sets) are among the most expressive and intelligible representations for inductive learning algorithms. Two extreme strategies searching for such a list of rules can be distinguished: (i) local... more

descriptionView Paper arrow_downwardDownload

A Rule-Based Approach for Process Discovery: Dealing with Noise and Imbalance in Process Logs

by Ton Weijters

2024, Data Mining and Knowledge Discovery

Effective information systems require the existence of explicit process models. A completely specified process design needs to be developed in order to enact a given business process. This development is time consuming and often... more

descriptionView Paper arrow_downwardDownload

Logistic-based patient grouping for multi-disciplinary treatment

by Ton Weijters

2024, Artificial Intelligence in Medicine

Present-day healthcare witnesses a growing demand for coordination of patient care. Coordination is needed especially in those cases in which hospitals have structured healthcare into specialtyoriented units, while a substantial portion of patient care is not limited to single units. From a logistic point of view, this multidisciplinary patient care creates a tension between controlling the hospital's units, and the need for a control of the patient flow between units. A possible solution is the creation of new units in which different specialties work together for specific groups of patients. A first step in this solution is to identify the salient patient groups in need of multidisciplinary care. Grouping techniques seem to offer a solution. However, most grouping approaches in medicine are driven by a search for pathophysiological homogeneity. In this paper, we present an alternative logistic-driven grouping approach. The starting point of our approach is a database with medical cases for 3603 patients with peripheral arterial vascular (PAV) diseases. For these medical cases, six basic logistic variables (such as the number of visits to different specialist) are selected. Using these logistic variables, clustering techniques are used to group the medical cases in logistically homogeneous groups. In our approach, the quality of the resulting grouping is not measured by statistical significance, but by (i) the usefulness of the grouping for the creation of new multidisciplinary units; (ii) how well patients can be selected for treatment in the new units. Given a priori knowledge of a patient (e.g. age, diagnosis), machine learning techniques are employed to induce rules that can be used for the selection of the patients eligible for treatment in the new units. In the paper, we describe the results of the aboveproposed methodology for patients with PAV diseases. Two groupings and the accompanied classification rule sets are presented. One grouping is based on all the logistic variables, and another Artificial Intelligence in Medicine 26 (2002) 87-107

descriptionView Paper arrow_downwardDownload

Rule Quality Measures Settings in Classification, Regression and Survival Rule Induction — an Empirical Approach

by Marek Sikora

2024, Fundamenta Informaticae

The paper presents the results of research related to the efficiency of the so-called rule quality measures which are used to evaluate the quality of rules at each stage of the rule induction. The stages of rule growing and pruning were... more

descriptionView Paper arrow_downwardDownload

GuideR: A guided separate-and-conquer rule learning in classification, regression, and survival settings

by Marek Sikora

2024, Knowledge-Based Systems

This article presents GuideR, a user-guided rule induction algorithm, which overcomes the largest limitation of the existing methods-the lack of the possibility to introduce user's preferences or domain knowledge to the rule learning... more

descriptionView Paper arrow_downwardDownload

RuleKit: A comprehensive suite for rule-based learning

by Marek Sikora

2024, Knowledge-Based Systems

Rule-based models are often used for data analysis as they combine interpretability with predictive power. We present RuleKit, a versatile tool for rule learning. Based on a sequential covering induction algorithm, it is suitable for... more

descriptionView Paper arrow_downwardDownload

Applying Bagging Techniques to the SA Tabu Miner Rule Induction Algorithm

by Ivan Chorbev

2024, ICT Innovations 2009

This paper presents an implementation of bagging techniques over the heuristic algorithm for induction of classification rules called SA Tabu Miner (Simulated Annealing and Tabu Search data miner). The goal was to achieve better... more

descriptionView Paper arrow_downwardDownload

Learning Problem and BCJR Decoding Algorithm in Anomaly-based Intrusion Detection Systems

by Milen Baltov

2024, Journal of Software

The anomaly-based intrusion detection systems examine current system activity do find deviations from normal system activity. The present paper proposes a method for normal activity description using the Hidden Markov Models (HMM), which... more

descriptionView Paper arrow_downwardDownload

Constrained clustering of gene expression profiles

by Suzana Loskovska

2024

In this paper a querying environment for analysis of patient clinical data is presented. The data consists of two parts: patients' pathological data and data about corresponding gene expression levels. The querying environment includes a... more

descriptionView Paper arrow_downwardDownload

Incremental Versus Non-incremental: Data and Algorithms Based on Ordering Relations

by Jiajun Chen

2024, International Journal of Computational Intelligence Systems

Based on multi-dominance discernibility matrices, a non-incremental algorithm RIDDM and an incremental algorithm INRIDDM are proposed by means of Dominance-based Rough Set Approach. For the incremental algorithm, when a new object... more

descriptionView Paper arrow_downwardDownload

Non-monotonic characterization of induction and its application to inductive learning

by Javier Larrosa

2024, International Journal of Intelligent Systems

In this article a new approach to the formalization of inductive inference in terms of non-monotonic inference is proposed. Induction is characterized as closed-world reasoning from the available data, followed by an inductive jump, which... more

descriptionView Paper arrow_downwardDownload

Blood Tumor Prediction Using Data Mining Techniques

by Alaa El-Halees

2024, Zenodo (CERN European Organization for Nuclear Research)

Healthcare systems generate a huge data collected from medical tests. Data mining is the computing process of discovering patterns in large data sets such as medical examinations. Blood diseases are not an exception; there are many test... more

descriptionView Paper arrow_downwardDownload

An Accelerator for Rule Induction in Fuzzy Rough Theory

by Xizhao Wang

2024, IEEE Transactions on Fuzzy Systems

Rule-based classifier, that extract a subset of induced rules to efficiently learn/mine while preserving the discernibility information, plays a crucial role in human-explainable artificial intelligence. However, in this era of big data,... more

descriptionView Paper arrow_downwardDownload

Classification, Regression, and Survival Rule Induction with Complex and M-of-N Elementary Conditions

by Marek Sikora

2024, Machine learning and knowledge extraction

This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY

descriptionView Paper arrow_downwardDownload

Concept acquisition in example-based grammar authoring

by Ye-Yi Wang

2024, 2003 IEEE International Conference on Acoustics, Speech, and Signal Processing, 2003. Proceedings. (ICASSP '03).

descriptionView Paper arrow_downwardDownload

Predicción de la intencion de compras usando árboles de decisión

by Jelena Franjković

2024, Millennium: Journal of International Studies

Shopping intention prediction using decision trees. Millenium, 2(4), 13-22. m 4 14 RESUMO Introdução: O preço é um elemento negligenciado na literatura em marketing devido à complexidade da sua gestão e sensibilidade dos clientes sobre as... more

descriptionView Paper arrow_downwardDownload

ENGENHARIAS, TECNOLOGIA, GESTÃO E TURISMO ENGINEERING, TECHNOLOGY, MANAGEMENT AND TOURISM INGENIERÍA, TECNOLOGÍA, ADMINISTRACIÓN Y TURISMO millenium PREVISÃO DE INTENÇÃO DE COMPRA UTILIZANDO ÁRVORES DE DECISÃO SHOPPING INTENTION PREDICTION USING DECISION TREES PREDICCIÓN DE LA INTENCION DE COMPRA...

by Jelena Franjković

2024

RESUMO Introdução: O preço é um elemento negligenciado na literatura em marketing devido à complexidade da sua gestão e sensibilidade dos clientes sobre as mudanças de preços. Consequentemente, o processo de tomada de decisões de compra... more

descriptionView Paper arrow_downwardDownload

Rule Induction

Key research themes

1. How do search strategies and heuristics influence rule learning performance and over-searching in inductive rule induction?

2. What methodologies enable effective rule extraction from complex black-box models, particularly support vector machines, enhancing interpretability without compromising performance?

3. How can constructive induction and complex condition formulation extend the expressivity and predictive capacity of rule induction algorithms?

Related Topics

All papers in Rule Induction