Key research themes
1. How do search strategies and heuristics influence rule learning performance and over-searching in inductive rule induction?
This research theme investigates the impact of different search strategies—hill-climbing, beam search, and exhaustive search—and rule evaluation heuristics on the performance and characteristics of rule induction algorithms. It addresses the over-searching phenomenon, where increasing search effort may deteriorate learning performance, by examining the interplay between search mechanisms and heuristics. Understanding this interaction is critical for optimizing rule learning algorithms to balance theory size, predictive accuracy, and rule generality.
2. What methodologies enable effective rule extraction from complex black-box models, particularly support vector machines, enhancing interpretability without compromising performance?
A key challenge in machine learning is extracting comprehensible symbolic rules from high-performance but opaque models like support vector machines (SVMs). This theme explores learning-based and decompositional approaches for rule extraction that convert SVM decision boundaries into human-readable rules, facilitating trust, explanation, and validation especially in high-stakes domains such as medicine. The theme includes evaluation of techniques that treat SVMs as black boxes and generate rule sets approximating SVM predictions while maintaining accuracy.
3. How can constructive induction and complex condition formulation extend the expressivity and predictive capacity of rule induction algorithms?
Traditional rule induction algorithms typically generate rules with simple logical conditions, which may limit their ability to capture complex relationships in data. This theme investigates methodologies for constructive induction—creating new features or complex rule conditions such as M-of-N combinations—and how these enhance the descriptive and predictive capabilities of rule learning. The research also addresses practical aspects such as heuristic control and knowledge-driven user guidance to manage combinatorial explosion and improve model interpretability.