Key research themes
1. How can rule extraction methods enhance interpretability and explainability of black-box machine learning models such as SVMs and neural networks?
This research area focuses on extracting interpretable symbolic rules from complex, opaque machine learning classifiers like Support Vector Machines (SVMs) and Artificial Neural Networks (ANNs). Such rule extraction improves user trust and acceptance, especially in critical domains like medical diagnosis, by providing an explanation of how a decision is reached. Multiple approaches treat the rule extraction as a learning task or analyze internal components to generate human-understandable rules. These methods aim to maintain predictive accuracy while increasing transparency and enabling better data exploration and knowledge refinement.
2. What approaches enable automated extraction of business, manufacturing, and legal rules from unstructured or legacy data sources?
This theme investigates methodologies for automatically extracting structured and formalized rules from diverse, traditionally unstructured data sources including legacy databases, informal documents, regulatory texts, and manufacturing process descriptions. The goal is to reduce manual knowledge acquisition effort, improve business process understanding, and enable compliance automation by transforming textual or database artifacts into machine-interpretable business vocabularies and rules. The papers cover ontology-based acquisitions, constraint-based modeling with NLP, natural language processing for informal and regulatory documents, and legacy system mining techniques, often utilizing semantic technologies or heuristic algorithms.
3. How can grammar-based and rule-based methods be developed for extracting semantic and structural rules from complex data, including graphs and multi-word terms, to support advanced natural language processing and sound design?
This theme addresses the extraction of formal grammatical or linguistic rules from highly structured or complex data forms, such as semantic meaning representations (graphs), multi-word lexical terms, and parameter spaces in generative systems. The goal is to develop methods that facilitate parsing, formal representation, and interpretability of semantic structures or parameter influences. Included are graph grammar extraction algorithms enabling polynomial-time parsing, rule-based lexical multi-word term extraction combined with lemmatization for inflected languages, and rule extraction for perceptual generalization in sound design. These methods enable improved semantic understanding and automation in applications ranging from NLP to computational music.