Association Mining

354 papers

6 followers

About this topic

Association mining is a data mining technique used to discover interesting relationships, patterns, or correlations among a set of items in large datasets. It identifies frequent itemsets and generates association rules, which help in understanding the co-occurrence of items and can inform decision-making in various domains.


Key research themes

1. How can algorithmic scalability and efficiency be improved in frequent itemset discovery for association mining?

This research theme addresses computational challenges in discovering frequent itemsets efficiently from large-scale transactional databases. It explores algorithmic strategies that reduce I/O overhead, manage complex search spaces using structural decompositions, and adapt processing routines dynamically to dataset characteristics. Efficient frequent itemset mining is critical because the exponential search space and repeated data scans severely impact scalability in practical applications.

Scalable Algorithms for Association Mining

by Lisa Chen

2016

Key finding: Introduced algorithms (e.g., Eclat, MaxEclat) using a vertical tid-list database format and lattice-theoretic decomposition to partition the search space into manageable sublattices processed in-memory. This method minimizes... Read more


Mining Frequent Patterns Based on Data Characteristics

by lan vu

2023

Key finding: Proposed DFEM, an algorithm combining FP-growth and Eclat techniques with a dynamic runtime threshold that adapts its mining strategy to database sparsity and density. DFEM automatically chooses the most efficient mining... Read more


Adaptive and resource-aware mining of frequent sets

by Paolo Palmerini

2025, 2002 IEEE International Conference on Data Mining, 2002. Proceedings.

Key finding: Presented DCI, an algorithm that adaptively switches from horizontal counting-based mining to vertical tidlist intersection-based mining as the pruned database shrinks to fit in memory. It includes heuristics adjusting to... Read more


Implementation of Association Rule Mining using Reverse Apriori Algorithmic Approach

by Kanwalvir Singh Dhindsa

2022

Key finding: Proposed the reverse Apriori algorithm, which enhances the classical Apriori's efficiency by scanning transactions in reverse order and leveraging existing frequent patterns more effectively. This approach demonstrated... Read more



2. How can association rule mining be integrated effectively with classification to improve predictive accuracy and reduce rule redundancy?

This theme investigates combining association rule mining with classification tasks to develop classifiers based on association rules that maintain high predictive accuracy while generating fewer, less redundant rules. The research focuses on integrating itemset generation with rule generation, applying measures like information gain, and filtering rule conflicts within the mining process. These techniques aim to yield compact and interpretable classifiers improving over traditional classification or separate mining-classification pipelines.

A new approach to classification based on association rule mining

by imam maulana

2017

Key finding: Presented GARC, a classification algorithm that integrates information gain measures into candidate itemset generation, merges frequent itemset mining with rule generation, and embeds redundancy and conflict avoidance... Read more


Apriori Algorithm and Hybrid Apriori Algorithm in the Data Mining: A Comprehensive Review

by Joyece Jane

2023

Key finding: Reviewed Apriori and hybrid Apriori-TID algorithms, highlighting that hybrid methods combining transaction and itemset information classification can better handle large itemsets and improve classification accuracy. The study... Read more



3. What are the methodological advancements and limitations in interpretability and evaluation of association rule interestingness and common-sense knowledge integration?

This research area focuses on evaluating and improving the measures used to identify meaningful and actionable association rules, including confidence, support, lift, and novel probabilistic or statistical models. It also explores approaches for semantic interpretation of association rules via frameworks like semantic frames and their application in building common-sense knowledge bases, thus enhancing the semantic richness and usability of mined association rules.
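The three standard measures named above can be illustrated on a toy example. The following is a minimal sketch, assuming a hypothetical four-transaction database; the item names and the helper functions `support`, `confidence`, and `lift` are illustrative and are not taken from any paper listed here.

```python
from fractions import Fraction

# Hypothetical toy transactions (illustrative only).
transactions = [
    {"bread", "milk"},
    {"bread", "butter"},
    {"bread", "milk", "butter"},
    {"milk"},
]

def support(itemset, db):
    """Fraction of transactions that contain every item in `itemset`."""
    itemset = set(itemset)
    return Fraction(sum(itemset <= t for t in db), len(db))

def confidence(antecedent, consequent, db):
    """Estimate of P(consequent | antecedent) for the rule antecedent -> consequent."""
    both = set(antecedent) | set(consequent)
    return support(both, db) / support(antecedent, db)

def lift(antecedent, consequent, db):
    """Confidence divided by the consequent's support; 1 means no association."""
    return confidence(antecedent, consequent, db) / support(consequent, db)

print(support({"bread", "milk"}, transactions))       # 1/2
print(confidence({"bread"}, {"milk"}, transactions))  # 2/3
print(lift({"bread"}, {"milk"}, transactions))        # 8/9
```

Here the rule bread -> milk has confidence 2/3 but lift below 1, which shows why confidence alone can overstate a rule: milk is already present in 3 of 4 transactions.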

Implications of Probabilistic Data Modeling for Mining Association Rules

by Thomas Reutterer

2016

Key finding: Developed a simple probabilistic framework modeling transaction data as independent Bernoulli trials to simulate random, no-association data. Using real and simulated datasets, the study showed that confidence is influenced... Read more


Replacing Support in Association Rule Mining

by Rosa Meo

2025, Technologies for Infrequent and Critical Event Detection

Key finding: Proposed a Bayesian statistical framework that replaces the traditional support measure in association rule mining with probabilistic criteria based on posterior probability estimations. This approach addresses limitations... Read more


Mapping Dependency Relationships into Semantic Frame Relationships

by A. S. Perera

2022

Key finding: Refactored the RelEx2Frame component of the OpenCog AGI framework by integrating the Drools rule engine and supervised/statistical methods aided by WordNet to expand concept variables. Association mining on semantic frames... Read more


Visual Grouping of Association Rules by Clustering Conditional Probabilities for Categorical Data

by Ranadhir Ghosh

2025, Business Applications and Computational Intelligence

Key finding: Proposed a visualization and clustering method based on conditional probabilities of association rules to help non-technical users interpret large sets of categorical association rules. This approach addresses the rare item... Read more



All papers in Association Mining

Computers and

by Abdellah AZMANI

2025

In the present paper a model of a multi-agent based system is presented, which helps marketers on the one hand to direct their products to the best targets and on the other hand to generate relevant product recommendations for customers... more


Incremental Maintenance of Frequent Itemsets in Evidential Databases

by Mohamed Anis Bach Tobji

2025, Lecture Notes in Computer Science

In the last years, the problem of Frequent Itemset Mining (FIM) from imperfect databases has been sufficiently tackled to handle many kinds of data imperfection. However, frequent itemsets discovered from databases describe only the... more


A Potential Causal Association Mining Algorithm for Screening Adverse Drug Reactions in Postmarketing Surveillance

by John Tran

2025, IEEE Transactions on Information Technology in Biomedicine

leverage, two traditional frequency-based measures. Among the top 50 signal pairs (i.e., enalapril versus symptoms) ranked by the potential causal-leverage measure, the physicians on the project determined that eight of them probably... more


Load balancing in a massively parallel semantic database

by Artyom Shaposhnikov

2025



Finding “persistent rules”: Combining association and classification results

by Karthik Rajasethupathy

2025, Expert Systems with Applications

Different data mining algorithms applied to the same data can result in similar findings, typically in the form of rules. These similarities can be exploited to identify especially powerful rules, in particular those that are common to... more


Chapter 8 DepMiner: A Method and a System for the Extraction of Significant Dependencies

by Rosa Meo

2025

We propose DepMiner, a method implementing a simple but effective model for the evaluation of itemsets, and in general for the evaluation of the dependencies between the values assumed by a set of variables on a domain of finite values.... more


On rethinking organizational document genres for electronic document management

by Pasi Tyrväinen

2025, Proceedings of the 32nd Annual Hawaii International Conference on Systems Sciences. 1999. HICSS-32. Abstracts and CD-ROM of Full Papers

Document management has to be rethought and clarified in organizations, especially for the coordinated adoption of organization-wide electronic document management systems (EDMSs). This paper reports the identification and evaluation of... more


Video data mining: semantic indexing and event detection from the association perspective

by Xindong Wu

2024, IEEE Transactions on Knowledge and Data Engineering

Advances in the media and entertainment industries, including streaming audio and digital TV, present new challenges for managing and accessing large audio-visual collections. Current content management systems support retrieval using... more


Recommendation Technologies: Survey of Current Methods and Possible Extensions

by Alexander Tuzhilin

2024, Social Science Research Network

The paper presents a survey of the field of recommender systems and describes current recommendation methods that are usually classified into the following three main categories: content-based, collaborative, and hybrid recommendation... more


The Distributed Algorithms in Mining Association Rules

by Mirela Pater

2024

With the ever-growing database sizes, we have enormous quantities of data, but unfortunately we cannot use raw data in our day-to-day reasoning/decisions. We desperately need knowledge. This knowledge is in most cases in the gathered data,... more


Overview of the GUHA method as a data mining technique

by Ivan Chorbev

2024

This paper is concerned with current applications of and research on GUHA, a method for hypothesis generation. The GUHA method is very promising in the field of association rule data mining. Some of the current software implementations of... more


Advancements and Applications in Association Rule Mining A Review of Key Algorithms and Future Directions

by FERI SULIANTA

2024

Association rule mining is a crucial data mining technique used to uncover relationships between variables in large datasets. This paper provides a comprehensive review of various association rule algorithms, including Apriori, FP-Growth,... more


CoMMA: a framework for integrated multimedia mining using multi-relational associations

by MUHAMMAD AHMAD

2024, Knowledge and Information Systems

Generating captions or annotations automatically for still images is a challenging task. Traditionally, techniques involving higher-level (semantic) object detection and complex feature extraction have been employed for scene... more


An associative watermarking based image authentication scheme

by Professor Aboul Ella Hassanien

2024, 2010 10th International Conference on Intelligent Systems Design and Applications

In this paper, we propose an associative watermarking scheme which is based on the concept of Association Mining Rules (AMRs) and the ideas of Vector Quantization (VQ) and the Sobel operator. Performing associative watermarking rules to... more


A systemic framework for the field of data mining and knowledge discovery

by Zhengxin Chen

2024

This paper proposes a systemic framework that attempts to define the domain and major areas of Data Mining and Knowledge Discovery (DMKD). Grounded theory approach, a qualitative method that inductively develops an understanding of... more


CoMMA: a framework for integrated multimedia mining using multi-relational associations

by Ayaan Ahmad

2024, Knowledge and Information Systems


A Comparative Discussion on Various Modern Video Retrieval Strategies

by Keerthi B Lingam

2024, Advances in intelligent systems and computing

In the recent past, wide ranges of video retrieval processes were presented by different researchers. In order to boost the ease of access of video clip, keen applications, which have item removal, video purchasing, video clip healing and... more


Experiences of Using a Quantitative Approach for Mining Association Rules

by Christos Tjortjis

2024, Springer eBooks

In recent years interest has grown in "mining" large databases to extract novel and interesting information. Knowledge Discovery in Databases (KDD) has been recognised as an emerging research area. Association rules discovery is an... more


An Efficient Spark-Based Hybrid Frequent Itemset Mining Algorithm for Big Data

by nermin othman

2024, Data



Development of Top K-Association Rule Mining for Discovering pattern in Medical Dataset

by Sachin Jain

2024

Association rules consist of the discovery of association between mining transaction items. This is one of the most important information mining jobs. It has been integrated into many commercial data mining software and has a wide variety... more


Comparative evaluation of pattern mining techniques: an empirical study

by Anindita Borah

2024, Complex & Intelligent Systems

Pattern mining has emerged as a compelling field of data mining over the years. Literature has bestowed ample endeavors in this field of research ranging from frequent pattern mining to rare pattern mining. A precise and impartial... more


A novel Boolean algebraic framework for association and pattern mining

by Hatim Aboalsamh

2024, WSEAS Transactions on Computers archive

Data mining has been defined as the non-trivial extraction of implicit, previously unknown and potentially useful information from data. Association mining and sequential mining analysis are considered as crucial components of strategic... more


Template guided association rule mining from XML documents

by Maseud Rahgozar

2024

Compared with traditional association rule mining in the structured world (e.g. Relational Databases), mining from XML data is confronted with more challenges due to the inherent flexibilities of XML in both structure and semantics. The... more


Scalable technique to discover items support from trie data structure

by Noraziah Ahmad

2024

One popular and compact trie data structure for representing frequent patterns is the frequent pattern tree (FP-Tree). Two scans of the original database are required before the FP-Tree can be constructed. One of... more


Minimizing Space Time Complexity by RSTDB a New Method for Frequent Pattern Mining

by Vaibhav Singh

2024, Proceedings of the First International Conference on Intelligent Human Computer Interaction

Data-mining is the extraction of meaningful patterns from the large source of data. Association Rule Mining (ARM) is an important data mining technique. Mining of frequent patterns is a very important association rule mining problem. The... more


User-Driven Association Rule Mining Using a Local Algorithm

by Fabrice Guillet

2024, Proceedings of the 11th International Conference on Enterprise Information

One of the main issues in the process of Knowledge Discovery in Databases is the Mining of Association Rules. Although a great variety of pattern mining algorithms have been designed for this purpose, their main problems lie in the... more


A 2D-3D visualization support for human-centered rule-mining

by Fabrice Guillet

2024


Enhancing N-List Structure and Performance for Efficient Large Dataset Analysis

by IJCSMC Journal

2024, International Journal of Computer Science and Mobile Computing (IJCSMC)

One of the main challenges in data-intensive sectors like scientific research, data mining, and machine learning is efficiently analyzing enormous datasets. A popular data structure in similarity search algorithms to speed up the... more

Fig. 5. Running time on the Chess dataset

F1 = {a, b, c, d, e} is the set of frequent 1-itemsets. Fig. 1 displays a PPC-tree. The node with (4, 5) indicates that the item name is c, the count is 4, and the pre-order is 4.

is displayed in Fig. 3. The node in the lower left corner of Figure 3 illustrates the itemset "bceaf" and registers item b.

A new and improved itemset representation has been created for frequent itemset mining. We present a successful method for mining frequent itemsets based on the DiffNodesets structure: DiffNodesets. Sometimes, DiffNodesets can achieve great efficiency by just enumerating frequently occurring itemsets without creating candidates. In other scenarios, to find frequently occurring itemsets, DiffNodesets employs a hybrid search strategy combined with a set enumeration tree.

Fig. 4. Running time on the mushroom dataset

The PrePost algorithm shows running time performance like the Fin method up to the min sup criterion of 1%. In this sense, the suggested algorithm is better than the others.

The datasets in Table 2 are subjected to memory consumption tests in this part, with the same test parameter settings as in the preceding running time experiments. Figs. 8, 9, 10, and 11 display the memory utilization results from the experiment. Fig. 8 illustrates the memory required to store the user input data for the Fin, PrePost, DiffNodesets, and EN-list algorithms on the mushroom dataset. Mushroom is a relatively small dataset, and the RAM requirements of PrePost and DiffNodesets are likewise modest. The PrePost approach still requires a large amount of RAM to mine frequently occurring itemsets, even with a lower threshold. The least amount of memory is used by the EN-list method at each min sup value (5%, 10%, 15%, 20%, 25%). The EN-list and DiffNodesets techniques demand less memory than the PrePost algorithm when the min sup is lower. The memory utilization of EN-list, Fin, PrePost, and DiffNodesets on the chess dataset is shown in Fig. 9. The performance of the PrePost algorithm deteriorates drastically as the threshold falls below 15%. The EN-list technique gives the best memory use for the threshold settings and dataset supplied. The memory results for the EN-list, Fin, PrePost, and DiffNodesets algorithms on the Kosarak dataset are shown in Fig. 10. Of all of them, the EN-list method offers the most dependable and effective memory performance. PrePost encounters memory overflow at almost every threshold value. Fig. 11 displays the memory required for the EN-list, Fin, PrePost, and DiffNodesets algorithms on the pumsb dataset. Here too, the EN-list method has the most efficient and reliable memory performance of all the alternatives. PrePost encounters memory overflow across the entire range of threshold values. The PPC tree is a special kind of tree structure that the PrePost algorithm uses. There are usually more transactions than nodes in the PPC tree when using the N-list structure, which is what the PrePost algorithm employs. Therefore, the EN-list approach usually requires less memory than the Fin, PrePost, and DiffNodesets algorithms. These experiments show that the best technique for mining frequent itemsets is EN-list, which requires the least amount of memory for all min sup values.

Fig. 9. Memory consumption on the Chess dataset

Fig. 10. Memory consumption on the Kosarak dataset

Fig. 11. Memory consumption on the pumsb dataset

CHARACTERISTICS OF EXPERIMENTAL DATASETS.


Risk Assessment of Heavy Metals in Abandoned Mine Lands as Signifcant Contamination Problem in Romania

by Győző Jordán

2024


Maintenance of generalized association rules with multiple minimum supports

by Wen-Yang Lin

2024, Intelligent Data Analysis

Mining generalized association rules among items in the presence of taxonomy has been recognized as an important model in data mining. Earlier work on generalized association rules confined the minimum supports to be uniformly specified... more


Multi-Sorted Inverse Frequent Itemsets Mining for Generating Realistic No-SQL Datasets (Discussion Paper)

by Edoardo Serra

2023, SEBD

The development of novel platforms and techniques for emerging "Big Data" applications requires the availability of real-life datasets for data-driven experiments, which are however not accessible in most cases for various reasons, e.g.,... more


Finding “persistent rules”: Combining association and classification results

by Anthony Scime

2023, Expert Systems With Applications


Finding Persistent Strong Rules

by Anthony Scime

2023, Advances in data mining and database management book series

Data mining is a collection of algorithms for finding interesting and unknown patterns or rules in data. However, different algorithms can result in different rules from the same data. The process presented here exploits these differences... more


An efficient stream mining technique

by Hatim Aboalsamh

2023, WSEAS Transactions on Information Science and Applications

Abstract: Stream analysis is considered as a crucial component of strategic control over a broad variety of disciplines in business, science and engineering. Stream data is a sequence of observations collected over intervals of time. Each... more


Complete Discovery of Weighted Frequent Subtrees in Tree-Structured Datasets

by Maseud Rahgozar

2023

Mining frequent subtree patterns has many useful applications in XML mining, bioinformatics, network routing, etc. Most of the frequent subtree mining algorithms (such as FREQT, TreeMiner and CMTreeMiner) use anti-monotone property in the... more


Mining Indirect Positive and Negative Association Rules

by Prof A Govardhan

2023, Communications in Computer and Information Science

Indirect association is a new kind of infrequent pattern, which provides a new way for interpreting the value of infrequent patterns and can effectively reduce the number of uninteresting infrequent patterns. The concept of indirect... more


A framework for the automated generation of paradigm-adaptive summaries of games

by Gowri Srinivasa

2023, International journal of computer applications in technology

We present a framework to analyse text streams with minute details of a game and generate summaries for multiple paradigms of desired lengths (as determined by the user). Multiple paradigms refer to a summary that is player-specific,... more


Frequent itemset mining using cellular learning automata

by reza roshani

2023, Computers in Human Behavior

A core issue of the association rule extraction process in the data mining field is to find the frequent patterns in the database of operational transactions. If these patterns are discovered, the decision making process and determining... more


Role of Segment Progressive Filter in Dynamic Data mining

by Dr Sohail Asghar

2023

Association rule mining is perhaps the most widely described technique among the mining paradigms. Temporal association rule mining, within association rule mining, tries to find relations among items in datasets. The temporal association... more


Pruning Large Data Sets for Finding Association rule in cloud: CBPA (Count Based Pruning Algorithm )

by Dr. Nidhi Khurana

2023

Organizations are more interested in the interesting data rather than the bulk of data. So they need a systematic and scientific approach to extract meaningful data out of heaps of the data and to find out the relations among these... more


BloomEclat: Efficient Eclat Algorithm based on Bloom filter

by sina abbasi

2023

Eclat is an algorithm that finds frequent itemsets. It uses a vertical database and calculates item's support by intersecting transactions. However, Eclat suffers from the exponential time complexity of calculating the intersection of... more

Figure 3: The accuracy of the proposed algorithm for different k.

S. Abbasi / JAC 53, issue 1, June 2021, pp. 197-208

Figure 4: The comparison of the proposed method with the size FP = 0.001, for different values of k and m, with the normal Eclat algorithm.

Figure 5: The values of m (size of Bloom Filter) for different values of k.


Revised ECLAT Algorithm for Frequent Itemset Mining

by Dr.Sarika Khandelwal

2023, Advances in intelligent systems and computing

Frequent and infrequent itemset mining are trending in data mining techniques. The pattern of Association Rule (AR) generated will help decision maker or business policy maker to project for the next intended items across a wide variety... more

In the horizontal layout, each transaction T_i is represented as T_i: (tid, I), where tid is the transaction identifier and I is an itemset containing the items occurring in the transaction. The initial transaction database consists of all transactions T_i. In the vertical layout, each item i_k in the item base B is represented as i_k: {i_k, t(i_k)}, and the initial transaction database consists of all items in the item base. For both layouts, it is possible to encode tids in bit format, and a combination of both layouts can also be used [7], [8]. Figure 1 illustrates the horizontal and vertical layouts of data representation from [7]. The item base B consists of {a,b,c,d,e}, and each transaction is allocated a unique identifier (tid). This is clearly visible in the horizontal format. To switch to the vertical format, the items {a,b,c,d,e} are organized so that each item is listed with its corresponding tids. Once this is done, the support of each item can be read off by counting that item's tids.
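The switch from horizontal to vertical layout can be sketched in a few lines. This is a minimal illustration, assuming a hypothetical four-transaction database over the item base {a,b,c,d,e}; the data and the function name `to_vertical` are made up for this example and do not reproduce Figure 1.

```python
from collections import defaultdict

# Hypothetical horizontal database: tid -> itemset of that transaction.
horizontal = {
    1: {"a", "b", "c"},
    2: {"a", "c", "d"},
    3: {"b", "c", "e"},
    4: {"a", "b", "c", "e"},
}

def to_vertical(db):
    """Invert tid -> items into item -> tidset t(item)."""
    vertical = defaultdict(set)
    for tid, items in db.items():
        for item in items:
            vertical[item].add(tid)
    return dict(vertical)

vertical = to_vertical(horizontal)

# In the vertical layout the support of an item is the size of its tidset.
supports = {item: len(tids) for item, tids in vertical.items()}
print(sorted(supports.items()))
# [('a', 3), ('b', 3), ('c', 4), ('d', 1), ('e', 2)]
```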

Figure 2. Search tree for {a,b,c,d,e} with null set

Eclat starts with prefix {}, and the search tree is the initial search tree. To divide the initial search tree, it picks the prefix {a}, generates the corresponding equivalence class, and does frequent itemset mining in the subtree of all itemsets containing {a}. This subtree is divided further into two subtrees by picking the prefix {ab}: the first consists of all itemsets containing {ab}, the other of all itemsets containing {a} but not {b}. The process is recursive until all itemsets in the initial search tree are visited. The search tree of an item base {a,b,c,d,e} is shown in Figure 2.
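The prefix-based recursion can be sketched as a depth-first search over tidsets. This is a simplified illustration of the general Eclat idea, not the pseudocode of Figure 3; the function name `eclat`, the toy tidsets, and the minimum support of 2 are assumptions made for this example.

```python
def eclat(prefix, items, min_support, out):
    """Depth-first Eclat over one equivalence class.

    `items` is a list of (item, tidset) pairs; each tidset is already
    restricted to the transactions that contain `prefix`.
    """
    for i, (item, tids) in enumerate(items):
        if len(tids) < min_support:
            continue  # infrequent: no superset in this branch can be frequent
        itemset = prefix + (item,)
        out[itemset] = len(tids)
        # Equivalence class of `itemset`: intersect its tidset with the
        # tidsets of the items that come later in the ordering.
        suffix = [(other, tids & other_tids) for other, other_tids in items[i + 1:]]
        eclat(itemset, suffix, min_support, out)
    return out

# Hypothetical vertical database (item -> tidset).
vertical = {
    "a": {1, 2, 4},
    "b": {1, 3, 4},
    "c": {1, 2, 3, 4},
    "d": {2},
    "e": {3, 4},
}

frequent = eclat((), sorted(vertical.items()), 2, {})
for itemset, sup in sorted(frequent.items()):
    print("".join(itemset), sup)
```

With these inputs the search visits the subtree under prefix {a} first, then {b}, and so on; item d is pruned at the root because its tidset has only one tid.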

When using the diffset format, we have d(PX) instead of t(PX), where d(PX) = t(P) - t(X) is the set of tids in t(P) but not in t(X). Similarly, d(PY) = t(P) - t(Y). The support of PX is therefore not the size of its diffset. By the definition of d(PX), |t(PX)| = |t(P)| - |t(P) - t(X)| = |t(P)| - |d(PX)|. In other words, sup(PX) = sup(P) - |d(PX)|. Refer to the illustration in Figure 4. To use the diffset format, the initial transaction database in vertical layout is first converted to diffset format, in which the diffset of an item is the set of tids whose transactions do not contain that item. This follows from the definition of diffset: the initial transaction database in vertical layout is an equivalence class with the prefix P = {}, so the tidset of P includes all tids, all transactions contain P, and the diffset of an item i is d(i) = t(P) - t(i), the set of tids whose transactions do not contain i. From this initial equivalence class, we can generate all itemsets with their diffsets and supports. dEclat differs from Eclat in step 5: instead of generating a new tidset, a new diffset is generated. dEclat has been shown to achieve significant improvements in performance and memory usage over traditional Eclat (tidset), especially on dense databases. When the database is sparse, however, it loses its advantage over tidsets. The authors of [5] therefore suggested starting with the tidset format for sparse databases and switching to the diffset format once a switching condition is met. From this starting point, postdiffset is proposed.
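The identity sup(PX) = sup(P) - |d(PX)| can be checked numerically. The following is a small sketch, assuming hypothetical tidsets over four transactions; the variable names are illustrative only.

```python
# Hypothetical vertical database (item -> tidset) over tids 1..4.
vertical = {
    "a": {1, 2, 4},
    "b": {1, 3, 4},
    "c": {1, 2, 3, 4},
    "d": {2},
    "e": {3, 4},
}
all_tids = {1, 2, 3, 4}  # t(P) for the empty prefix P = {}

# Initial conversion: d(i) = t({}) - t(i), the tids NOT containing item i.
item_diffsets = {item: all_tids - tids for item, tids in vertical.items()}
item_supports = {item: len(all_tids) - len(d) for item, d in item_diffsets.items()}

# One extension step with P = a, X = b: d(ab) = t(a) - t(b).
t_a, t_b = vertical["a"], vertical["b"]
d_ab = t_a - t_b
sup_ab = len(t_a) - len(d_ab)  # sup(PX) = sup(P) - |d(PX)|

print(sorted(item_diffsets["a"]), item_supports["a"])  # [3] 3
print(sorted(d_ab), sup_ab)                            # [2] 2
print(sup_ab == len(t_a & t_b))                        # True
```

On this dense toy data each diffset is smaller than the corresponding tidset (for example d(c) is empty while t(c) has four tids), which is the effect that the dense-database advantage described above relies on.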

Figure 6. Performance on diffset, sortdiffset and postdiffset in chess, mushroom, retail and T10I4D100K

Referring to Figure 6, on the dense datasets, postdiffset loses to diffset by 63% and to sortdiffset by 44% in chess. In mushroom, postdiffset outperforms diffset by 23% and sortdiffset by 84%. In the sparse dataset category, postdiffset outperforms diffset by 94% and 95% in retail and T10I4D100K, respectively. It also outperforms sortdiffset by 99% in both the retail and T10I4D100K datasets.

Figure 3. Pseudocode for Eclat algorithm

5.2. dEclat (Diffset)

7.1. Empirical Results

All experiments are performed on a Dell N5050 with an Intel® Pentium® CPU B960 @ 2.20 GHz and 8 GB RAM on a 64-bit Windows 7 platform. The algorithms are developed with open source software: MySQL version 5.6.20 (MySQL Community Server, GPL) as the database server, Apache/2.4.10 (Win32) OpenSSL/1.0.11 PHP/5.5.15 as the web server, PHP as the programming language, and phpMyAdmin version 4.2.7.1, the latest stable version, to handle the administration of MySQL over the Web. phpMyAdmin[91] is a free software tool written in PHP that supports a wide range of operations on MySQL, MariaDB and Drizzle. The database characteristics are shown in Table 1. For ease and speed of experimentation, we have modified the datasets to contain only a thousand rows of itemsets, which are randomly processed for mining purposes. Our experimentation covers the dEclat (diffset), com-Eclat (sortdiffset) and postdiffset algorithms because, in our past experiments on postdiffset implementation in frequent itemset mining, traditional Eclat (tidset) always ranked behind those three (3) algorithms in performance and memory usage. Figure 6 shows the performance evaluation with regard to execution time (in seconds) on four (4) datasets: chess, mushroom, retail and T10I4D100K.


Robust counterpart optimization for the redundancy allocation problem in series-parallel systems with component mixing under uncertainty

by Roya Soltani

2023, Applied Mathematics and Computation

In this paper, a robust optimization approach is used to solve the redundancy allocation problem (RAP) in series-parallel systems with component mixing where uncertainty exists in components' reliabilities. In real world, the... more


A Theoretical Framework for Association Mining based on the Boolean Retrieval Model

by Aladdin Hafez

2023, Lecture Notes in Computer Science

Data mining has been defined as the non-trivial extraction of implicit, previously unknown and potentially useful information from data. Association mining is one of the important sub-fields in data mining, where rules that imply certain... more


A Matrix Approach for Association Mining

by Aladdin Hafez

2023

Association Mining, a class of data mining techniques, is one of the most researched fields in data mining, where algorithms are designed to discover rules that reflect dependencies among values of an attribute. Because of the vast amounts... more


An efficient stream mining technique

by Aladdin Hafez

2023, WSEAS Transactions on Information Science and Applications


An efficient time series data mining technique

by Aladdin Hafez

2023, Proceedings of the 12th …

Data Mining is the process of discovering potentially valuable patterns, associations, trends, sequences and dependencies in data. Data mining techniques can discover information that many traditional business analysis and statistical... more


CoMMA: a framework for integrated multimedia mining using multi-relational associations

by Muhammad A Ahmad

2023, Knowledge and Information Systems


Mining Software Change History in an Industrial Environment

by Methanias Colaço Júnior

2023

Version control systems are among the type of repositories that are frequently explored as sources of software change history. They can be mined to identify associations between software module modifications. This information is useful to... more


webSPADE: a parallel sequence mining algorithm to analyze web log data

by Ayhan Demiriz

2023

In this work we made a study of several other works where association and sequence mining techniques were applied to the field of web usage mining. This report is to be submitted to classification to the Data Mining course at the phd... more
