Academia.eduAcademia.edu

We observe that only 5 rules (i.e., R1, R5, R7, R9, R10) are potentially interesting among the top ten rules considering the quality of data they are computed from. With data quality-awareness, the other rules (R2, R3, R4, R6, R8) are not interesting despite a good rank in the top ten list. It’s also interesting to notice that the profit per rule predicted by (Wang et al., 2005) may be considerably counterbalanced by the cost of the rule computed from low-quality data (although it depends from initial costs defined in Table 3.3). The second best rule R2 whose predicted profit is $61.73 has a cost of $109.5 and thus is classified as not interesting due to the low quality of its data sets.

Table 3 We observe that only 5 rules (i.e., R1, R5, R7, R9, R10) are potentially interesting among the top ten rules considering the quality of data they are computed from. With data quality-awareness, the other rules (R2, R3, R4, R6, R8) are not interesting despite a good rank in the top ten list. It’s also interesting to notice that the profit per rule predicted by (Wang et al., 2005) may be considerably counterbalanced by the cost of the rule computed from low-quality data (although it depends from initial costs defined in Table 3.3). The second best rule R2 whose predicted profit is $61.73 has a cost of $109.5 and thus is classified as not interesting due to the low quality of its data sets.