Can I get some help with these questions? File Attached, let's make a deal.

Exercises

  1. What is the Apriori property?

  2. Following is a list of five transactions that include items A, B, C, and D:

  • T1 : { A,B,C }

  • T2 : { A,C }

  • T3 : { B,C}

  • T4 : { A,C,D}

Which itemsets satisfy the minimum support of 0.5? (Hint: An itemset may include more than one item).

  1. How are interesting rules identified? How are interesting rules distinguished from coincidental rules?

  2. A local retailer has a database that stores 10,000 transactions of last summer. After analyzing the data, a data science team has identified the following statistics:

  • {battery} appears in 6,000 transactions.

  • {sunscreen} appears in 5,000 transactions.

  • {sandals} appears 4,000 transactions.

  • {bowls} appears in 2,000 transactions.

  • {battery, sunscreen} appears in 1,500 transactions.

  • {battery, sandals} appears in 1,000 transactions.

  • {battery, bowls} appears in 250 transactions.

  • {battery, sunscreen, sandals} appears in 600 transactions.

Answer the following questions:

  1. What are the support values of the preceding itemsets?

  2. Assuming the minimum support is 0.05, which itemsets are considered frequent?

  3. What are the confidence values of {battery} -> {sunscreen} and {battery, sunscreen} -> {sandals}? Which of the two rules is more interesting?

  4. List all the candidate rules that can be formed from the statistics. Which rules are considered interesting at the minimum confidence 0.25? out of the interesting rules, which rule is considered the most useful (that is, least coincidental)?