How many candidate and frequent itemsets will be generated


Homework

Consider the traffic accident data set shown in Table.

Table: Traffic accident data set.

WeatherCondition

Driver'sCondition

TrafficViolation

Seat Belt

CrashSeverity

Good

Alcohol-impaired

Exceed speed limit

No

Major

Bad

Sober

None

Yes

Minor

Good

Sober

Disobey stop sign

Yes

Minor

Good

Sober

Exceed speed limit

Yes

Major

Bad

Sober

Disobey traffic signal

No

Major

Good

Alcohol-impaired

Disobey stop sign

Yes

Minor

Bad

Alcohol-impaired

None

Yes

Major

Good

Sober

Disobey traffic signal

Yes

Major

Good

Alcohol-impaired

None

No

Major

Bad

Sober

Disobey traffic signal

No

Major

Good

Alcohol-impaired

Exceed speed limit

Yes

Major

Bad

Sober

Disobey stop sign

Yes

Minor

Task

• Show a binarized version of the data set.

• What is the maximum width of each transaction in the binarized data?

• Assuming that the support threshold is 30%, how many candidate and frequent itemsets will be generated?

• Create a data set that contains only the following asymmetric binary attributes: (weather = Bad, Driver' s condition = Alcohol-impaired, Traffic violation = Yes, Seat Belt = No Crash Severity = Major) . For Traffic violation, only None has a value of 0. The rest of the attribute values are assigned to 1. Assuming that the support threshold is 30%, how many candidate and frequent itemsets will be generated?

• Compare the number of candidate and frequent itemsets generated in parts (c) and (d).

Format your homework according to the give formatting requirements:

• The answer must be using Times New Roman font (size 12), double spaced, typed, with one-inch margins on all sides.

• The response also includes a cover page containing the student's name, the title of the homework, the course title, and the date. The cover page is not included in the required page length.

• Also include a reference page. The references and Citations should follow APA format. The reference page is not included in the required page length.

Solution Preview :

Prepared by a verified Expert
Database Management System: How many candidate and frequent itemsets will be generated
Reference No:- TGS03104693

Now Priced at $65 (50% Discount)

Recommended (91%)

Rated (4.3/5)