Answered You can hire a professional tutor to get the answer.
(5 points)What is the difference between "Discretization" and "Binarization"? 2.(5 points)What are training and test datasets?
P3
65000
0.8
You are asked to find the most similar two data points. Here we have three data points which are p1, p2 and p3.
a) (5 points) What similarity measure would you use knowing that both attributes are continuous and Ratio? Justify your answer. Also, explain any modification or procedure that should be done on data in order to use your suggested similarity measure.
b) (6 points) Use Euclidean distance to find the most similar data points.
c) (4 points) Normalize your data for each attribute.
d) (6 points) Use Euclidean distance on the normalized data to find the most similar data points.
e) (4 points) Is your answer in part b is different from the one in part d? Which one is more reasonable and why?