Answered You can hire a professional tutor to get the answer.
Use the file names to identify the corresponding questions. For instance,Question1.yourname. For all the questions below, use degrees of freedom N -...
• Use the file names to identify the corresponding questions. For instance,Question1.yourname.sas'.
• For all the questions below, use degrees of freedom N - 1.
1. (32 points) Given a dataset, (file data.online.scores.txt) which includes the records of students' exam scores (sample from the population) for the past few years of an online course. The first column students' id, the second column is the mid-term scores, and the third column is the final scores, and data are splitted by tab. Based on the dataset, give out the following statistical description of data. If the result is not integer, then round it to 3 decimal places. Give out the basic statistical description about mid-term scores.
a. (8 p) Max, min
b. (12 p) First quartile Q1, median, third quartile Q3.
c. (4 p) The mean score.
d. (4 p) The mode score.
e. (4 p) Empirical Variance.
2. (25 points) Based on the data of students' score (file data.online.scores.txt). Please normalize the mid-term score using z-score normalization (divided by the empirical standard deviation).
a. (10 p) Compare the empirical variance before and after normalization.
b. (5 p) Given original score of 90, what is the corresponding score after normalization?
c. (5 p) Pearson's correlation coefficient between midterm scores and final scores is:
d. (5 p) Covariance between midterm scores and final scores is:
data.online.score.txt
09795
18399
27981
37284
49284
57891
67989
76867
87876
97781
107583
119099
128584
1377100
148986
158687
166587
176794
186776
199393
208391
2197100
228689
236143
248577
259690
267998
276168
289393
299897
308487
315497
328875
338695
347793
359199
3695100
378083
386278
396481
408089
4194100
429079
438278
447466
455981
467189
479293
487999
499497
506178
517694
529487
538397
543982
558093
565371
5710098
587891
599796
606993
615593
629083
637597
645882
659399
668198
676888
687792
697373
709198
719287
727577
736593
747496
757795
766162
778086
785261
795357
808493
819471
829099
836478
845587
855881
869186
877597
889192
898798
9090100
918897
926893
936180
949597
955576
967790
979198
988486
997992
1007788
1019092
1028295
1038391
1047790
1057485
1068281
1078092
1086490
1096691
1106388
1115064
1127981
1138897
1148784
1157682
1167499
117100100
1187684
1198894
1207782
1217587
1228793
1238290
1247277
1257274
1267892
1276897
1288298
1297897
1306059
1316977
13210095
1337092
1347383
1359295
1366680
13762100
1388395
13974100
1407975
1417289
1426995
1437186
1447796
1458391
1465776
1477182
1488188
1497296
1507687
1517891
1527566
1537496
1549095
1557082
1566586
1579093
1585561
1598090
1609686
1617792
1628595
1638598
1647582
1658280
1668592
1675074
1688199
1697286
1709794
1715474
1727481
1736578
1747687
1758080
1768595
1775883
1786076
1797193
1809395
1815667
1827999
1839597
1846496
1858392
1869190
1877476
1886796
1897661
1909797
1918689
19210096
1938089
1947592
1955459
1967190
1976261
1988197
1996185
2006761
2015872
2027394
2039391
2046356
2057597
2068694
2079195
2085989
2096994
2107099
2116290
2125866
2138992
2145286
2158573
2166389
2179195
2189793
2196868
2208192
2218190
2227787
2239090
2247598
2257796
2266574
2277692
2288388
2295384
2305882
2318392
2326284
2338692
2346189
2357293
2368487
2378382
2387095
2399899
2407496
2417688
2426893
2436885
2448788
2456677
2468590
2479077
2487386
2497496
25095100
2519799
2528783
2537399
2548271
2558396
2568497
2577485
2586988
2599293
2606284
2619297
2625867
2637885
2646896
2657483
2666789
2676377
2685990
26967100
2707496
2717099
2727567
2738092
2748497
2759494
2766695
2778696
2786376
2798083
2807171
2818087
2828388
2839183
28496100
2858699
2868593
2876199
2885182
2897792
2907286
2916890
2928392
2938086
2946179
2959996
2969675
2977480
2987067
2997389
3005664
3016488
3027899
3036497
3047379
30589100
3069797
3078988
3088085
3098597
3106980
3117284
3127787
3137085
3148495
3156989
3168393
3174567
3187596
3197365
3208993
3213851
3225065
3237577
3248985
3258589
3268277
3277892
3288894
3296696
3309499
3317778
3327071
3338976
3345159
3359385
3365885
3378085
3388393
33910098
3409595
3417993
3426572
3437492
3447296
3459088
3466582
3475789
34887100
3497771
3506076
35190100
3528382
3535875
3543751
3558079
3567892
3578795
3586986
3598494
3607369
3619391
3629585
3636699
3648397
3657273
3669895
3678096
3688598
3697792
3706680
37187100
3725892
3737486
3747990
3757684
3766764
3776876
3785095
3797474
3807295
3818088
3829471
3837688
3848594
3854485
3865888
3875664
3888284
3897776
3908093
3919697
3926888
3935285
3946476
39592100
3968384
3975894
3989187
3997693
4008178
4019998
4029595
4036395
4047080
4058196
4065774
4079285
4089189
4095286
4108083
4118091
4127074
4139191
4148999
4158177
4167583
4178598
4188278
4198397
4204443
4218799
4228984
4238988
4245685
4258696
4267494
4277699
4288888
4298495
4305878
4317196
4328686
4334873
4348386
4357495
4367085
4379387
4385888
4394574
4409484
4418897
4427777
4439792
4448297
4459697
4467072
4478195
4486196
4498291
4507877
4518287
4526067
45396100
4545696
4556269
4568891
45710092
4589196
4597191
4607087
4617483
4628377
4637271
464100100
4655791
4669986
4678098
4688084
4698194
4706263
4716681
4726399
4738593
4746783
4756497
47687100
4778385
4788289
4797690
4805167
4815762
4828498
4837387
4847187
4859484
4868194
4877398
4888998
4899299
4904568
49110094
49289100
4937694
4949087
4959699
4968181
4979198
4985881
4999693
5006599
5016995
5026886
5035877
5047699
5056779
5066488
5078499
5086576
50994100
5105279
5118490
5128985
5139199
5148593
5155589
5165887
5179299
5189495
5197783
5206492
5219397
5227981
5236676
5247996
5257660
5268890
5278067
5285071
5296685
5308197
5319697
5327487
5336982
5349483
5358276
5368795
5375063
5387188
5397087
5407994
5418993
5426285
5435850
5448299
5457484
5466181
54710096
5487086
5497088
5509496
55169100
5526389
55382100
5546775
5558590
5567178
5576795
5587493
5597193
5606576
56185100
56291100
5639492
5649076
56590100
5666174
5678983
5685477
5697898
5708598
5719296
57291100
5739996
5749096
5755470
57610099
5776272
5786681
5799197
5807185
5817691
5827699
5839390
5847174
5858396
58684100
5876082
5889984
5898797
5909298
5919089
5929598
5938792
5948484
5958492
5968692
5977873
5989079
5998899
6004360
6018885
6028497
6038169
6048897
6056583
6065779
6078086
6087884
6098986
6106995
6117792
61297100
61395100
6147589
6158999
6167584
6177296
6189295
6199986
6209592
6219493
6228995
6237798
6247589
6257678
6265786
62759100
6289783
6299198
6306575
6315867
6327195
6336654
6345735
6358786
6369185
6379096
6387398
6398193
6405851
6417271
6426753
6437288
6449289
6458286
6467992
6479293
6489482
6497378
6507588
6517697
65272100
6538380
6547494
6559999
65666100
6574971
6587983
6597294
6608382
6615786
6627987
6639198
6647372
6655694
6666785
6676499
6688169
6699195
6707880
6715262
6726987
6735779
6747293
6758095
6768391
6776193
6787696
67985100
6806696
6816679
6828191
6838697
6849697
6857688
6866994
6875668
6889193
6895870
6905080
6915996
6925687
6937587
6945167
6956077
6968799
6979291
6987189
6999889
7007066
7018399
7027194
7036988
7049397
7058895
7067979
7078879
7087398
7098494
7107889
7117095
7126283
7137083
7148098
7158190
7168190
7178892
7185277
7197196
7206457
7216796
7227288
7235886
7245887
7258696
72610097
7276583
7288498
7296973
7309688
7319594
7329784
7336988
7345469
7354951
7367998
73797100
7387594
7397581
7409396
7419198
7428698
7437585
7448497
7458987
7468788
7477982
74894100
7499780
7507396
7518488
7529686
7537572
7549694
7558482
7568488
7577797
7584870
7597491
76099100
7619777
7626871
7639597
7649991
7658499
7667788
7675265
7687185
7697194
7706583
7717570
7727585
7737998
7746469
7759397
7767182
7778982
7787685
7797187
7808986
7818086
7827288
7839483
7849496
7857894
7867478
7877587
7886882
7896088
7907899
7915397
7927886
7938177
7947382
7957498
7968097
79785100
79896100
7998197
8009292
8017396
8029495
8039086
80479100
8059592
8068986
8076184
8087179
8093848
8107688
8118393
8128393
8136472
8146487
8155354
8167170
8177472
8188385
8197461
8205473
8216081
8228099
8239390
8248989
8259486
8268199
8276386
828100100
8299784
8307579
8317568
83286100
83384100
8347283
8359494
8369096
8376083
8384281
8397879
8406692
8413735
8428384
8434162
8449799
8457993
84670100
8478596
8487799
8498397
8505697
8517692
85210088
8536897
8548085
8556393
8567794
8576497
8589692
8597774
8606992
8617796
8628393
8636784
8645985
8659490
8667293
8677374
86882100
8694973
8706066
8718797
8727772
8735956
8748890
8757483
8768897
8777885
8786690
8796977
8805055
8817484
8828579
8839796
8849697
88590100
8866689
8877584
8887587
8897799
8907691
8919898
8926265
8938598
8948097
8956683
8965353
8979199
8988795
8997695
9007190
9018698
9025275
9038898
9047179
9059597
9067797
9077795
9089297
9099189
9106480
91190100
9126699
9135768
9145661
9156694
9166382
9178396
9188592
9195883
9207693
9218383
9225694
9238995
9248998
9257293
9268094
9279092
9288085
9299195
9306258
93175100
9328995
9336164
9347298
9355878
9366579
9379796
9389899
9399093
9406086
94187100
9428086
9437289
9447985
9457893
9468498
9476193
9488192
9498390
9509597
9516593
9524670
9535672
9548680
9558490
9567693
9576894
9587792
9599986
9606969
9618978
9629596
9639990
9645161
9657795
9667481
9678590
9686772
9696599
9709195
9718699
9727783
97368100
9746097
9755480
9766998
9777594
9789298
9795881
9807774
9818175
9828489
9839399
9845777
9858186
9864470
9879685
9887493
9898199
9908381
9917889
9926383
9935162
9946674
9959798
9968296
9978591
9988499
9995687
3. (38 points) Given the inventories of two libraries Citadel's Maester Library (CML) and Castle Black's library(CBL), compare the similarity between this two libraries by using the different proximity measures. if the result is not integer, then round it to 3 decimal places.
a. (5 p) Given 200 books, the following table summarizes how many books are supplied by corresponding library in Table 1. In Table 1, for CBL = 0, CML = 0, it corresponds the number of items among the 200 items that are served neither by CBL nor CML. For CBL = 1, CML = 0, it corresponds the number of items among the 200 items that are served by CBL but not CML. So on and so forth. Based on Table 1, calculate the Jaccard coefficient of Citadel's Maester Library (CML) and Castle Black's library(CBL).
Citadel's Maester Library (CML)
Castle Black's library(CBL) 0 1
0 20 120
1 2 58
b. (15 p) For each kind of books, we have multiple copies. Based on all books (treat the counts of the 100 books as a feature vector of the two libraries), (file data.libraries.inventories.txt), calculate the minkowski distance of the two vectors with regard to different h values:
1. h = 1
2. h=2
3. h = ∞
c. (9 p) The Cosine similarity between Citadel's Maester Library (CML) and Castle Black's with regard to the feature vector. (file data.libraries.inventories.txt).
d. (9 p) Kullbac-Leibler divergence between Citadel's Maester Library (CML) and Castle Black's library(CBL) with regard to the feature vector. We denote that there are i_1 of book1 in Citadel's Maester Library (CML), and j_1 of book1 in Castle Black's library(CBL). Assume that someone will pick up a book randomly, the probability of this person to pick up book 1 in Citadel's Maester Library (CML) is i_1 / (i_1 + ... + i_100). Based on this probability distribution, calculate the Kullback-Leibler divergence of these two libraries P(CML || CBL).
DATA Library inventories:
librarybook_1book_2book_3book_4book_5book_6book_7book_8book_9book_10book_11book_12book_13book_14book_15book_16book_17book_18 book_19book_20book_21book_22book_23book_24book_25book_26book_27book_28book_29book_30book_31book_32book_33book_34book_35 book_36book_37book_38book_39book_40book_41book_42book_43book_44book_45book_46book_47book_48book_49book_50book_51book_52 book_53book_54book_55book_56book_57book_58book_59book_60book_61book_62book_63book_64book_65book_66book_67book_68book_69 book_70book_71book_72book_73book_74book_75book_76book_77book_78book_79book_80book_81book_82book_83book_84book_85book_86 book_87book_88book_89book_90book_91book_92book_93book_94book_95book_96book_97book_98book_99book_100
CML5313690103369319551192148793048802427717013741146116186101173110501491393011986972563173130601481296916783 12128133173149105641341372117911635119601401506814519780731283180961119512213264181165129796372187991434913184179 21511276645014446110163183107129162173
CBL1038013216714417210889103636558141116152122129831751161736761114195166854287183217125109719371772091345999147 75501811021731601479790571511001021811181631411461491492501281551481231898018680114102316634592131042513175164214 537911875169192921391221231236113590125118
4. (5 points) The Table 2 is a summary about customers' purchase history of diapers and beer. Calculate the chi-square correlation value. If the result is not integer, then round it to 3 decimal places.
Table 2: Purchase history.
Buy diaper Do not buy diaper
Buy beer 150 40
Do not buy beer 15 3300