Waiting for answer This question has not been answered yet. You can hire a professional tutor to get the answer.

QUESTION

python data analysis project

Introduction

Use five-fold mode and separate the data into training and testing sets.

Data Dictionary description

Variable      Definition                                           Keysurvival       Survival                                               0 = No, 1 = Yespclass          Ticket class                                         1 = 1st, 2 = 2nd, 3 = 3rdsex              Sex              Age             Age in years                   sibsp           # of siblings/spousesparch          # of parents/childrenfare             Passenger fare              embarked  Port of Embarkation                         C = Cherbourg, Q = Queenstown, S = Southampton  

Variable Notes

pclass: A proxy for socio-economic status (SES)1st = Upper2nd = Middle3rd = Lowerage: Age is fractional if less than 1. sibsp: The dataset defines family relations in this way...Sibling = brother, sister, stepbrother, stepsisterSpouse = husband, wife (mistresses and fiancés were ignored)parch: The dataset defines family relations in this way...Parent = mother, fatherChild = daughter, son, stepdaughter, stepsonSome children travelled only with a nanny, therefore parch=0 for them. 

Analysis requirement:

1. How is the survival chance related to the gender?2. How is the survival chance related to age?3. How is the survival chance related to socio-economic status?5. What are the first three most important factors related to the survival chance?6. What is average prediction accuracy?
Show more
LEARN MORE EFFECTIVELY AND GET BETTER GRADES!
Ask a Question