Skip to content

The code, data, plots, proposal, tests, readme for capstone-1 project about work-life-balance from the authentic happiness

Notifications You must be signed in to change notification settings

scottschmidl/Lifestyle-and-Wellbeing

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

96 Commits
 
 
 
 
 
 
 
 
 
 

Repository files navigation

happy girl

Table of Contents

Background and Motivation

For most of my life I have been into exercising, but for most of my life I was not consistent and I never got into lifting weights. It was not until the fall of 2017 were it became important to me as a lifestyle and now I do it almost every day. My motivation was I had realized how much better I felt after a good workout. It was then that I put the pieces together; those of exercising, eating right, the various stimulus one might encounter in a day and how one reacts to said stimulus can affect your day to day wellness. I strongly believe that one can prevent even conquer ailments such as stress and depression with the above remedies. This is what guided me to finding the ‘Lifestyle_and_Wellbeing_Data’ on kaggle.com.

Data

The ‘Lifestyle_and_Wellbeing_Data’ was gathered using 12,757 survey responses, from July 2015 until February 2020. The 'bal_score' column is the sum of the values of the questions for each survey.

Raw Data:

timestamp fruits_veggies daily_stress places_visited core_circle supporting_others social_network achievment donation bmi_range todo_complete flow daily_steps live_vision sleep_hours lost_vacation daily_shouting sufficient_income personal_awards time_for_passion daily_meditation age male_female bal_score
7/7/15 3 2 2 5 0 5 2 0 1 6 4 5 0 7 5 5 1 4 0 5 36 to 50 Female 60
7/7/15 2 3 4 3 8 10 5 2 2 5 2 5 5 8 2 2 2 3 2 6 36 to 50 Female 78
7/7/15 2 3 3 4 4 10 3 2 2 2 2 4 5 8 10 2 2 4 8 3 36 to 50 Female 80
7/7/15 3 3 10 3 10 7 2 5 2 3 5 5 0 5 7 5 1 5 2 0 51 or more Female 80
7/7/15 5 1 3 3 10 4 2 4 2 6 0 5 0 7 0 0 2 8 1 5 51 or more Female 66

Table Info:

Questions

I am interested in discovering which groups have the better or worse work-life balance according to this test.

Which brings me to my question:

Which demographic, amongst three different groups, has the better or worse work-life balance when compared to others in their group?

Closer Look

With these questions in mind I begin to take a closer look at my data. The first thing I did was look at the columns and inspect for areas with bad data. I added a 'bal_score' column which is the sum of the answers to each question, as they are numerical in nature, and found the min and max possible scores.

From min_max_poss.py:

min possible score = 4 max possible score = 169

I, then, isolated the data three times and by the columns: males and females, age ranges, and finally by males and females by ages.

Upon completion I moved on to inspect the means of those three areas described below:

Means to note: 'males' are lower than females, '21 to 35' is lower than the others, and 21m to 35 is lower than the others. The latter is not what I initially expected.

Visualization

The above results lead me to the below distributions. All of the results below are around 80 +- 5 over and 3 under. It appeared that this had the potential to be normally distributed, since 80 is about half.

As I was unsure of the exact distribution I decided to plot against a kernel density estimation. KDE is a non-parametric estimate of the PDF. Non-parametrized means that the data distrubtion is unknown or known but with uncertain parameters. As you can see from the below graphs the data is nearly normal distributed with some quite large standard deviations.

Figure 1: Comparing the balance scores of males and females

Figure 2: Comparing the balance scores of four age ranges

Figure 3: Comparing the balance scores of males and females by age

The above distributions, while nice, left me wanting a slightly better way to visualize what was actually gone on with the work-life balance scores. I made the below box plots to extract the min, first quartile, median, third quartile, and max values easily visible.

Figure 4: Comparing 5-number summary of males and females

Figure 5: Comparing 5-number summary of four age ranges

Figure 6: Comparing 5-number summary of males and females by age

Conclusion

On figure 4 one can see that the females have a plot that is more nestled and with a higher mean. This leads me to conclude that more of their values are around the mean with stronger values in the north than in the south. If you compare this to the males who are more spread out to the south when compared to the females, then to north, one can conclude that the females have a much better handle on their work-life balance. These results lead me to not rejecting my eariler hypothesis.

On figure 5 one can see that not only is the mean lower on ages '21 to 35', but their plot also has less values to the north of the mean. Ages '51 or more' have the highest mean. I am compeled to conclude that ages '21 to 35' have the hardest time managing their work-balance, and therefore I can not reject my earlier hypothesis.

On figure 6 one can see that 'males, 21 to 35' have the lowest mean and overall lower values with 'males 36 to 50' and 'females 51 or more' leading the pack. These results are not what I expected and compele me to reject my earlier hypothesis and now I am interested to know why this is the case.

Photo and Data Credits

The picture in my heading was webscraped from the main website:

Link to main website:

http://www.authentic-happiness.com/

The data was acquired from:

https://www.kaggle.com/ydalat/lifestyle-and-wellbeing-data

Extras

Link to Work-Life Balance Test:

http://www.authentic-happiness.com/your-life-satisfaction-score

Questions and range of respones on test:

Q1) How many steps(in thousads) do you typically walk everyday?

[1, 2, 3, 4, 5, 6, 7, 8, 9, 10]

Q2) How many people are very close to you?

[1, 2, 3, 4, 5, 6, 7, 8, 9, 10]

Q3) How many people are very close to you?

[0, 1, 2, 3, 4, 5, 6, 7, 8, 9, 10]

Q4) With how many people do you interact during a typical day?

[0, 1, 2, 3, 4, 5, 6, 7, 8, 9, 10]

Q5) How many times do you donate your time or money to good causes?

[0, 1, 2, 3, 4, 5]

Q6) How sufficient is your income to cover basic life expensives?

[1, 2]

Q7) Of how many remarkable achievements are you proud?

[0, 1, 2, 3, 4, 5, 6, 7, 8, 9, 10]

Q8) How many hours do you spend everyday doing about what you are passionate?

[0, 1, 2, 3, 4, 5, 6, 7, 8, 9, 10]

Q9) In a typical week, how many times do you have the opportunity to think about yourself?

[0, 1, 2, 3, 4, 5, 6, 7, 8, 9, 10]

Q10) How many recognitions have you received in your life?

[0, 1, 2, 3, 4, 5, 6, 7, 8, 9, 10]

Q11) For how many years ahead is your life vision very clear?

[0, 1, 2, 3, 4, 5, 6, 7, 8, 9, 10]

Q12) How many days of vacation do you typically lose every year?

[0, 1, 2, 3, 4, 5, 6, 7, 8, 9, 10]

Q13) What is your body mass index range?

[1, 2] #1(below 25); 2(above 25)

Q14) How many new places do you visit?

[0, 1, 2, 3, 4, 5, 6, 7, 8, 9, 10]

Q15) How well do you complete your weekly to-do lists?

[0, 1, 2, 3, 4, 5, 6, 7, 8, 9, 10]

Q16) How many people do you help achieve a better life?

[0, 1, 2, 3, 4, 5, 6, 7, 8, 9, 10]

Q17) How many fruits or vegetables do you eat everyday?

[0, 1, 2, 3, 4, 5]

Q18) In a typical day, how many hours do you experience 'flow'?

flow - mental state, in which you are fully immersed in peforming an activity. You then experience a feeling of energized focus, full involvement, and enjoyment in the process of this activity.

[0, 1, 2, 3, 4, 5, 6, 7, 8, 9, 10]

Q19) How much stress do you typically experience everyday?

[0, 1, 2, 3, 4, 5]

Q20) How often do you shout or sulk at somebody?

[0, 1, 2, 3, 4, 5, 6, 7, 8, 9, 10]

About

The code, data, plots, proposal, tests, readme for capstone-1 project about work-life-balance from the authentic happiness

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published