Executive Summary

Data Sources Used

FitBit Fitness Tracking Data on Kaggle

Bellabeat Product Website

Data Cleaning Process

The first step I took when analyzing the collected data was to compare which features were unique to each device, and then determine what features were missing from the Bellabeat app. The below table shows the results of that comparison:

Tracked Data FitBit Bellabeat
Daily Activity Yes Yes
Calories Yes No
Intensities Yes No
Steps Yes Yes
Heart Rate Yes Yes
Sleep Yes Yes
Weight Yes No
Cycle No Yes


From here, I wanted to look at the FitBit data to see how consumers were using the features available. Using R, I imported the data files to inspect the data and determine if there were any errors.

# Load "tidyverse" library

library(tidyverse)
## ── Attaching packages ─────────────────────────────────────── tidyverse 1.3.2 ──
## ✔ ggplot2 3.3.6     ✔ purrr   0.3.4
## ✔ tibble  3.1.8     ✔ dplyr   1.0.9
## ✔ tidyr   1.2.0     ✔ stringr 1.4.0
## ✔ readr   2.1.2     ✔ forcats 0.5.1
## ── Conflicts ────────────────────────────────────────── tidyverse_conflicts() ──
## ✖ dplyr::filter() masks stats::filter()
## ✖ dplyr::lag()    masks stats::lag()
# Import FitBit datasets

fit_activity <- read_csv('dailyActivity_merged.csv', show_col_types = FALSE) %>% 
    select(-"Calories")
fit_calories <- read_csv("dailyCalories_merged.csv", show_col_types = FALSE)
fit_intensity <- read_csv("dailyIntensities_merged.csv", show_col_types = FALSE)
fit_steps <- read_csv("dailySteps_merged.csv", show_col_types = FALSE)
fit_heart <- read_csv("heartrate_seconds_merged.csv", show_col_types = FALSE)
fit_sleep <- read_csv("sleepDay_merged.csv", show_col_types = FALSE)
fit_weight <- read_csv("weightLogInfo_merged.csv", show_col_types = FALSE)
# Merge common columns into one dataset by matching "Id" column

fit_join <- merge(x = fit_activity, y = fit_calories, by = "Id")
fit_join <- merge(x = fit_activity, y = fit_intensity, by = "Id")
fit_join <- merge(x = fit_activity, y = fit_weight, by = "Id")

# Used 'head()' function to preview the combined dataset

head(fit_join)
##           Id ActivityDate TotalSteps TotalDistance TrackerDistance
## 1 1503960366    4/16/2016      12669          8.16            8.16
## 2 1503960366    4/16/2016      12669          8.16            8.16
## 3 1503960366    4/18/2016      13019          8.59            8.59
## 4 1503960366    4/18/2016      13019          8.59            8.59
## 5 1503960366    4/15/2016       9762          6.28            6.28
## 6 1503960366    4/15/2016       9762          6.28            6.28
##   LoggedActivitiesDistance VeryActiveDistance ModeratelyActiveDistance
## 1                        0               2.71                     0.41
## 2                        0               2.71                     0.41
## 3                        0               3.25                     0.64
## 4                        0               3.25                     0.64
## 5                        0               2.14                     1.26
## 6                        0               2.14                     1.26
##   LightActiveDistance SedentaryActiveDistance VeryActiveMinutes
## 1                5.04                       0                36
## 2                5.04                       0                36
## 3                4.71                       0                42
## 4                4.71                       0                42
## 5                2.83                       0                29
## 6                2.83                       0                29
##   FairlyActiveMinutes LightlyActiveMinutes SedentaryMinutes
## 1                  10                  221              773
## 2                  10                  221              773
## 3                  16                  233             1149
## 4                  16                  233             1149
## 5                  34                  209              726
## 6                  34                  209              726
##                   Date WeightKg WeightPounds Fat   BMI IsManualReport
## 1 5/2/2016 11:59:59 PM     52.6     115.9631  22 22.65           TRUE
## 2 5/3/2016 11:59:59 PM     52.6     115.9631  NA 22.65           TRUE
## 3 5/2/2016 11:59:59 PM     52.6     115.9631  22 22.65           TRUE
## 4 5/3/2016 11:59:59 PM     52.6     115.9631  NA 22.65           TRUE
## 5 5/2/2016 11:59:59 PM     52.6     115.9631  22 22.65           TRUE
## 6 5/3/2016 11:59:59 PM     52.6     115.9631  NA 22.65           TRUE
##          LogId
## 1 1.462234e+12
## 2 1.462320e+12
## 3 1.462234e+12
## 4 1.462320e+12
## 5 1.462234e+12
## 6 1.462320e+12

Summary of Analysis

When looking at the FitBit datasets, I noticed that a majority of the tracked data points were also represented by Bellabeat products. The data that the Bellabeat app did not provide were the calories burned, activity intensities at various time intervals, and weight. It is not clear from the available data whether Bellabeat is missing out on potential customers from the lack of these features.

An interesting observation that came up is the decline of user’s calories burned from the beginning of the tracked dates until the end of the dates.
an image caption Source: Tracked data shows decline in calories burned over the course of the month.
Looking at this plotted data, we can see how the overall daily calories burned decreases as the month goes on. This could be the result of diminishing interest among participants as data recording went on, or this could be from a lack of a scheduled reminders to exercise from their fitness tracking app.

Recommendation 1: Bellabeat should begin tracking calories burned and focus on consistent activity reminders in order to encourage user participation in healthy activities.

According to a 2015 study on workout intensity (2015), high-intensity exercise directly relates to more calories burned, as opposed to low-intensity exercise, and of course, sedentary lifestyles. This conclusion can be seen in the FitBit 30-day fitness data by comparing exercise intensity with calories burned. an image caption Source: Tracked data shows increase in calories burned over the course of the month compared to high-intensity exercise.

Recommendation 2: Bellabeat should encourage and/or recommend High-Intensity exercise for users seeking to eliminate more calories in their daily activities. Adding the capability for the Bellabeat app to track exercise intensity could be beneficial to many in their user base.

Review Based On Analysis

Based on the trends identified in my analysis, I recommend the following:

Wrapup

Today we discussed different methods for the Bellabeat marketing team to enhance the Bellabeat app for their users.

Thank you for time and I hope this analysis is beneficial to the improvement of the Bellabeat app!

References

Falcone PH, Tai CY, Carson LR, Joy JM, Mosman MM, McCann TR, Crona Kp, Kim MP, Moon JR. 2015. “Caloric Expenditure of Aerobic, Resistance, or Combined High-Intensity Interval Training Using a Hydraulic Resistance System in Healthy Men.” Denver, Colorado: MusclePharm Sports Science Institute. https://doi.org/10.1519/JSC.0000000000000661.