Featured Project
Bellabeat Case Study
For Google Data Analytics Certification
Bellabeat case study is the final project in the Google Data Analytics Professional Certification course.


Introduction
How can a Wellness Technology Company Play it Smart?
About the Company: Bellabeat
Bellabeat is a wellness brand company, based in San Francisco, California, that has created a number of smart wearable products for women. Their ecosystem of smart wearables devices that monitor biometric and lifestyle data that can be used as a guide to help women know more about their own bodies and make healthier choices.
Products:
- Bellabeat app- provide users with health data related to their activity
- Leaf Urban – a wellness tracker that can be worn like a bracelet, necklace, or clip
- Time – smart technology tracker, activity, sleep,stress, etc.
- Spring – water bottle that tracks daily water intake
- Bellabeat membership – 24/7 subscription-based progam for users
Scenario
anBellabeat is looking for undiscovered opportunities in the global smart device market. It’s co-founder Urška Sršen would like for me, a new Bellabeat junior data analyst employe, to analyze a public dataset from FitBit Fitness Tracker Data – CC0:Public Domain (made available through [Mobius]). My job is to conduct a market analysis of current trends and use the insights discovered within the FitBit data to help guide the marketing strategy for one of Bellabeat products.
I have selected to use my analysis for Leaf Urban. Leaf Urban is a piece of jewelry worn on the wrist or as a necklace. It is one of Bellabeat’s most popular wellness tracker. Leaf Urban doubles as a wellness and lifestyle (mediation) tracker that focuses on a users wellness goals. More specifically sleeping habits, daily activities, menstrual cycle, stress, sedentary behaviors and more.

ACT
Business task
Analyze existing customer public dataset from FitBit Tracker Fitness to identify potential new growth opportunities and present recommendations for Bellabeat’s marketing strategy for Leaf Urban (bracelet or necklace).
Questions for the analysis
- What are some of the trends discovered in smart device usage?
- How could these trends be applied to Bellabeat customers?
- How could these trends help influence Bellabeat’s marketing strategy?
Key Stakeholders
Urška Sršen: Bellabeat’s cofounder and Chief Creative Officer
Sando Mur: Mathematician and Bellabeat’s cofounder; key member of the Bellabeat executive team
Bellabeat marketing analytics team: A team of data analysts responsible for collecting, analyzing, and reporting data that helps guide Bellabeat’s marketing strategy.
PREPARE
Prepare and Load packages
The dataset contains personal fitness tracking data from 30 **fitbit** users. It is a Public data set from FitBit Fitness Tracker Data – CC0:Public Domain, dataset made available through [Mobius], that you can find on Kaggle. It includes 18 data sets information about daily activity, steps, and heart rate that can be used to explore user’s habits.
About the Dataset
This dataset for this case study was generated by respondents to a distributed survey via Amazon Mechanical Turk between 03.12.2016-05.12.2016. Thirty eligible Fitbit users consented to the submission of personal tracker data, including minute-level output for physical activity, heart rate, and sleep monitoring. Individual reports can be parsed by export session ID (column A) or timestamp (column B). Variation between output represents use of different types of Fitbit trackers and individual tracking behaviors / preferences.
The data set can be downloaded from Kaggle website.
Hypothesis: My reason for selecting to analyze only seven of the 18 datasets for this study is because I am interested in discovering potential opportunities between a user daily activities, including steps and sleep. And how wearing Bellabeat Leaf Urban can help users make better lifestyles choices (food, meditation, decrease stress). I will be looking at the public data from FitBit to find trends of users how who are getting little sleep and who have a sedentary life style can affect their daily activity output. Bellebeat can then use this information and offer a nutritional program that can help improve their users sleep as well as custom mediation exercises.
Dataset tables were renamed
- dailyActivity_merged.csv
- dailyCalories_merged.csv
- dailyIntensities_merged.csv
- dailySteps_merged.csv
- hourlySteps_merged.csv
- minuteSleep_merged.csv
- sleepDay_merged.csv
Dataset Limitations
The limitations for this public dataset are:
- old data (from 2016)
- small sample size
- the data is most likely not representative of all eligible FitBit users with only 30 participants
- missing user characteristics (gender, age, health, lifestyle, location, employment status, etc.)
- missing useful data from users as many did not record any sleep data (or removed their fitbit device when sleeping)
- users were not asked to be aware of when they are using their Fitbit and ot use it for the entire 30 day testing period.
Cleaning the data: What I am looking for
Google Sheets and RStudio (desktop) was used for this case study
Because of the small data sample size I used both Google sheets and R Studio to complete this analysis and also for data visualization.
My Data Cleaning Approach
- identifying variable types
- using the select and filter function to find missing data
- find and eal with missing data and also duplicates
Dates and times needed to be reformatted from ‘text’ to ‘date’
Data formats were inconsistent improved
Duplicate data removed
Some data values were reduced to three decimals places.
RStudio – Desktop
Installing and loading of packages:
- install.packages(“tidyverse”)
- install.packages(“skimr”)
- install.packages(“janitor”)
- install.packages(“here”)
- install.packages(“pillar”)
- library(tidyverse)
- library(skimr)
- library(janitor)
- library(here)
- library(pillar)
I first viewed the csv files in Google sheets to make formatting changes to the dates and time. I imported the files into RStudio (desktop) and created the data frames.
Installing and loading of packages:
Because I am working with RStudio desktop, I imported the csv files directly from my computer directory by first ‘setting the working directory’.
[Workspace loaded from ~/.RData]

PROCESS
During this process I used the head(), colnames(), glimpse() and summary() functions to view more details of the frames and find some commonalities that could be used for further exploration.
head()
colnames()

ANALYZE
loeruams aonaf afaonafd a
SHARE
loeruams aonaf afaonafd a
ACT
loeruams aonaf afaonafd a
“Fusce imperdiet imperdiet massa eu posuere risus ullamcorper vitae. Praesent viverra odio quis gravida nunc ut rutrum erat congue dui vitae.”
Product Review
WordPress plugin
Blog Article
Product Release
SEO Strategy
Online STore
Info Architecture
Businesss Website
Creative Writing
Marketing Campaign
Content Strategy