Data Projects
Showcasing my work in data analysis and visualization.
Interactive Music Dashboard (Updates Daily)
Insights:
dominant genres for each region
song reach
artist chart dominance
time trends
Drilldown filters
Sort by artist, title, release date, or number of charts by clicking a column header.
Fully automated pipeline:
Extraction: A Python script collects the Apple Top 100 chart data from Apple's website
Transform and Load: The same script cleans and transforms the data before loading it into a local database
The data is further transformed by SQL views in the database
Another Python script queries the database daily and loads the output into a Google Sheet via the Google Cloud Platform (GCP) API. (This is necessary for automatic data refresh in the free Tableau Public.)
Visualization: The Tableau dashboard is linked to the Google Sheet
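The transform, load, and SQL view steps above can be sketched roughly as follows. This is a minimal illustration, not the actual pipeline code: it assumes SQLite as the local database, and the table `chart_entries`, the view `song_reach`, the column names, and the sample rows are all hypothetical.

```python
import sqlite3

# Hypothetical scraped rows: (chart_date, region, rank, title, artist, genre)
RAW_ROWS = [
    ("2024-01-01", "US", "1", "  Song A ", "Artist X", "Pop"),
    ("2024-01-01", "GB", "3", "Song A", "Artist X", "Pop"),
    ("2024-01-01", "US", "2", "Song B", "Artist Y", "Hip-Hop"),
    ("2024-01-01", "US", "4", "", "Artist Z", "Rock"),  # no title, dropped
]


def clean_rows(raw_rows):
    """Trim whitespace, cast rank to int, and drop rows without a title."""
    cleaned = []
    for chart_date, region, rank, title, artist, genre in raw_rows:
        title = title.strip()
        if not title:
            continue
        cleaned.append(
            (chart_date, region.strip(), int(rank), title, artist.strip(), genre.strip())
        )
    return cleaned


def build_database(conn, rows):
    """Load cleaned rows and define a view that does the further transformation."""
    conn.execute(
        """
        CREATE TABLE IF NOT EXISTS chart_entries (
            chart_date TEXT, region TEXT, chart_rank INTEGER,
            title TEXT, artist TEXT, genre TEXT
        )
        """
    )
    conn.executemany("INSERT INTO chart_entries VALUES (?, ?, ?, ?, ?, ?)", rows)
    # Song reach: how many regional charts a song appears on, and its best rank.
    conn.execute(
        """
        CREATE VIEW IF NOT EXISTS song_reach AS
        SELECT title, artist,
               COUNT(DISTINCT region) AS num_charts,
               MIN(chart_rank) AS best_rank
        FROM chart_entries
        GROUP BY title, artist
        """
    )
    conn.commit()


def fetch_song_reach(conn):
    """Query the view, widest reach first."""
    return conn.execute(
        "SELECT title, artist, num_charts, best_rank "
        "FROM song_reach ORDER BY num_charts DESC, title"
    ).fetchall()


if __name__ == "__main__":
    conn = sqlite3.connect(":memory:")  # a real pipeline would use a file on disk
    build_database(conn, clean_rows(RAW_ROWS))
    for row in fetch_song_reach(conn):
        print(row)
```

Keeping the aggregation in a view means the daily export script only has to run a plain SELECT, and the dashboard logic stays in one place.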
Insights:
average rankings across platforms
differences in user preferences between platforms
dominant categories
trend information for categories and individual podcasts
Quickly find the dominant categories or find a podcast to listen to in any category
Drilldown filters
Sort by rank, duration, or title
Fully automated pipeline:
Extraction: A Python script collects the Apple Top 100 podcast data while another scrapes Spotify
Transform and Load: The same scripts clean and transform the data before loading it into a local database
The data is further transformed by SQL views in the database
Another Python script queries the database daily and loads the output into a Google Sheet via the Google Cloud Platform (GCP) API. (This is necessary for automatic data refresh in the free Tableau Public.)
Visualization: The Tableau dashboard is linked to the Google Sheet
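As a rough sketch of how the cross-platform averages and the daily export could fit together, the example below combines Apple and Spotify ranks in a SQL view and shapes the query result as a header row plus data rows, the two-dimensional layout a spreadsheet expects. SQLite, the table `podcast_ranks`, the view `avg_rank_by_podcast`, and the sample data are assumptions for illustration. The upload call itself is left out because the page does not say which Google client library the script uses.

```python
import sqlite3

# Hypothetical combined rows: (chart_date, platform, podcast, category, chart_rank)
SAMPLE_RANKS = [
    ("2024-01-01", "apple", "Show A", "News", 1),
    ("2024-01-01", "spotify", "Show A", "News", 3),
    ("2024-01-01", "apple", "Show B", "Comedy", 2),
    ("2024-01-01", "spotify", "Show C", "Comedy", 1),
]


def create_schema(conn):
    """Create the rank table and a view that averages ranks across platforms."""
    conn.execute(
        """
        CREATE TABLE podcast_ranks (
            chart_date TEXT, platform TEXT, podcast TEXT,
            category TEXT, chart_rank INTEGER
        )
        """
    )
    # One row per podcast per day; a platform's rank is NULL if the show is not charted there.
    conn.execute(
        """
        CREATE VIEW avg_rank_by_podcast AS
        SELECT chart_date, podcast, category,
               MAX(CASE WHEN platform = 'apple' THEN chart_rank END) AS apple_rank,
               MAX(CASE WHEN platform = 'spotify' THEN chart_rank END) AS spotify_rank,
               AVG(chart_rank) AS avg_rank
        FROM podcast_ranks
        GROUP BY chart_date, podcast, category
        """
    )


def sheet_rows(conn, chart_date):
    """Return [header, row, row, ...] for one day, best average rank first."""
    cur = conn.execute(
        "SELECT podcast, category, apple_rank, spotify_rank, avg_rank "
        "FROM avg_rank_by_podcast WHERE chart_date = ? "
        "ORDER BY avg_rank, podcast",
        (chart_date,),
    )
    header = [col[0] for col in cur.description]
    return [header] + [list(row) for row in cur.fetchall()]


if __name__ == "__main__":
    conn = sqlite3.connect(":memory:")
    create_schema(conn)
    conn.executemany("INSERT INTO podcast_ranks VALUES (?, ?, ?, ?, ?)", SAMPLE_RANKS)
    for row in sheet_rows(conn, "2024-01-01"):
        print(row)  # these rows would be handed to the sheet upload step
```

Missing ranks come back as `None`, so a show that charts on only one platform still appears in the output with a blank cell for the other.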
Interactive Podcast Dashboard (Updates Daily)
National Syndromic Surveillance Program Dashboard
Insights:
Mental health-related ER visits increased sharply during the COVID lockdown period but have leveled off since.
Teens are the only group that has visited the ER more frequently for depressive disorders than for anxiety disorders.
Teenage admissions also seem to be higher during months when school is in session than during breaks.
Data source: