Data Projects

Showcasing my work in data analysis and visualization.

Interactive Music Dashboard (Updates Daily)
  • Insights:

    • dominant genres for each region

    • song reach

    • artist chart dominance

    • time trends

  • Drilldown filters

  • Sort by artist, title, release date, or number of charts by clicking on the header.

  • Fully automated pipeline:

    • Extraction: A Python script collects the Apple Top 100 data from their website

    • Transform and Load: The same script cleans and transforms the data before loading them into a local database

      • The data is further transformed by SQL views in the database

      • Another Python script queries the database daily and loads the output to a google sheet via GCP (Google Cloud Platform) API. (Necessary for automatic data refresh in the free Tableau Public)

    • Visualization: The Tableau dashboard is linked to the google sheet

  • Insights

    • average rankings across platforms

    • differences in user preferences between platforms

    • dominant categories

    • trend information for categories and individual podcasts

  • Quickly find the dominant categories or find a podcast to listen to in any category

  • Drilldown filters

  • Sort by rank, duration, or title

  • Fully automated pipeline:

    • Extraction: A Python script collects the Apple Top 100 podcasts data while another scrapes Spotify

    • Transform and Load: The same scripts clean and transform the data before loading them into a local database

      • The data is further transformed by SQL views in the database

      • Another Python script queries the database daily and loads the output to a google sheet via GCP (Google Cloud Platform) API. (Necessary for automatic data refresh in the free Tableau Public)

    • Visualization: The Tableau dashboard is linked to the google sheet

Interactive Podcast Dashboard (Updates Daily)
National Syndromic Surveillance Program Dashboard