View on GitHub

Cameron S. - Data Engineer

Portfolio showcasing my data and programming projects since 2022.

👋 Welcome to My Github Portfolio

Contact Me

📌 Customer Purchase Behavior Analysis

An in-depth behavior analysis using Python to identify key insights on customer behavior, product performance, and sales trends.

-> View Project


TABLE OF CONTENTS

Resources for Aspiring Analysts

📝 Resource 🔗 Links
🎨 Dashboard Color Guide View Guide
📊 Dashboard Color App Try the App
🖥️ Free Datasets List View the list
📈 Graph DOs and DONTs View Article
🔤 REGEXP SQL Basics Part 1 - Part 2
📝 My Data Blog View Blog

Find me on LinkedIn LinkedIn

(back to top)


Python Python

☑️ Data cleaning and Exploring using pandas
☑️ Experienced with pandas, matplotlib, plotly for graphing and data visualization
☑️ Running and testing code chunks in Jupyter, Kaggle, VS code Notebooks
☑️ Variety of Python projects, including web scraping, data analysis, machine learning, and web apps
☑️ Coursera ‘Programming for everybody’ Certificate Certainly! Here is the table with the “Libraries” column removed:

Project Link Technology Project Description
✂️ background-removal-tool Logo A simple tool that removes backgrounds from images using the rembg library. Built with Flask and deployed on Docker.
💯 100 Days of Python UdemyLogo Master the Python programming language by building 100 unique projects over 100 days.
🐍 Classic Snake Game Logo A remake of the classic Snake game made in Python.
🛒 Customer Purchase Behavior Analysis Logo An in-depth behavior analysis to identify key insights on customer behavior, product performance, and sales trends.
👮 SLC Police Cases 2011-2014 Logo Analyzed police case data to identify and understand crime trends over time and across different locations in Salt Lake City.
🤑 Retail Profit Predictions Logo Performed data cleaning, processed the data for prediction models, and evaluated predictions using both Prophet and Linear Regression.
🌍 Data Extraction using Azure Map API Logo Using the Azure Map API to gather School information in a particular area for our client.
🐒 Survey Monkey Data Processing Logo Process and clean the data from a large dataset of survey responses from SurveyMonkey.
💵 E-commerce Sales Analysis Logo Generated a random transaction dataset and used Python in a Jupyter notebook to explore the data and address key business questions.
📈 Principal Component Analysis (PCA) Logo Using Streamlit, created a simple app that can display data from a CSV file and allow the user to filter different data points to see how they correlate.
🍽️ What’s 4 Dinner? LogoLogo A web app designed to simplify both meal planning and shopping. Programmed with Python, JavaScript, and uses a SQLite local database to track all information.
😐 Review Sentiment Analysis Logo Scraped review data from two competing websites and analyzed the overall sentiment of each review using nltk VADER and huggingface RoBERTa models.
💸 Price Comparison using Selenium Logo Using Selenium and Python, I scraped product prices from two websites based on user input. The prices are then averaged and compared to find the best deal.
🕸️ Python Job Web Scraping Logo Scraped Python job data from pythonjobs.github.io using BeautifulSoup and requests_html, with pagination to gather more job listings.
🎢 Kaggle Coasters Logo Explored and analyzed a dataset of roller coaster information from the early 1900s.
🔢 Serial Number Automation Logo Created a GUI app that processes multiple Excel files with serial numbers of various formats, exporting a formatted Excel file and a PDF.
😴 Rob Sleep Predictions Logo Kaggle competition to predict sleep patterns using a sample dataset and machine learning models such as XGBoost and Optuna.
🤰 Maternity Leave Infographic LogoLogo Created an infographic to provide data on maternity leave trends for business use.
📊 Data Visualization with Pandas and Matplotlib Logo Created a Jupyter notebook with basic visualizations using pandas and matplotlib.
🖩 Aggregate Functions in Pandas Logo Created a Jupyter notebook demonstrating basic aggregate functions in pandas.
🧹 Data Cleaning in Pandas Logo Demonstrated how to clean data using Python and pandas in Jupyter notebooks.
🌪️ Basic Filtering with Pandas Logo A notebook demonstrating various filtering techniques in pandas.
🐱 Merge, Join, and Concat Functions in Pandas Logo Created a Jupyter notebook showcasing merge, join, and concat functions in pandas.
📁 Pandas File Import Basics Logo A detailed guide explaining basic file import methods for various file types (txt, csv, json, xlsx).

(back to top)


SQL SQL

☑️ Data cleaning and data exploring
☑️ Experienced with Syntax in MS SQL, PostgreSQL, MySQL, and SQLite
☑️ CTEs, CASE WHEN, Advanced SQL subqueries
☑️ Google data analytics professional certificate

[!NOTE]
☑️ 📺 Advanced SQL skills on display on my YouTube Channel

Project Link Area of Study Project Description
🍜 8 Week SQL Challenge # 1 Creating Tables, Joins, CTEs, and Date expressions Explored complex queries to clean data, compute customer figures, and organize data in unusual ways.
🍕 8 Week SQL Challenge # 2 Creating and updating Tables, Joins, Subqueries, CTEs, and complex aggregations Used SQL to explore Pizza data and answer questions about metrics, pricing, and customer trends
🐒 Data Lemur SQL Questions CTEs, CASE WHEN, subqueries, advanced JOINs, SQL Optimization Easy, Medium, and Hard SQL questions with multiple joins, subqueries, aggregations, as well as advanced SQL functions and optimization techniques available from Data Lemur
🎓 Khan Academy Advanced SQL TABLE INSERT and DELETE, GROUP BY, HAVING, CASE WHEN, multiple JOINs SQL Queries begin with basic concepts and advance to more complex topics as we progress.
⚡ SQLbolt - SQL lessons Basics of SQL, SELECT, FROM, WHERE, ORDER BY, JOINs, Aggregations, INSERT, DROP, DELETE This collection of SQL queries was learned from SQLbolt, a website that provides interactive tutorials to help users learn SQL (Structured Query Language) and improve their skills.

(back to top)


Excel Power BI

☑️ Experience in building live dashboards
☑️ Knowledge in using various dataflow and import methods to showcase live data
☑️ Developed reports for company management showcasing sales performance over the last four years.
☑️ Completed the DataCamp Intro to Power BI course

Here’s your table with the “Area of Study” column removed:

Project Link Project Description Dashboard Link
🛒 Retail Price Comparison and review analysis Using Selenium and python, I scraped products and their prices to compare who has better pricing. Then I analyzed the reviews and compared rating to sentiment using NLP. Dashboard Link
Logo LinkedIn Engagement Analysis I wanted to analyze my LinkedIn growth and engagement since I started my account in February, and started posting March 27th, 2023. Dashboard Link
🚧 Adventure Works Dashboard Created a dashboard for a fictional manager using the Adventure Works sample database, which involved restoring the database, identifying data needs, cleaning and transforming the data, and importing it into Power BI. Dashboard Link
📑 Data Professional Survey Alex the Analyst conducted a survey among 630 participants in the data technology field to gather insights. Dashboard Link

(back to top)


Tableau Tableau

☑️ Experience in building live dashboards
☑️ Knowledge in using various dataflow and import methods to showcase live data
☑️ Developed reports for company management showcasing sales performance over the last four years.
☑️ Create custom measures and integrate with graphics

Here’s your table with the “Area of Study” column removed:

Project Link Project Description Dashboard Link
👮‍♀️ SLC Crime Reports (2010-2014) Analyzed case data to identify and understand crime trends over time and across different locations in Salt Lake City. Dashboard Link
👩 Crimes Against Women in India (2001-2021) Cleaned and processed crime data, fixed missing data with linear regression and mean estimates. Then visualized data with Matplotlib and a Tableau dashboard. Dashboard Link
🏪 2020 Retail Store Sales Built out multiple sheets to display on a single visual, and created an interactive dashboard. Dashboard Link
Logo LinkedIn Engagement Analysis I wanted to analyze my LinkedIn growth and engagement since I started my account in February, and started posting March 27th, 2023. Dashboard Link
🪐 Intergalactic Age Calculator A fun project made in Tableau to see what your age (in years) would be anywhere in the galaxy! Dashboard Link
Logo Netflix Movies and TV Shows Dashboard to show the most popular movies and TV shows on Netflix up to the year 2019. Dashboard Link
🌎 RWFD Employee Demographics Demographics Dashboard built using the HR data from the RWFD Community Project. Dashboard Link

(back to top)


Excel Excel

☑️ Experience in cleaning data both for projects and real world data sets
☑️ Knowledge in DAX and using custom formulas to alter data
☑️ Using VLOOKUP, HLOOKUP, and XLOOKUP to quickly alter and organize data
☑️ Understanding of Pivot Tables and their importance to displaying insights
☑️ Completed the Linkedin Excel assessment and Excel to SQL certification

Project Link Area of Study Project Description
🚲 Bike Sales Dashboard Pivot Tables, Pivot Charts, DAX Formulas, Data Buckets Using a dataset from Alex the Analyst, Cleaned the data, altered columns to get data I need, used pivot tables and charts to build out a dashboard.
🧈 Customer Churn Data Cleaning, Pivot Charts, Find and Replace, Insights and Business Solutions Used a Dataset about customer churn to explore possible reasons customers cancelled service. Came up with possible solutions that need to be tested. But the Data points to several changes that could be made.
🏢 Employee Turnover Data Cleaning, Data Transform, Vlookup, Pivot Charts Explored a dataset containing employee info and Satisfaction survey results. Let’s see what Insights we can pull from the data.
😠 SLC civilian complaints Basic visualization, Data cleaning, comparison to external sources I found that SLC has a website and app where residents can report complaints to the city. Explored the data for any interesting findings since I live so close to SLC the data seemed relevant
📊 Sales Performance Review Data cleanup, Xlookup, pulling from multiple data sources A 4 year Sales Breakdown across multiple locations

(back to top)


Javascript Javascript

☑️ learning how to call sripts within HTML
☑️ learn coding structure and importance of personal documentation
☑️ Built out a few simple personal projects and websites
☑️ Using APIs and RSS feeds to call live information

Project Link Platform Project Description
🧪 Test Projects Logo A place for several Web apps and JavaScript projects. Including #Javascript30 project files.
🎙️ Anchor Podcast Episode List Logo creates an Episode list from an Anchor RSS feed for a Podcast
📻 Anchor Podcast Latest Episode Logo Similar to the episode list, but Only gets the info for the most recent Episode from the Anchor RSS feed
🔍 Carrd.co Search Bar Logo creates a search bar to search through the Podcast episode list
📊 Countapi Page View Counter Logo creates a Page View counter using the Countapi on https://api.countapi.xyz

(back to top)


R R

☑️ Data cleaning and visualization
☑️ ggplot graphing and data visualization
☑️ Built out a few simple automated scripts through Self Learning

Project Link Libraries Project Description
📱 Comparing Phone Prices in R ggplot, geom_point Explored and cleaned a cell phone price dataset found on Kaggle.
Logo Basics ggplot, dplyr Made a full breakdown detailing the basic functions and uses of the R* programming language.

(back to top)