Transforming data into automation, intelligence, and business value
I'm a Data Engineer & Analytics Engineer completing a Data Engineering Apprenticeship at Nashville Software School.
I build end-to-end data pipelines, design medallion architectures, and turn raw data into decisions that drive business outcomes.
- Dual-track training in Data Analytics and Data Engineering: Python, SQL, dbt, Snowflake, Airflow, AWS, Databricks
- Passionate about pipeline architecture, data modeling, and BI storytelling
- Background in e-commerce operations & digital marketing, so I understand the business side of data
- Based in Tennessee, actively targeting Seattle / Pacific Northwest opportunities
Languages: Python | SQL
Data Platforms: Snowflake | Databricks | AWS S3 | PostgreSQL | DuckDB
Orchestration & Pipelines: Apache Airflow | dbt (Core) | Docker | GitHub Actions
BI & Visualization: Power BI (Advanced DAX) | Tableau | Power Query
Libraries: Polars | Pandas | NumPy | Scikit-learn | Matplotlib | Seaborn | Folium
Other: MinIO | FastAPI | AWS CLI | Linux
| Project | Description | Tools |
|---|---|---|
| NPPES Healthcare Provider Analytics | Production-grade ELT pipeline processing 8.85M records (9.9 GB) through a medallion architecture. Analyzes geographic distribution, specialty mix, and provider density across US counties. Orchestrated with Airflow, modeled in dbt, served from Snowflake. | Python, Polars, DuckDB, dbt, Snowflake, Airflow, AWS S3 |
| Brazilian E-Commerce Analytics | Multi-page Power BI dashboard analyzing 99K+ orders across 27 states. Features dynamic DAX narratives, min-max normalized radar scoring, and RANKX-based state performance rankings. | Power BI, DAX, Power Query |
| Global Online Retail Strategic Intelligence | End-to-end BI solution transforming 541K+ raw records into executive insights. Automated Python ETL feeding a multi-page Power BI report with advanced DAX measures and $8.91M revenue analysis. | Python, Pandas, Power BI, DAX |
| From Calls to Crimes: Nashville Public Safety | Capstone analytics project joining 911 call data with crime records to surface spatial and temporal public safety trends across Nashville neighborhoods. | Python, Pandas, Folium, Power BI |
| Telco Customer Churn Prediction | End-to-end ML pipeline predicting customer churn with classification models. Includes feature engineering, model evaluation, and business-framed findings. | Python, Scikit-learn, Pandas |
| COVID-19 Public Health Analytics | End-to-end ELT pipeline ingesting CDC and Census data through Bronze → Staging → Gold layers. State-level mortality analysis, demographic vulnerability scoring, and interactive Streamlit dashboard with choropleth maps. | Python, Snowflake, dbt, Airflow, Streamlit, Plotly |
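
The min-max normalization mentioned for the radar scoring in the Brazilian E-Commerce project can be sketched in Python. This is only an illustration of the general technique, not the DAX used in the dashboard; the function name `min_max_normalize` and the sample values are hypothetical.

```python
def min_max_normalize(values):
    """Rescale a list of numbers to the 0-1 range (min-max normalization)."""
    lo, hi = min(values), max(values)
    # Assumption: when all values are equal there is no spread to rescale,
    # so every score is set to 0.0 to avoid dividing by zero.
    if hi == lo:
        return [0.0 for _ in values]
    return [(v - lo) / (hi - lo) for v in values]


# Hypothetical per-state metric values, for illustration only.
scores = min_max_normalize([10, 20, 30])
print(scores)
```

Putting each metric on the same 0-1 scale is what lets measures with different units share the axes of a single radar chart.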
- Medallion architecture pipelines on Snowflake + dbt (staging → intermediate → marts, incremental models, snapshots)
- Databricks / Apache Spark: distributed processing and large-scale transformation
- Event-driven data systems: S3 listeners and trigger-based pipeline patterns
- Preparing for DP-700 Microsoft Fabric Data Engineer certification
"Turning Data into Decisions, and Decisions into Impact."
Open to connecting with data professionals and hiring managers. Let's talk pipelines, architecture, and business impact.


