Data Engineering Projects & Analytics Solutions

End-to-end data programs with measurable business impact across retail, e-commerce, and media industries.

Portfolio

Client missions

Enterprise engagements

Product Analytics with GA4

GA4 data modeling across e-commerce and financial services clients using dbt and Dataform. Medallion architecture with specialized marts for user behavior and product interactions. FinOps optimization for cost and performance. Deliverables: Looker Studio (formerly Data Studio) dashboards.

GA4, dbt, Dataform, BigQuery, Kestra, FinOps, Looker Studio (formerly Data Studio)

Challenge

Raw GA4 exports were inconsistent and difficult to reuse across teams and industries.

Solution

Medallion architecture (bronze, silver, gold) with dbt and Dataform. Specialized marts for user behavior, product interactions, and simulations. FinOps strategies for cost optimization. Looker Studio (formerly Data Studio) dashboards for analytics delivery.

Results

Unified GA4 models enabling product insights and reliable KPI tracking. Cross-platform expertise in dbt and Dataform. Looker Studio (formerly Data Studio) dashboards delivered to business teams. Saves 6 hours per week in analytics delivery.

2
Platform implementations
+6h
Saved per week

GA4 FinOps & Performance Optimization

Refactoring GA4 workloads in BigQuery: query optimization, partitioning, clustering, and governance implementation.

BigQuery, GA4, SQL, FinOps, Kestra

Challenge

GA4 queries were inefficient, costly, and written in inconsistent patterns, so the workloads needed refactoring.

Solution

Refactored queries with partitioning, clustering, and query optimization. Implemented cost monitoring and governance guidelines.
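
As an illustration of the partitioning and clustering step, here is a minimal sketch in Python that builds BigQuery DDL to consolidate GA4's daily `events_YYYYMMDD` export shards into one date-partitioned, clustered table. The function name, the dataset and table names, the selected columns, and the choice of clustering columns are assumptions made for the example, not the queries used on the engagement.

```python
from datetime import date


def build_partitioned_events_ddl(
    source_dataset: str,
    target_table: str,
    start: date,
    end: date,
    cluster_cols: tuple[str, ...] = ("event_name", "user_pseudo_id"),
) -> str:
    """Return DDL that rebuilds a range of GA4 daily shards as one partitioned, clustered table.

    Hypothetical helper: cluster_cols must be columns present in the SELECT list below.
    """
    if start > end:
        raise ValueError("start must not be after end")
    if not 1 <= len(cluster_cols) <= 4:
        # BigQuery accepts at most four clustering columns.
        raise ValueError("cluster_cols must contain 1 to 4 columns")
    first, last = start.strftime("%Y%m%d"), end.strftime("%Y%m%d")
    return (
        f"CREATE OR REPLACE TABLE `{target_table}`\n"
        "PARTITION BY event_dt\n"
        f"CLUSTER BY {', '.join(cluster_cols)} AS\n"
        "SELECT\n"
        # GA4 exports event_date as a 'YYYYMMDD' string, so it is parsed to a DATE.
        "  PARSE_DATE('%Y%m%d', event_date) AS event_dt,\n"
        "  event_name,\n"
        "  user_pseudo_id,\n"
        "  event_timestamp\n"
        f"FROM `{source_dataset}.events_*`\n"
        # Filtering on _TABLE_SUFFIX limits which daily shards are scanned.
        f"WHERE _TABLE_SUFFIX BETWEEN '{first}' AND '{last}'"
    )


if __name__ == "__main__":
    print(
        build_partitioned_events_ddl(
            "my_project.analytics_123456",  # hypothetical GA4 export dataset
            "my_project.marts.ga4_events",  # hypothetical target table
            date(2024, 1, 1),
            date(2024, 1, 31),
        )
    )
```

Downstream queries that filter on `event_dt` then read only the matching partitions, which is where most of the cost reduction in this kind of refactoring tends to come from.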

Results

Refactored GA4 workloads with about 30% lower average query cost and roughly 2x faster queries for analytics teams.

-30%
Average query cost
2x
Query speed

Marketing Analytics Platform Migration

Migration from Datorama to Modern Data Stack. Technical lead for multi-source integration (Catchr, Couchdrop) and DSP data extraction. dbt standards implementation, platform documentation, and client team training.

dbt, Kestra, GCP, Python, Couchdrop, Catchr

Challenge

Legacy Datorama infrastructure was brittle and required migration to a modern platform.

Solution

Migrated to Modern Data Stack. Technical lead for multi-source integration (Catchr, Couchdrop) and DSP extraction via Couchdrop. Built dbt standards, documented platform, and delivered client training.

Results

Modern, scalable marketing analytics platform with standardized dbt practices and unified multi-source integration.

2x
Delivery speed
100%
Governed models

Multi-Store Retail Analytics & Monitoring

Refactoring and optimization of multi-store dashboard platform. Dashboard architecture redesign, Row-Level Security (RLS) implementation, consolidated global dashboard, and automated Slack alerting.

dbt, Slack API, Looker Studio (formerly Data Studio), GCP, GA4

Challenge

Multi-store dashboards needed secure access, architecture refactoring, and monitoring for missing data.

Solution

Dashboard architecture redesign with Row-Level Security (RLS) for multi-store access. Built consolidated global dashboard and automated Slack alerting for missing store data. Performance optimization.
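
The missing-store check can be sketched roughly as follows, in Python. This is a simplified illustration: the store names, function names, and message wording are invented for the example, and the code only builds the JSON body (a single `text` field, the shape accepted by Slack incoming webhooks) without sending it.

```python
import json
from typing import Iterable, Optional


def find_missing_stores(
    expected_stores: Iterable[str], stores_with_data: Iterable[str]
) -> list[str]:
    """Return the expected stores that reported no rows, sorted for stable output."""
    return sorted(set(expected_stores) - set(stores_with_data))


def build_missing_data_alert(report_date: str, missing: list[str]) -> Optional[str]:
    """Build a JSON message body for Slack, or None when every store reported data."""
    if not missing:
        return None
    text = (
        f":warning: No data received for {len(missing)} store(s) "
        f"on {report_date}: {', '.join(missing)}"
    )
    return json.dumps({"text": text})


if __name__ == "__main__":
    # Hypothetical store identifiers; in practice these would come from a query.
    missing = find_missing_stores(["paris", "lyon", "lille"], ["lyon"])
    print(build_missing_data_alert("2024-01-31", missing))
```

Returning `None` when nothing is missing keeps the alerting channel quiet on normal days, so a message always means action is needed.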

Results

Refactored dashboard platform with secure multi-store access, consolidated views, and automated monitoring.

RLS
Secure access
24/7
Monitoring coverage

Modern Data Foundation & Orchestration

Medallion architecture with Kestra orchestration and Airbyte ingestion. Modular dbt models with marts layer. Automated retry mechanisms, error handling, and data quality framework. Integrated with Funnel for marketing analytics. Delivered across multiple clients covering 3 industries.

Kestra, Airbyte, dbt, BigQuery, Funnel, Shopify

Challenge

Multiple disparate data sources requiring consolidation. Time-consuming manual reporting and lack of confidence in numbers due to fragmented sources.

Solution

Medallion architecture (bronze, silver, gold) with Kestra orchestration. Airbyte for seamless source integration. Modular dbt models with marts layer. Automated retries, error handling, and data quality testing. Funnel integration for marketing analytics.
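
To make the retry idea concrete, here is a generic Python sketch of retrying a task with exponential backoff. It is not Kestra's retry mechanism (which is configured in the orchestrator itself); the function name, the default attempt count, and the delays are assumptions chosen for the example.

```python
import time
from typing import Callable, TypeVar

T = TypeVar("T")


def run_with_retries(
    task: Callable[[], T],
    max_attempts: int = 3,
    base_delay: float = 1.0,
    sleep: Callable[[float], None] = time.sleep,
) -> T:
    """Run task, retrying on any exception with delays of base_delay * 2**n seconds."""
    if max_attempts < 1:
        raise ValueError("max_attempts must be at least 1")
    for attempt in range(1, max_attempts + 1):
        try:
            return task()
        except Exception:
            if attempt == max_attempts:
                # Out of attempts: surface the last error to the caller.
                raise
            sleep(base_delay * 2 ** (attempt - 1))
    raise AssertionError("unreachable")


if __name__ == "__main__":
    attempts = {"count": 0}

    def flaky_extract() -> str:
        # Hypothetical task that fails twice before succeeding.
        attempts["count"] += 1
        if attempts["count"] < 3:
            raise RuntimeError("transient source error")
        return "loaded"

    print(run_with_retries(flaky_extract, base_delay=0.1))
```

Passing `sleep` as a parameter makes the backoff testable without real waiting.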

Results

More than 5 hours saved per week for reporting through automation. 100% confidence restored in numbers. Comprehensive monitoring of net sales, bundle performance, upsell rates, and customer acquisition costs.

+5h
Saved per week
100%
Data confidence

Data Monitoring - Slack-Based Analytics Alert System

Slack monitoring system for dbt pipelines and business KPIs. Automated alerts for failures and errors. Scheduled KPI reporting with thresholds. Interactive Slack commands for data access. Delivered across multiple clients covering 3 industries.

Slack API, Kestra, BigQuery, Python, dbt

Challenge

Data quality issues and dbt failures detected too late. Business teams lacked real-time KPI visibility.

Solution

Slack API integration with dbt and BigQuery. Automated alerts for test failures and pipeline errors. Scheduled business KPI reporting with thresholds. Interactive Slack commands for data access.
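
A threshold check of the kind described above might look like the following Python sketch. The KPI names, the limits, and the `Threshold` and `evaluate_kpis` names are all invented for illustration; real thresholds would be agreed with each business team.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass(frozen=True)
class Threshold:
    """Optional lower and upper bounds for one KPI."""

    minimum: Optional[float] = None
    maximum: Optional[float] = None


def evaluate_kpis(
    values: dict[str, float], thresholds: dict[str, Threshold]
) -> list[str]:
    """Return one human-readable line per KPI that is missing or out of bounds."""
    breaches = []
    for name, limit in thresholds.items():
        if name not in values:
            breaches.append(f"{name}: no value reported")
            continue
        value = values[name]
        if limit.minimum is not None and value < limit.minimum:
            breaches.append(f"{name}: {value} is below minimum {limit.minimum}")
        elif limit.maximum is not None and value > limit.maximum:
            breaches.append(f"{name}: {value} is above maximum {limit.maximum}")
    return breaches


if __name__ == "__main__":
    # Hypothetical daily figures and limits.
    todays_values = {"net_sales": 800.0, "cac": 45.0}
    limits = {
        "net_sales": Threshold(minimum=1000.0),
        "cac": Threshold(maximum=50.0),
        "upsell_rate": Threshold(minimum=0.05),
    }
    for line in evaluate_kpis(todays_values, limits):
        print(line)
```

Treating a missing KPI as a breach means a silent pipeline failure is reported the same way as a bad number.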

Results

Reduced time to detect failures through instant Slack notifications. Scheduled KPI delivery in team channels. Improved collaboration and faster incident response.

Real-time
Alert system
3
Industries covered

Personal R&D

Side projects

Berlioz — AI Nutrition Assistant for Cats

Personal project: RAG chatbot helping cat owners evaluate pet food for renal and urinary conditions. Semantic search over a veterinary knowledge base with URL analysis, OCR label reading, and an AI expert chat.

Vanilla JS, Node.js, Vercel, OpenAI, Upstash Vector, RAG, Groq, Cursor

Challenge

Cat owners managing renal disease or urinary conditions had no reliable tool to evaluate pet food labels against evidence-based veterinary thresholds (phosphorus, proteins, Ca/P ratio, acidifiers).

Solution

Designed and shipped a RAG-powered web app using Cursor with AI-assisted development — handling both frontend (Vanilla JS, responsive UI, dark mode) and backend (Node.js serverless API, Upstash Vector, OpenAI embeddings, Groq LLaMA). Architecture and technical decisions driven by data engineering principles: single source of truth, chunking strategy, cost optimization, and retrieval quality.
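
The retrieval step of a RAG pipeline can be illustrated with a small in-memory Python sketch: rank knowledge-base chunks by cosine similarity to the query embedding and keep the top k. The project itself uses a hosted vector store and real embeddings; the two-dimensional vectors, chunk labels, and function names below are toy assumptions.

```python
import math
from typing import Sequence


def cosine_similarity(a: Sequence[float], b: Sequence[float]) -> float:
    """Cosine of the angle between two equal-length vectors (0.0 if either is all zeros)."""
    if len(a) != len(b):
        raise ValueError("vectors must have the same length")
    dot = sum(x * y for x, y in zip(a, b))
    norm = math.sqrt(sum(x * x for x in a)) * math.sqrt(sum(y * y for y in b))
    return dot / norm if norm else 0.0


def top_k(
    query: Sequence[float],
    chunks: Sequence[tuple[str, Sequence[float]]],
    k: int = 3,
) -> list[tuple[float, str]]:
    """Return the k chunks most similar to the query as (score, text) pairs, best first."""
    scored = [(cosine_similarity(query, vector), text) for text, vector in chunks]
    scored.sort(key=lambda pair: pair[0], reverse=True)
    return scored[:k]


if __name__ == "__main__":
    # Toy embeddings standing in for real model output.
    knowledge_base = [
        ("phosphorus guidance", [1.0, 0.0]),
        ("protein guidance", [0.0, 1.0]),
        ("calcium to phosphorus ratio", [0.7, 0.7]),
    ]
    for score, text in top_k([1.0, 0.1], knowledge_base, k=2):
        print(f"{score:.3f}  {text}")
```

Only the retrieved chunks are passed to the language model, which is what keeps answers grounded in the curated knowledge base.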

Results

Functional end-to-end product: URL-based product analysis, OCR label reading, semantic RAG search over a curated veterinary knowledge base, and an AI expert chat grounded in evidence-based thresholds. Deployed on Vercel, infra cost < $0.15/month.

~$0
Infrastructure cost / month
< 1.1s
End-to-end response time
50
Indexed veterinary chunks
1
Knowledge base, 0 hallucination risk

Automated Tech Watch System (Personal Project)

Personal project: automated content aggregation and analysis from email and RSS sources to build a continuous intelligence pipeline.

Python, dbt, dlt, BigQuery, Cloud Run, Terraform

Challenge

Manual tech watch was time-consuming and inconsistent across sources and formats.

Solution

Python ingestion for email + RSS, dbt transformations, dlt loading to BigQuery, and serverless deployment.
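
For the RSS side of the ingestion, a minimal standard-library Python sketch could look like this. It parses RSS 2.0 `item` elements from a string and drops duplicate links before loading; the field names, the sample feed, and the deduplication rule are assumptions for the example rather than the project's actual code.

```python
import xml.etree.ElementTree as ET


def parse_rss_items(xml_text: str) -> list[dict[str, str]]:
    """Extract title, link, and publication date from each <item> in an RSS 2.0 feed."""
    root = ET.fromstring(xml_text)
    items = []
    for item in root.iter("item"):
        items.append(
            {
                "title": (item.findtext("title") or "").strip(),
                "link": (item.findtext("link") or "").strip(),
                "published": (item.findtext("pubDate") or "").strip(),
            }
        )
    return items


def deduplicate(items: list[dict[str, str]]) -> list[dict[str, str]]:
    """Keep the first item seen for each link, preserving feed order."""
    seen = set()
    unique = []
    for item in items:
        if item["link"] in seen:
            continue
        seen.add(item["link"])
        unique.append(item)
    return unique


if __name__ == "__main__":
    # Hypothetical feed content; a real run would fetch this over HTTP.
    sample = (
        '<rss version="2.0"><channel><title>Example feed</title>'
        "<item><title>Post A</title><link>https://example.com/a</link></item>"
        "<item><title>Post A (repost)</title><link>https://example.com/a</link></item>"
        "<item><title>Post B</title><link>https://example.com/b</link></item>"
        "</channel></rss>"
    )
    for row in deduplicate(parse_rss_items(sample)):
        print(row["title"], row["link"])
```

The resulting list of dictionaries is a convenient shape to hand to a loader, since each row already has a flat, consistent schema.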

Results

Automated knowledge base creation to centralize and organize technical monitoring. Saves 5 hours per week in research time while ensuring consistent coverage across all sources.

+5h
Saved per week
1
Knowledge base created

Ready to transform your data platform?

Let's discuss how I can help you build scalable, reliable data solutions for your business.

Get in touch