Skip to content
View mfonsecaoliveira's full-sized avatar

Block or report mfonsecaoliveira

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please donโ€™t include any personal information such as legal names or email addresses. Markdown is supported. This note will only be visible to you.
Report abuse

Contact GitHub support about this userโ€™s behavior. Learn more about reporting abuse.

Report abuse
mfonsecaoliveira/README.md

๐Ÿ‘‹ Hi, I'm Marcelo Fonseca

10+ Years Operations & Analytics | Power BI ยท SQL ยท Python | Ex-Team Supervisor โ†’ Data Analyst

๐Ÿ“ Dublin, Ireland | ๐Ÿ‡ง๐Ÿ‡ท Brazilian | ๐Ÿ”“ Open to Work


๐ŸŽฏ About Me

I'm not a fresh graduate learning to code.
I'm a seasoned operations professional who's been living in data for over a decade.

I've supervised collections teams, managed project analytics, presented to C-level executives, and built Power BI dashboards that drove real business decisions โ€” not just portfolio pieces.

What makes me different:

  • โœ… 10+ years operational experience across collections, sales, project management
  • โœ… Led teams that lived and died by the KPIs I tracked
  • โœ… Built production dashboards used by real businesses (not just classroom projects)
  • โœ… Improved process efficiency by 70% through structured data analysis
  • โœ… Published scientific researcher in Environmental Sciences (Unifesp)
  • โœ… Presented insights to C-level โ€” I know how to translate data into decisions

Now I'm formalizing what I've always done intuitively โ€” turning messy operational data into clear, actionable insights.


๐Ÿ’ป Tech Stack & Expertise

Core Technologies

Python SQL Power BI Excel

Cloud & Data Engineering

Google Cloud BigQuery Apache Kafka Dataflow

Python Libraries & Tools

Pandas NumPy scikit-learn Apache Beam dbt


๐Ÿ“Š Code Portfolio & Experience

Technology Experience Real-World Applications
SQL 1+ years
500+ lines in repos
โ€ข BigQuery schemas & star models
โ€ข Fraud detection queries (P99 <1s)
โ€ข Complex JOINs & window functions
โ€ข Performance optimization
Python 1+ years
400+ lines in repos
โ€ข ETL pipelines (Apache Beam)
โ€ข Data validation & quality checks
โ€ข PII tokenization (GDPR compliant)
โ€ข Pandas/NumPy data analysis
Power BI 4+ years
Production dashboards
โ€ข Built dashboards from scratch
โ€ข Executive reporting for C-level
โ€ข 70% process efficiency improvement
โ€ข DAX, Power Query, data modeling
Excel 10+ years
Advanced level
โ€ข Complex formulas & pivot tables
โ€ข VBA macros for automation
โ€ข Financial modeling & forecasting
โ€ข Operational reporting templates
GCP Academic + Projects
Production patterns
โ€ข BigQuery lakehouse architecture
โ€ข Pub/Sub, Dataflow, Cloud Storage
โ€ข Vertex AI ML pipelines
โ€ข Multi-zone medallion design

๐Ÿ’ผ Beyond GitHub

My professional experience extends far beyond public repositories:

  • SQL: 1+ years writing complex queries, stored procedures, database optimization
  • Power BI: 4+ years building production dashboards for executive decision-making
  • Python: 1+ years for data automation, ETL processes, and analysis (Pandas, NumPy)
  • Excel: 10+ years with advanced formulas, pivot tables, and VBA macros
  • Team Leadership: Supervised operations teams, presented to C-level stakeholders

GitHub repos show recent projects. Enterprise work was in private repositories.


๐Ÿ’ผ Professional Highlights

Operations & Guest Services Coordinator | Mc enaney group (Dublin)

May 2023 - Present

  • Designed operational reporting templates in Excel, standardizing performance tracking
  • Analyzed guest flow & occupancy patterns, identifying inefficiencies
  • Translated raw operational data into actionable insights for management

Project Analyst | Full Time Soluรงรตes (Sรฃo Paulo)

July 2022 - April 2023

  • Built Power BI dashboards from scratch for executive reporting
  • Presented insights directly to leadership, bridging data and strategy
  • Drove 70% improvement in post-implementation efficiency via data analysis

Credit Control & Commercial Supervisor | Full Time Soluรงรตes

February 2018 - June 2022

  • Monitored operational & financial KPIs across collections and sales teams
  • Developed management reports improving decision-making efficiency
  • Led customer segmentation analysis to enhance targeting strategies

Published Researcher | Unifesp (Environmental Sciences)

2012 - 2017

  • Published scientific research project analyzing environmental data
  • Applied statistical methods and geospatial analysis (ArcGIS, AutoCAD)
  • Foundation in evidence-based decision making

๐Ÿš€ Featured Projects

Production-Grade Medallion Lakehouse for Cross-Border Fintech

Designed a multi-zone data platform for a fictional Ireland-Brazil fintech processing 55,000+ transactions/month:

  • โšก 99.9% SLA compliance with P99 <1s fraud detection latency
  • ๐Ÿ”’ GDPR & PCI-DSS compliant with column-level encryption
  • ๐Ÿ’ฐ 60% TCO reduction vs legacy ERP systems
  • ๐Ÿ—๏ธ Five-zone medallion architecture: Pub/Sub โ†’ GCS โ†’ Dataflow โ†’ BigQuery โ†’ Looker

Tech Stack: Google Cloud Platform, BigQuery, Apache Kafka, Dataflow, Vertex AI, dbt, Great Expectations

Key Innovations:

  • Hybrid ingestion (streaming + batch + micro-batch)
  • Kimball star schema with SCD Type 2 customer modeling
  • Real-time fraud detection via BigQuery ML (AUC >0.85)
  • Federated governance with DataHub lineage tracking

Code Highlights:

  • ๐Ÿ“„ 500+ lines SQL: Star schema design, fraud detection queries
  • ๐Ÿ 400+ lines Python: ETL pipelines, data validation, PII tokenization
  • ๐Ÿ“Š Complete documentation: Technical specs, architecture diagrams

๐Ÿ“„ Read Full Technical Documentation


An End-to-End Power BI Investigation That Builds an Argument, Not Just Charts

A narrative-driven diagnosis of why a call center closed 2020 with customers in distress โ€” analyzing 13,055 interactions across 4 channels, 6 contact reasons, and the full United States:

  • ๐Ÿšจ NPS โˆ’69 with 51% negative sentiment surfaced as a systemic, not local, problem
  • ๐Ÿ” Five-hypothesis investigation (channel, reason, geography, seasonality) โ€” each ruled out with evidence
  • ๐Ÿงฎ Null-safe DAX NPS measure โ€” corrected a โˆ’78 โ†’ โˆ’69 distortion caused by blank surveys counted as detractors
  • ๐ŸŽฏ Prioritized recommendations: fix efficiency (61% of calls >20 min) + deflect 77% money-related demand

Tech Stack: Power BI, DAX, Power Query, dimensional (star schema) modeling

Key Highlights:

  • Drill-through detail pages by U.S. state and custom sidebar navigation
  • Honest visualization: zero-anchored axes and CSAT shown on its true 0โ€“7 scale
  • Data storytelling structured as problem โ†’ eliminated suspects โ†’ root cause โ†’ action
Cover Overview ("The Alarm")
Cover Overview
Root Cause Recommendations
Root Cause Action

๐Ÿ“„ Explore the Full Report


Evidence-Based Visual Analytics on 8,494 Retail Products โ€” Graded 88/100

A visual analytics study testing whether the beauty industry's price-equals-quality promise holds up, across a real Sephora catalogue of 8,494 products from 304 brands:

  • ๐ŸŽฏ No priceโ€“quality link: a $15 moisturiser can out-rate a $200 cream โ€” value beats cost
  • ๐Ÿงฎ Custom metrics built from scratch: Value Score (rating per dollar) and Review Density (reviews per love)
  • ๐Ÿ“Š 10 business questions โ†’ 10 purpose-built charts, each chart type matched to the data
  • ๐ŸŽจ Grounded in visualization theory: Cleveland & McGill (length over angle), Tufte (data-ink ratio), colourblind-safe palette

Tech Stack: Microsoft Excel, VLOOKUP, IF logic, PivotTables, star-schema modeling, Kaggle dataset

Key Highlights:

  • Star-schema workbook โ€” fact table plus a price-range dimension over 8,494 products with 8 calculated fields
  • Budget products dominate rating-per-dollar; limited editions win loves but not ratings
  • Viral products show broad but shallow engagement โ€” many loves, few written reviews
Top Brands by Loves Average Rating by Price Bucket
Top Brands by Loves Price vs Rating
Category Distribution Best Value (Rating per Dollar)
Category Distribution Best Value

๐Ÿ“„ Explore the Full Project


A BigQuery Analytics Consulting Study, From Star Schema to Revenue Strategy โ€” Graded 88/100

Acting as a Senior Data Consultant on a retail client's BigQuery warehouse, turning six SQL-driven insights into five costed revenue strategies:

  • ๐Ÿ›๏ธ Four-layer medallion architecture (raw โ†’ clean โ†’ warehouse โ†’ data mart) feeding a governed vw_sales_summary single source of truth
  • โญ Retail star schema: fact_sales with seven dimensions, built for OLAP aggregation
  • ๐Ÿงฎ RFM segmentation in BigQuery SQL using NTILE(5) scoring and CASE logic to split customers into Champions, At Risk and Lost
  • ๐Ÿ’ถ Insight-to-revenue roadmap: five strategies with projected uplift (5โ€“20%) and effort ratings

Tech Stack: Google BigQuery, SQL (window functions, CTEs, NTILE), Star Schema, Medallion Architecture, Looker Studio

Key Highlights:

  • Six analytical insights powered by RANK(), NTILE(5), DATE_DIFF and CTE-based queries
  • Revenue-concentration analysis (top 1% of customers โ‰ˆ 28% of revenue) framed as a retention opportunity
  • Future-state design: BigQuery ML (churn, ARIMA_PLUS forecasting), streaming and a semantic layer
Warehouse Architecture Retail Star Schema
Architecture Star Schema
Revenue Growth Strategy Map Executive Dashboard Mockup
Strategy Map Dashboard Mockup

๐Ÿ“„ Explore the Full Project


Applied Machine Learning โ€” Graded 78/100

A dual binary classification project predicting vaccine uptake from the 2009 U.S. National H1N1 Flu Survey (26,707 respondents, 35 features) to help public health teams target outreach:

  • ๐ŸŽฏ ROC-AUC up to 0.86 (Random Forest, H1N1) following a full CRISP-DM pipeline
  • โš™๏ธ GridSearchCV tuning across C, penalty, and solver with 5-fold stratified cross-validation
  • ๐Ÿ’ถ Business-impact framing: false positive < โ‚ฌ10 vs false negative > โ‚ฌ1,000 โ†’ recommends lowering the threshold to 0.3โ€“0.4 to maximize recall
  • ๐Ÿ“Š 10 EDA visualizations; doctor recommendation identified as a top predictor for both targets

Tech Stack: Python, scikit-learn, pandas, NumPy, matplotlib, seaborn, missingno

Key Highlights:

  • Median/mode imputation, one-hot encoding, StandardScaler, stratified 60/20/20 split
  • Logistic Regression vs Random Forest comparison across both targets
  • Conceptual deployment design: REST API + EHR integration with weekly retraining for concept drift

๐Ÿ“„ Read the Full Analysis


Business Intelligence & Data Warehousing โ€” Graded 85/100

Design and critical evaluation of an end-to-end pipeline that moves a medium-sized retail organization from a strained transactional MySQL system toward a governed analytical ecosystem:

  • ๐Ÿ›๏ธ Six-layer architecture mapped onto the Medallion pattern (Bronze โ†’ Silver โ†’ Gold)
  • โญ Kimball star schema with Type 2 Slowly Changing Dimensions for historical tracking
  • ๐Ÿ›ก๏ธ Layered data quality framework: duplicate detection, null validation, referential integrity, domain checks
  • โš–๏ธ Critical evaluation of real trade-offs: batch vs streaming, query performance vs schema flexibility, cost vs performance

Tech Stack / Concepts: Data Warehousing (Kimball & Inmon), Medallion Architecture, Star Schema, OLTP vs OLAP, SQL, MySQL

Key Highlights:

  • Raw vs clean staging layers enabling validation checkpoints and pipeline restart
  • Metadata and data lineage treated as first-class governance concerns
  • "Near benchmark quality" โ€” one of the strongest submissions per lecturer feedback

๐Ÿ“„ Read the Full Report


๐Ÿš€ GitHub Activity

Marcelo's GitHub stats

Profile Views


๐ŸŽ“ Education & Certifications

Higher National Diploma (HND) in Data Analytics

City Colleges Dublin | 2025 - 2026

BSc Environmental Sciences

Unifesp - Universidade Federal de Sรฃo Paulo | 2012 - 2017
Published scientific research project

Certifications

  • โœ… Microsoft Power BI for Business Intelligence and Data Science
  • โœ… General Data Protection Law (GDPR/LGPD)
  • โœ… Lean Six Sigma White Belt
  • โœ… Integrating AI with Other Skills

๐Ÿ’ก What I Bring to the Table

Not just a "data analyst":

  • ๐ŸŽฏ Business-first mindset: I don't just query data โ€” I solve business problems
  • ๐Ÿ‘ฅ Team leadership: Supervised teams, coached performance, drove results
  • ๐Ÿ“Š C-level communication: Presented insights to executives who don't speak SQL
  • ๐Ÿ”„ Process improvement: 70% efficiency gains through data-driven redesign
  • ๐ŸŒ Cross-cultural: Worked across Brazil, Ireland, hospitality, finance, environmental sectors
  • ๐Ÿ“ˆ Real production experience: Dashboards that ran businesses, not just portfolios

Why hire me over a traditional graduate?

  1. I've managed the teams that use the reports I build
  2. I've defended the numbers in boardroom presentations
  3. I know what "good data" looks like because I've seen what bad data costs
  4. I understand operations โ€” data isn't abstract to me; it's how businesses run

๐Ÿ” What I'm Looking For

I'm seeking Data Analyst, Business Analyst, or BI Analyst roles where I can:

  • โœ… Turn operational chaos into clean, actionable insights
  • โœ… Build dashboards that actually drive decisions (not just look pretty)
  • โœ… Bridge the gap between technical teams and business stakeholders
  • โœ… Apply 10+ years of operational intuition to data problems

Location: Dublin, Ireland (or remote within EU)
Work Authorization: Stamp 1G (full work rights in Ireland)
Languages: Portuguese (native), English (fluent), Spanish (working proficiency)


๐Ÿ“ซ Let's Connect

LinkedIn Email GitHub

๐Ÿ“ง Email: marcelo.dafonsecaoliveira@gmail.com
๐Ÿ“ฑ Phone: +353 83 834 2102
๐Ÿ”— Portfolio: github.com/mfonsecaoliveira


๐Ÿ† Career Highlights Timeline

2008-2010  โ”‚ Administrative Assistant โ†’ Air Ticketing Analyst (GOL Airlines)
2013-2016  โ”‚ President, Junior Enterprise (Environmental Sciences)
2016-2017  โ”‚ Environmental Data Analyst (Waterloo Brasil, Geojรก)
2017-2018  โ”‚ Account Receivable Analyst (Full Time Soluรงรตes)
2018-2022  โ”‚ Credit Control & Commercial Supervisor
2022-2023  โ”‚ Project Analyst (70% efficiency improvement)
2023-Now   โ”‚ Operations Coordinator (Dublin) + HND Data Analytics
2026       โ”‚ โ†’ Data Analyst (YOU?)

๐Ÿ’ฌ A Few Words

"Most data analysts can build a dashboard. Fewer know what it feels like to stand in front of a management team and defend the numbers โ€” or to lead a team that lives and dies by the KPIs on that screen.

I spent over a decade doing exactly that. Now I'm formalizing what I've always done intuitively: turning data into decisions that matter."


โญ Interested in working together?

I'm actively seeking Data Analyst opportunities in Dublin or remote within the EU.

๐Ÿ“ง Email Me | ๐Ÿ’ผ LinkedIn | ๐Ÿ“‚ View Projects


Last Updated: June 2026

Popular repositories Loading

  1. Brarish-Data-Architecture Brarish-Data-Architecture Public

    This project presents a **production-grade, multi-zone data architecture** designed for *Brarish* โ€” a fictional 130-employee Fintech SME headquartered in Sligo, Ireland, with operations in Sรฃo Paulโ€ฆ

    Python 3

  2. mfonsecaoliveira mfonsecaoliveira Public

    2

  3. -Designing-and-Evaluating-a-Modern-Data-Pipeline-for-Business-Intelligence -Designing-and-Evaluating-a-Modern-Data-Pipeline-for-Business-Intelligence Public

    Academic project โ€” Business Intelligence & Data Warehousing City College Dublin | Data Analytics Program | 2026

    1

  4. -Flu-Shot-Learning-Predicting-H1N1-and-Seasonal-Flu-Vaccines -Flu-Shot-Learning-Predicting-H1N1-and-Seasonal-Flu-Vaccines Public

    Jupyter Notebook 1

  5. Customer-Experience-Diagnosis-CallCenter Customer-Experience-Diagnosis-CallCenter Public

    Power BI dashboard that doesn't just answer questions, it builds a data-driven argument: problem โ†’ root cause โ†’ recommendation.

    1

  6. Sephora-Beauty-Products-Data-Visualization-in-Excel Sephora-Beauty-Products-Data-Visualization-in-Excel Public

    The beauty industry sells luxury and exclusivity hard. I wanted to check if the data backs that up.