RevivCart – E-Commerce Marketplace Revenue & Operations Analysis

Published in Data Analytics Project, 2026

Context

Conducted an end-to-end analytics project on RevivCart, a simulated B2C e-commerce marketplace operating across Electronics, Fashion, and Home & Living categories. The platform was experiencing margin pressure despite 12% YoY revenue growth, prompting a full diagnostic across revenue performance, customer retention, category health, and operational efficiency.

Dataset: Olist Brazilian E-Commerce Public Dataset (Kaggle) — 100K+ real orders across 8 relational tables.

Problem

Despite consistent GMV growth, profitability was declining. Leadership suspected three root causes: high cancellation rates in specific categories, over-reliance on discounts eroding AOV, and poor delivery performance driving customer churn. The analysis was scoped to answer seven targeted business questions across Commercial, Category Management, and Operations stakeholder groups.

Actions

Data Preparation

  • Audited 8 raw CSV tables for NULL patterns, orphan records, and data type inconsistencies before analysis
  • Built an automated Python pipeline (pandas, SQLAlchemy) to clean, transform, and load 7 normalized tables into PostgreSQL
  • Designed a star schema data model (1 fact table, 4 dimension tables) following data warehousing best practices

SQL Analysis — 7 Targeted Business Queries

  • Calculated monthly GMV and AOV trends using date aggregation and status filtering to identify seasonality patterns
  • Ranked top 10 revenue-generating categories with average review scores using multi-table JOINs and LEFT JOIN for optional review data
  • Measured cancellation rate by category using conditional aggregation (CASE WHEN) and HAVING to filter statistically significant groups
  • Segmented customer purchase frequency using subqueries to quantify repeat purchase rate and retention opportunity
  • Correlated delivery delay buckets with review scores using CASE segmentation and date arithmetic
  • Identified top 3 sellers per category using RANK() OVER (PARTITION BY category) window function
  • Analyzed payment method mix and AOV differential using single-table aggregation and SUM() OVER() window function

Power BI Dashboard — 3 Report Pages

  • Built star schema model in Power BI with custom Date Table and 7 DAX measures including dynamic cancellation rate and cross-table review score calculations using RELATEDTABLE
  • Delivered Executive Overview page: GMV trend line chart, 4 KPI cards, order status donut chart, and synced year/status slicers
  • Delivered Category Performance page: revenue ranking, cancellation rate comparison, and scatter plot quadrant analysis (Star/Hidden Gem/At Risk/Problem)
  • Delivered Delivery & Experience page: delivery bucket distribution, review score histogram, and delay-to-review correlation line chart

Outcome

  • Identified that 97% of 93K customers are one-time buyers — quantified retention opportunity at ~R$300K incremental GMV from a 2% improvement in repeat purchase rate
  • Diagnosed delivery delays exceeding 3 days as the primary driver of poor customer satisfaction, with affected orders scoring 1.86★ vs 4.29★ for early deliveries — a 2.43-point drop
  • Surfaced seller concentration risk in the computers category where the top seller accounts for 83% of category revenue, representing a critical single point of failure
  • Revealed voucher inefficiency: voucher AOV (R$66) is 60% below credit card AOV (R$163) due to absence of minimum order value enforcement
  • Segmented categories into four strategic quadrants, identifying health_beauty as the highest-ROI investment opportunity and office_furniture as an at-risk category requiring seller quality intervention

Key Insights

#InsightRecommended Action
197% one-time buyers → R$300K retention opportunityLaunch post-purchase loyalty program
2health_beauty: highest GMV + highest review scoreIncrease marketing budget allocation
3Delivery delays 3+ days → review score drops 2.43 pointsEnforce hard SLA with auto-compensation
4Top seller = 83% of computers category revenueDiversify seller pool to reduce concentration risk
5Voucher AOV 60% lower than credit cardSet R$100 minimum order value for voucher redemption
6office_furniture: high GMV but lowest review score in top 10Audit seller quality before scaling

Skills Demonstrated

CategorySkills
SQLMulti-table JOIN, LEFT JOIN, GROUP BY, HAVING, conditional aggregation, subquery, window functions (RANK, SUM OVER)
Pythonpandas, SQLAlchemy, os.path, data type conversion, ETL pipeline
Power BIStar schema modeling, DAX measures, RELATEDTABLE, Date Table, cross-page slicers
AnalyticsKPI definition, data auditing, cohort thinking, business impact quantification
DomainE-commerce KPIs (GMV, AOV, cancellation rate, NPS proxy), marketplace dynamics

Dashboard Preview

Dashboard Overview 1

Dashboard Overview 2

Dashboard Overview 3