
Harsh
Pipalia
Data Engineer
Passionate and detail-oriented Data Engineer with hands-on experience in building data pipelines, warehouses, and analytics solutions. Proficient in SQL, Python, and big data frameworks including Hadoop, Kafka, NiFi, and PySpark. Skilled in cloud platforms (Azure, AWS, GCP) and BI tools such as Tableau and Power BI, with a proven ability to transform raw data into actionable insights. Experienced in developing ETL workflows, data modeling, and advanced SQL reporting to support business decision-making. Seeking to leverage technical expertise and problem-solving skills to design scalable data solutions and drive data-driven success.
Professional experiences
Education
Skills
Certifications
Projects
• Built a modern data warehouse integrating ERP & CRM CSV data (60,000+) using Medallion Architecture (Bronze-
Silver-Gold) in SQL Server.
• Designed ETL workflows for data cleaning, standardization, normalization, and enrichment, producing a Star
Schema for analytics.
• Developed SQL-based reports to analyze customer behavior, product performance, and sales trends, enabling
data-driven decision-making.
• Developed a comprehensive suite of SQL scripts for data exploration, segmentation, and performance analysis
across sales, products and customers.
• Built advanced queries leveraging window functions, CTEs, and conditional logic to deliver KPIs sucha s customer
segmentation, product performance tiers, and category contributions.
• Designed reusable analytical views enabling insights into sales trends, cumulative growth, and year-over-year com-
parisons.
• Implemented Apriori and FP-Growth algorithms to identify product associations, enhancing cross-selling opportunities.
• Analyzed 50,000+ transactional records to uncover purchasing patterns, optimizing inventory management.
• Utilized association rule mining techniques to inform strategic decisions, improving customer targeting by 2%.
Languages
🇺🇸
English (Fluent)
🇮🇳
Hindi (Fluent)
🇩🇪