Shiva Manhar

Senior / Staff Data Engineer • Databricks • Spark • Cloud Data Platforms

Email Resume GitHub Kaggle Medium LinkedIn

About Me

I am a data engineer with over 12 years of experience building scalable, reliable, and cost-efficient data platforms. My primary focus is on Databricks-based lakehouse architectures using Apache Spark, Delta Lake, and cloud-native services on AWS and Azure.

I enjoy solving complex data problems, optimizing large-scale pipelines, and designing systems that support analytics and business decision-making. I prefer hands-on individual contributor roles where I can own architecture and implementation end to end.

What I Work On

Typical areas I work on day to day:

Batch and streaming data pipelines using Spark and Databricks
Lakehouse architecture (Bronze / Silver / Gold)
Performance tuning, data skew handling, and cost optimization
Cloud data platforms on AWS and Azure
Data modeling and analytics enablement

Projects

Selected projects that reflect my work as a senior data engineer:

Real-Time Lakehouse Pipeline

Kafka → Databricks Structured Streaming pipeline implementing Bronze–Silver–Gold layers with Delta Lake, schema evolution, and checkpointing for reliable near real-time analytics.

Case study (coming soon)