About Me

As a Data Engineering Architect, I specialize in migrating and modernizing legacy data stacks to cloud-native solutions, turning complex data landscapes into agile, scalable, high-performance environments.

I design and implement advanced data solutions that address intricate technical challenges and yield actionable business insights, supporting innovation and strategic decision-making.

I focus on delivering robust, secure, data-intensive solutions, primarily on Google Cloud Platform (GCP), and draw on practical experience with AWS and Azure to work effectively in multi-cloud scenarios. I use Google Cloud's core data services (such as BigQuery, Dataflow, Dataproc, and Pub/Sub) to deliver high-impact results. This profile is built to integrate with GitHub Pages.

Skills

Data Engineering & Big Data:

  • Apache Spark
  • Hadoop
  • Apache Kafka
  • Apache Hive
  • Apache Sqoop
  • SQL
  • Python
  • Scala
  • Data Modeling
  • ETL/ELT Design
  • Data Warehousing
  • Performance Optimization
  • Data Architecture
  • Legacy System Modernization
  • Cloud Migration Strategies
  • Solution Design & Prototyping

Cloud Platforms & DevOps:

Primary Expertise: Google Cloud Platform (GCP)

  • Google BigQuery
  • Google Dataflow
  • Google Dataproc
  • Google Pub/Sub
  • Google Cloud Storage
  • Google Cloud Composer

Working Experience with Other Cloud Platforms:

  • AWS
  • Microsoft Azure

DevOps, CI/CD & Automation:

  • GitHub Actions
  • Terraform
  • Docker
  • Kubernetes
  • Azure DevOps
  • CI/CD Principles & Tools
  • Infrastructure-as-Code (IaC)

AI Ops & Monitoring:

  • Prometheus (Familiar)
  • Grafana (Familiar)
  • Cloud Monitoring Tools
  • Native Cloud Logging/Monitoring
  • Proactive Anomaly Detection
  • Automated Alerting
  • Recovery Strategies

Blockchain Technology:

  • Core Blockchain Principles
  • Blockchain Solution Analysis
  • Conceptualizing Blockchain Solutions

Databases:

  • NoSQL
  • MongoDB
  • Cassandra
  • Relational Databases
  • PostgreSQL
  • MySQL
  • Oracle
  • SQL Server

Other:

  • Power BI
  • Data Visualization & Reporting
  • Agile Methodologies
  • Scrum Methodologies

Experience

Jan 2025 – Present

Associate Data Architect | NRG Energy

  • Led AWS to GCP Databricks migration, optimizing storage costs by 20%.
  • Automated data validation using Python & Shell, reducing manual work by 80%.
  • Designed secure cross-cloud networking with VPC Peering & Private Google Access.

Sep 2022 – Jan 2025

Lead Data Engineer | HCA Healthcare

  • Migrated 100+ TB from Cloudera & Teradata to GCP.
  • Built real-time streaming with Kafka & Pub/Sub, improving accessibility.
  • Optimized ETL performance by 30% using PySpark & cloud efficiencies.

Apr 2019 – Aug 2022

Senior Data Engineer | Charles Schwab

  • Migrated legacy Hadoop & Teradata to GCP, improving speeds by 25%.
  • Automated PySpark pipelines, supporting 30M+ daily transactions.

Aug 2018 – Mar 2019

Data Engineer | DSO MCS Group

  • Developed cloud-native data solutions with Talend ingestion pipelines.

Jun 2015 – Jul 2018

ETL Developer | InnoMinds

  • Designed ETL workflows, improving data accuracy and reporting.

Key Projects

AWS to GCP Migration | NRG Energy

  • Migrated petabyte-scale data from S3 to GCS with automated validation.

Enterprise Data Modernization | HCA Healthcare

  • Migrated 50+ data sources to BigQuery, reducing costs by 30%.
  • Built serverless ETL pipelines, reducing processing time by 40%.

On-Prem to Cloud Migration | Charles Schwab

  • Optimized ETL workflows with PySpark & Talend, cutting times by 40%.

Education & Certifications

Education:

B.Tech – Jawaharlal Nehru Technological University (2015)

Certifications:

  • Google Cloud Certified - Professional Data Engineer
  • Talend Data Explorer Certification
  • CCA Spark and Hadoop Developer
  • Advanced Certification in Blockchain and Distributed Ledger Technologies - IIIT Hyderabad
  • Globee Awards Judge

Connect with Me