Senior Cloud Engineer

Building cloud platforms that scale.

AWS Community Builder – Containers. 9+ years architecting large-scale AWS platforms at Expedia Group — 430+ Kubernetes clusters, 9,000+ microservices, and AI/ML infrastructure in production.

430+ Kubernetes clusters
9K+ microservices
9+ years in cloud
≤5 min RTO achieved
Get in touch → LinkedIn, GitHub, Blog
AWS Community Builder – Containers, NVIDIA-Certified AI Infrastructure, AWS Solutions Architect Associate, HashiCorp Terraform Associate
Expertise

Technical Skills

Containers & Kubernetes
EKS, Kubernetes, Docker, Helm, Karpenter, ArgoCD, ECS / Fargate
Cloud Platforms
AWS, EC2, S3, RDS / Aurora, Lambda, VPC, IAM
Infrastructure as Code
Terraform, CloudFormation, GitOps, Immutable Infra
Observability
Datadog, Splunk, CloudWatch, Incident Response
AI / ML Infrastructure
GPU Scheduling, Inference Platforms, Distributed Compute, GenAI Workloads
Languages & Automation
Python, Go, Bash, YAML / JSON
Serverless & Events
Lambda, API Gateway, EventBridge, SQS / SNS, Step Functions
Migration & DR
AWS MGN, AWS DMS, Elastic DR, RPO / RTO Planning
🏅 AWS Community Builder – Containers (2026)
🟢 NVIDIA-Certified: AI Infrastructure & Operations (2025)
☁️ AWS Certified Solutions Architect – Associate (Active)
🔷 HashiCorp Certified: Terraform Associate (Active)
Career

Experience

Senior Cloud Engineer, Expedia Group (Mar 2024 – Present)
  • Led GitOps transformation from Kubernetes Federation to ArgoCD across 430+ clusters, cutting deployment failures by 25%.
  • Architected Karpenter-based Kubernetes platforms supporting GenAI and AI/ML workloads at scale.
  • Built GPU-enabled Kubernetes environments for inference workloads with distributed compute optimization.
  • Reduced cloud spend by ~20% through capacity planning and optimization across multiple business units.
  • Designed monitoring architectures using Datadog, Redis, and Splunk to improve incident response.
Cloud Engineer III, Expedia Group (Mar 2021 – Mar 2024)
  • Built and operated a multi-cluster Kubernetes platform supporting 430+ clusters and 9K+ microservices across multiple AWS accounts.
  • Led migration of 150+ on-prem servers to AWS using MGN, achieving RTO ≤ 5 min and RPO ≤ 1 min.
  • Designed core networking, IAM, and security foundations for multi-account AWS environments.
  • Migrated production workloads from ECS to EKS, improving scalability and cloud-native standardization.
Cloud Engineer II, Hotwire.com – Expedia Group (Dec 2018 – Mar 2021)
  • Owned architecture decisions across EKS, ECS Fargate, and Lambda — evaluating scalability, cost, and security trade-offs.
  • Led migration planning workshops defining modernization strategies for legacy workloads.
  • Led incident response and platform reliability efforts to maintain high availability.
Cloud Engineer, Hotwire.com – Expedia Group (Oct 2017 – Dec 2018)
  • Managed EC2, S3, RDS, Lambda, and Elasticsearch; optimized resource utilization and reliability.
  • Implemented network ACLs to strengthen security and directed the work of junior engineers.
Hashnode

Latest Writing

Kubernetes
Running Karpenter at Scale: Lessons from 430+ Clusters
Production insights on dynamic node provisioning, autoscaling strategies, and cost control in multi-cluster environments.
Read on Hashnode →
AWS Migration
How We Migrated 150 Servers to AWS with RTO ≤ 5 Minutes
A deep dive into using AWS MGN and AWS Application Discovery Service (ADS) for large-scale on-prem-to-cloud migration with tight recovery objectives.
Read on Hashnode →
AI Infrastructure
GPU Scheduling on Kubernetes for AI/ML Inference Workloads
Practical patterns for running GPU-accelerated inference on EKS — scheduling, resource limits, and cost optimization.
Read on Hashnode →
View all posts on Hashnode →
Open Source

GitHub Projects

eks-karpenter-bootstrap
Terraform modules for bootstrapping production-ready EKS clusters with Karpenter, node pools, and autoscaling best practices.
HCL / Terraform
multi-account-aws-foundations
Reference architecture for multi-account AWS environments — VPC, IAM, Security Hub, and CloudTrail using CloudFormation StackSets.
Python / CloudFormation
gitops-argocd-patterns
GitOps deployment patterns using ArgoCD — app-of-apps, environment promotion, and multi-cluster sync strategies.
YAML / Shell
gpu-k8s-inference
Sample manifests and Helm charts for deploying GPU-accelerated AI/ML inference workloads on EKS with NVIDIA device plugin.
Go / Helm
View all on GitHub →
Let's connect

Get in Touch

Open to the right opportunity.

Whether you're looking for a cloud architect, want to collaborate on an open-source project, or just want to talk Kubernetes — feel free to reach out.