Summary
Cloud & AI Infrastructure Leader with a decade of experience architecting and scaling distributed systems, AI/ML training platforms, and mission-critical production environments across multi-cloud and hybrid setups. Proven in driving 0→1 platform engineering initiatives, modernizing cloud-native ecosystems, and delivering secure, highly available, cost-efficient infrastructure for global-scale applications. Skilled in leading cross-functional teams, defining technical strategy, and solving complex distributed-systems challenges with a focus on operational excellence and real-world impact. Eager to join a dynamic team, learn from experienced professionals, and contribute to projects that make a real difference.
Key Achievement
Amazon Professional Services
DevOps Engineer | 2+ Years
Spent more than two years as a DevOps Engineer with the Amazon Professional Services team, helping enterprise clients migrate from on-premises data centers to AWS. Led multiple cloud migration projects, designed scalable infrastructure architectures, and implemented DevOps best practices to ensure seamless transitions while maintaining high availability and security standards.
Services & Training
Offering professional services in infrastructure consultancy, freelancing, mentoring, and hands-on DevOps training
Freelancing
Available for freelance projects in cloud infrastructure, DevOps automation, and platform engineering. I deliver end-to-end solutions from architecture design to implementation and optimization.
- Cloud infrastructure design & implementation
- DevOps pipeline automation
- Infrastructure as Code (IaC) development
- Cloud migration & modernization
Infrastructure Consultancy
Strategic consulting services to help organizations design, implement, and optimize their cloud-native infrastructure and DevOps practices.
- Infrastructure architecture review & design
- DevSecOps strategy & implementation
- Cost optimization & performance tuning
- Security & compliance assessment
Mentoring
One-on-one and team mentoring to accelerate your DevOps journey. Get personalized guidance on career growth, technical challenges, and best practices.
- Career guidance & skill development
- Technical problem-solving sessions
- Architecture review & feedback
- Team coaching & knowledge transfer
DevOps Training
Hands-on training programs covering the complete cloud-native stack. Practical, real-world scenarios to build production-ready skills.
- Interactive workshops & bootcamps
- Customized training programs
- Lab exercises & real-world projects
- Certification preparation support
Hands-On Training Topics
Infrastructure as Code
- Terraform
- Terragrunt
- Module development
- State management
Cloud Platforms
- AWS (EC2, S3, VPC, IAM)
- Multi-cloud strategies
- Cloud architecture patterns
- Cost optimization
Containerization
- Docker
- Kubernetes
- Container orchestration
- Helm charts
Observability Stack
- Prometheus
- Grafana
- Metrics & alerting
- Dashboard creation
CI/CD Pipelines
- GitHub Actions
- Jenkins
- ArgoCD
- GitOps workflows
Logging Solutions
- ELK Stack (Elasticsearch, Logstash, Kibana)
- Loki
- Log aggregation & analysis
- Centralized logging
Interested in working together or learning more about my services?
Get in Touch
Skills
Key Skills
DevOps Strategy & Transformation
CI/CD Pipeline Design & Management
Cloud Architecture & Engineering
Infrastructure as Code (IaC)
Containerization & Orchestration
Automation & Scripting
Monitoring & Incident Management
Security & Compliance
Version Control & Collaboration
Agile & Lean Methodologies
Cross-Functional Team Leadership
Infrastructure Scaling & Optimization
Disaster Recovery & High Availability
Client Engagement & Technical Consulting
Performance Tuning & Cost Optimization
Tools & Technology
IaC
Cloud
CI/CD
Containers
Monitoring
Languages
Version Control
SysAdmin
Container Security
AppSec
Certifications
AWS Certified Solutions Architect – Professional (SAP-C02)
AWS Certified Developer – Associate
AWS Certified Solutions Architect – Associate
AWS Certified SysOps Administrator – Associate
Microsoft Azure Fundamentals (AZ-900)
Certified Kubernetes Administrator (CKA)
Certified Kubernetes Security Specialist (CKS)
Certified Kubernetes Application Developer (CKAD)
ITIL Foundation Certified
Professional Experience
Lead AI Infrastructure Engineer
ThoughtWorks
Client: RAG application platform development
Oct 2025 – Present
Remote
- Independently architected, led, and delivered 0→1 platform engineering initiatives, turning ambiguous requirements into scalable, secure infrastructure used across engineering and research organizations for RAG workloads.
- Built automated GitHub Actions pipelines and unified observability with Prometheus, OpenTelemetry, and OpenLens.
- Designed end-to-end autoscaling, high availability, disaster recovery, and resilience workflows for a secure, cost-efficient ML inference platform.
- Built cloud abstractions using Terraform, Terragrunt, and multi-cloud architectures, enabling self-service provisioning and consistent, reproducible environments.
- Supported AI/ML research teams by deploying high-throughput training environments, identifying bottlenecks, and optimizing performance for distributed training.
- Mentored engineers and contributed to a diverse, inclusive, high-performance engineering culture with strong emphasis on technical excellence and collaboration.
- Implemented RBAC, secrets management, and operational excellence practices, reducing deployment time from hours to minutes and improving reliability, performance, and inference throughput at scale.
- Engineered a self-healing, one-click multi-cloud platform for RAG agents with automated provisioning, deployment, and monitoring.
Lead Platform Engineer
ThoughtWorks
Client: UK Betting & Gaming Company
Jun 2025 – Sep 2025
Remote
- Delivered a cloud-native gaming platform by converting ambiguous requirements into architecture, Jira epics/stories, and Confluence documentation.
- Launched a modular, scalable, production-ready platform within 4 months.
- Built secure, automated CI/CD and GitOps workflows (Nexus, HashiCorp Vault, ArgoCD) enabling declarative deployments, drift detection, and zero-downtime blue-green releases.
- Delivered production-like development and integration environments in under 2 weeks, accelerating testing cycles and game feature iteration.
Lead DevOps Engineer
AWS Professional Services (via ThoughtWorks)
Client: Germany's Leading Loyalty Provider
Oct 2023 – May 2025
Remote
- Led an on-prem-to-AWS migration, reducing infrastructure cost by 35% and increasing deployment throughput by 60%.
- Built Terraform/Terragrunt modules that are 80% reusable and refactored the monorepo into DRY, production-ready components, cutting provisioning time from hours to minutes.
- Implemented shift-left security (SonarQube, Trivy, Snyk), reducing security incidents by 45%.
- Built observability with CloudWatch and Prometheus, improving issue detection and resolution by 30%.
- Created SLA/SLO frameworks backed by AWS telemetry, strengthening reliability governance.
- Optimized Docker base images, reducing CVEs by 60%.
- Delivered global Terraform, Cloud, and DevSecOps workshops, upskilling 50+ engineers and standardizing best practices.
Senior DevOps Engineer
ThoughtWorks
Client: Global Top-3 International Bank
Apr 2021 – Aug 2023
Singapore (Remote)
- Automated Helm-based EKS/OpenShift deployments for 56 microservices, enabling zero-downtime releases.
- Achieved 100% regulatory compliance (MAS, GDPR, HIPAA) via automated security checks.
- Reduced CI/CD times by 65% through Docker, Gradle, and pipeline optimizations.
- Built umbrella CI/CD pipelines for 50+ microservices and delivered on-demand sandbox environments, reducing validation time by 40%.
- Scaled observability using Prometheus & Grafana; mentored SREs, improving incident resolution by 35%.
Cloud Engineer
Hitachi Data Systems (REAN Cloud)
Apr 2018 – May 2021
- Engineered automation for RADAR & Assess platforms using Terraform, Ruby/Rake, Docker Compose, Artifactory, and REST APIs.
- Built STIG-compliant AMIs and containerized products with Packer, Terraform, and Jenkins, enabling secure, end-to-end CI/CD.
- Automated one-click provisioning of EKS/AKS/GKE clusters with HCAP using VDSS/VDMS architecture.
- Reduced deployment time by 80% using IaC automation and Helm-based cloud-native workflows.
- Standardized cloud processes and onboarded clients through demos and consulting.
- Supported REAN-Opex initiatives, accelerating DevSecOps adoption and delivering significant cost and time efficiencies.
Education
Bachelor of Engineering (Information Technology)
D.Y. Patil College of Engineering — 2013
Awards & Recognition
Hitachi Pioneering Spirit Award – Innovation & Impact (2021)
HSBC Code Grand Winner – 1st Place (2018)
Top 10 Finalist, Hitachi Demo Jam (2020)
Best Leadership Award, D.Y.P.C.E.T. (2012)
Best Talented & Quick Learner, D.Y.P.C.E.T. (2013)
Trainer, Thoughtworks – Led DevSecOps & Security Week programs
Featured Projects
three-tier-webapp-deployment
Automated, scalable multi-tier web application deployment with IaC.
aws-vpc-api-service
RESTful API for VPC automation with best security practices.
terraform-aws-module
Reusable Terraform modules for AWS infrastructure.
python-helloworld
Simple Python starter app with CI/CD and containerization.
MediaWikiOnEKS
Enterprise-grade MediaWiki deployment on EKS.
