Professional Experience
DevOps and Site Reliability Engineer
CraftSchoolship · San Jose, Remote · Oct 2023 - Present
Key Highlights
- Cut AWS spend by 45% by applying FinOps practices including, rightsizing, cleanup, and usage reviews
- Reduced MTTR by 50% with Grafana, Prometheus, Loki, runbooks, and on-call routing
- Built a deployment platform for 20+ customer apps across 3 AWS regions
- Automated 70% of recurring operations with Python, Shell scripts, and GitHub Actions
View all responsibilities
- Built AWS infrastructure with Terraform and CloudFormation, including multi-region EKS clusters provisioned in under 10 minutes, IAM, networking, storage, and Kubernetes add-ons.
- Migrated workloads across AWS accounts and regions with planned cutovers and minimal downtime.
- Built GitHub Actions pipelines for backend services and mobile apps, covering lint, tests, builds, image publishing, and deploys in under 5 minutes; automated recurring operations with Python and Shell scripts.
- Shipped a deployment platform with GitHub Actions, Helm, Argo CD, React, and YAML config for 20+ customer apps in 3 AWS regions.
- Operated production Kubernetes clusters, handled incidents, joined the on-call rotation, and wrote runbooks, SOPs, and postmortem notes.
- Implemented the observability stack with Prometheus, Loki, Grafana, external HTTPS probes, Istio, Jaeger, and Kiali; reduced MTTR by 50%.
- Created Grafana dashboards for Kubernetes health, EC2 health, application debugging, ingress traffic, capacity, and alert review.
- Tuned PostgreSQL clusters to reduce resource usage by 35%, and automated backup and restore for Kubernetes resources and EBS/EFS persistent volumes.
- Centralized authentication with Keycloak, handled AWS IAM administration, and reduced employee onboarding time by 70%.
- Built REST APIs in Python and Go for payments, user management, analytics, and a live quiz platform with OpenAI quiz generation, Stripe subscriptions, LTI 1.3 support, and 200+ MAU.
Software Developer Intern
Box2Home · Sousse, Hybrid · Feb 2023 - June 2023
Key Highlights
- Built an internal data platform that replaced a paid third-party tool
- Exposed 100+ SQL tables through NestJS, React, Prisma, and role-based access
View all responsibilities
- Built a NestJS, React, Material UI, Prisma, and PostgreSQL platform for browsing internal business data and debugging production issues.
- Added role-based access control for admin, developer, and viewer roles.
- Logged queries and data operations with user and timestamp metadata.
- Deployed the frontend and backend with Docker on AWS ECS.
- Configured CloudWatch logs for monitoring and troubleshooting.
IT Operations Intern
DRÄXLMAIER Group · Sousse, On-site · Sept 2022 - Oct 2022
Key Highlights
- Built a Python CLI for switch diagnostics and Excel reporting
- Supported IT asset setup, activation, and warehouse interventions
View all responsibilities
- Built a Python CLI that pulls data from HP network switches and exports Excel reports for local network troubleshooting.
- Supported warehouse interventions, including IT asset setup and activation across sites.
- Installed network cabling and managed IT assets.