Company Description
California Fitness & Yoga, established in 2007, is Vietnam's first and largest international fitness company. It is more than just a gym; it is a hub of active lifestyles, inspiring joy and vitality in communities. Operated under California Wellness Group, it boasts 50 locations across Vietnam, including California Fitness & Yoga, California Centuryon, Hypoxi, and Vita Clinic. Employing over 3,000 team members, the company is committed to providing a flexible work environment, regular training programs, competitive compensation, and ample career growth opportunities.
Role Description
This is a full-time, on-site role for a Team Lead, DevOps Engineer, based in Ho Chi Minh City, Vietnam. The DevOps Engineer Team Lead role is responsible for designing, building and maintaining scalable and secure infrastructure systems that empower development and operations teams to deliver software rapidly and reliably. As a Team Lead, you will drive best practices for Infrastructure as Code (IaC), CI/CDautomation, and cloud reliability, mentoring peers, reviewing code and contributing to long-term platform strategy.
Key Responsibilities
30% Infrastructure Architecture & Automation
- Deep expertise in infrastructure architecture, automation, and operational excellence.
- Architect, implement, and maintain highly available, scalable cloud infrastructure using FPT Smart Cloud (FCI), TPCloud, Viettel Dedicated Cloud (vDC), AWS, Azure, or GCP.
- Develop reusable infrastructure modules with Terraform, CloudFormation, or Pulumi to ensure consistency and compliance.
- Automate environment provisioning, configuration, and system patching using Ansible, Chef, or Puppet.
30% Continuous Integration & Delivery
- Design and optimize robust CI/CD pipelines (GitHub Actions, Jenkins, GitLab CI, or ArgoCD) for faster, safer deployments.
- Integrate testing, security scanning, and automated rollbacks into deployment workflows.
10% Observability & Reliability
- Implement advanced monitoring and alerting with Prometheus, Grafana, Datadog, or ELK/EFKstacks.
- Lead incident response and root cause analysis to improve platform resilience and uptime.
- Drive SRE best practices error budgets, SLAs/SLIs/SLOs, and automated remediation.
15% Security & Governance
- Embed security controls into IaC and CI/CD workflows(e.g., policy as code, secrets management, identity controls).
- Ensure infrastructure meets compliance standards internal governance.
15% Leadership & Collaboration
- Mentor and guide junior DevOps engineers on best practices in automation and system design.
- Collaborate closely with software, QA, and security teams to streamline release and operational processes.
- Evaluate and implement new tools, frameworks, and cloud-native technologies to enhance capabilities of platform and infrastructure.
Education | Experience
- 5+ years of experience as a DevOps Engineer, SRE, or Infrastructure Engineer, with increasing scope and leadership.
- Strong proficiency in CI/CD tools and deployment automation.
- Advanced scripting skills in Python, Bash, Groovy, Go.
- Proven expertise with Infrastructure as Code (Terraform, CloudFormation, or Pulumi).
- Hands-on experience with Kubernetes, Docker, and Helm for container orchestration.
- Deep understanding of cloud services (FCI and TPCloud preferred, AWS; Azure/GCP also valued).
- Experience with monitoring and logging platforms for large-scale distributed systems.
- Solid grasp of networking, security, and Linux systems administration.
- Good command of English and Vietnamese.
- Excellent communication with both technical and non-technical teams.
- Time management & prioritization under pressure.