About Plus8soft
Plus8soft is a global engineering company providing outstaffing and outsourcing services for fast-growing startups and established international companies. We focus on delivering high-quality engineering talent across Web, Mobile, Cloud, AI/ML, and DevOps, with a strong presence in the US and global markets.
We're looking for an experienced Site Reliability Engineer to join a team responsible for stabilizing and evolving a business-critical onboarding flow within a global trading company.
This system is operationally important but currently unstable. The goal of this role is to bring structure, reliability, and production confidence.
Responsibilities
- Take ownership of AWS-based infrastructure (EKS, networking, CI/CD)
- Investigate recurring incidents and eliminate root causes
- Improve production stability, deployment reliability, and observability
- Build meaningful SLIs/SLOs and structured monitoring
- Strengthen incident management and on-call processes
- Improve infrastructure security posture (IAM, secrets management, encryption)
- Maintain Infrastructure as Code (Terraform, Helm)
- Collaborate closely with Vietnam-based DevOps engineers and the Team Lead in Cyprus
- Drive reliability improvements in a cross-functional environment
Requirements
- Strong hands-on AWS experience (IAM, VPC, EKS/ECS, S3, RDS, CloudWatch)
- Solid Kubernetes production experience
- Strong debugging and incident investigation capability
- Experience improving unstable or fragile systems
- Infrastructure as Code experience (Terraform required; Helm preferred)
- CI/CD ownership experience (GitHub Actions or GitLab CI)
- Observability experience (Prometheus, Grafana, ELK or similar)
- Scripting skills (Python or Bash)
- Strong English communication skills
- Ability to work effectively in a Vietnam-Cyprus collaboration model
Nice to Have
- Experience with SOC 2 or compliance frameworks
- Exposure to security monitoring or SIEM concepts
- FinOps / cloud cost optimization experience
What We're Looking For
- Strong ownership mindset
- Comfortable working in imperfect environments
- Proactive problem solver
- Calm and structured during production issues
- Able to reduce firefighting over time through systematic improvements
Location & Collaboration
- Ho Chi Minh City-based role (hybrid, several office days per week preferred)
- Direct collaboration with the Team Lead in Cyprus
Conditions
- Long-term engagement within a global trading environment
- USD-based compensation
- High-visibility role with meaningful system impact
- Opportunity to shape reliability foundations from the ground up