Position Summary
The candidate will be responsible for automating deployments, ensuring the highest reliability
and scalability of our production services, and efficiently managing our cloud platform
infrastructure.
Our ideal candidate is a professional with experience automating deployments using modern
configuration and deployment management systems. The candidate needs broad knowledge of
systems, servers, load balancers, storage, security, and networking, plus some background
in programming. We use cloud infrastructure (AWS), containerization, and CI/CD processes.
Responsibilities
- Build, scale, and monitor various highly complex applications in our cloud platform infrastructure.
- Build and maintain highly available systems on container platforms (Docker & Kubernetes).
- Manage and support multitier architectures, focusing on the web technology stack (CDN, Reverse Proxy, Application, DB).
- Work with application developers to automate and accelerate the testing, release, and deployment of applications into a runtime environment, quickly and reliably.
- Improve reliability and performance of test and build processes.
- Design and maintain automated release channels.
- Proactively look for ways to automate the installation and upkeep of build tools and dependencies.
- Review and recommend solutions and tools to improve the software development process.
- Manage pre- and post-release code merges and code branching strategies.
- Mentor and teach existing team members. The ideal candidate must have experience clearly explaining solutions to complex problems and must demonstrate the ability to lead and impart knowledge effectively to junior team members.
Skill Requirements
- Bachelor's/College degree in Computer Science, IT, or a related field
- 3+ years of DevOps experience with CI/CD & Automation: Strong hands-on experience building CI/CD workflows and automation using scripting languages (Python, Go, Java, PowerShell, etc.)
- Solid experience administering and optimizing Linux-based systems for stable build and deployment environments.
- Hands-on experience deploying and managing Kubernetes clusters in production.
- Practical experience deploying and supporting solutions on AWS (EC2, S3, EKS/Kubernetes, RDS, IAM, CLI/Console).
- Ability to automate provisioning and manage hybrid cloud (AWS + On-Prem) infrastructure at scale.
- Strong proficiency with Terraform and Ansible.
- Experience using Prometheus and Grafana to track system availability, latency, performance, and overall health.
- Familiarity with Kafka, Flink, Cassandra (ClickHouse is a plus).
- Understanding of branching strategies, pre/post-release processes, and automated release channels.
- Ability to troubleshoot complex issues across OS, networking, and databases in cloud-based environments.
- Skilled in performing root cause analysis and improving deployment and operation workflows.
Benefits
- Attractive compensation, regular assessments, and salary reviews;
- 13th-month salary, annual bonus, and performance bonus;
- 20 days annual leave;
- 100% social insurance, premium healthcare insurance, and annual routine check-up;
- Company activities: annual team building, New Year party, quarterly company party, weekly fruit day, monthly birthday celebrations, etc.;
- Sports activities: badminton, football;
- Special celebrations on 8/3, Father's Day, 20/10, Christmas, Tet holiday, etc.;
- Unlimited access to a selection of food and beverages;
- International working environment with a young, friendly, dynamic, and creative team.