ABOUT THE JOB
We are looking for a hands‑on Senior DevOps Engineer with strong expertise in Automation and Managed Monitoring / Enterprise Observability. You will play a key role in designing, building, and operating scalable, reliable, and observable platforms across cloud environments.
MAIN RESPONSIBILITIES
- Define/implement tools and processes to standardize and automate the way project software is developed, built, tested, and deployed;
- Ensure a homogeneous way of implementing continuous integration and/or continuous delivery across projects;
- Assist with the design and development of resilient, secure, supportable, and scalable systems;
- Automate infrastructure deployments and rollbacks for all developed work, assuming responsibility for process support;
- Lead investigations into production incidents with assistance from the development team;
- Proactively manage any risks to the production environment;
- Continually improve the supportability of our systems by feeding improvements back into the design and development cycles;
- Fulfil other tasks as assigned from time to time by your People Leader and/or an authorized representative of NAB Vietnam.
YOUR SKILLS & EXPERIENCE
Must-have
- Strong experience with Kubernetes in production; EKS is a plus.
- Hands‑on experience with AWS (core services, networking, IAM, security best practices).
- Infrastructure as Code using Terraform (modules, workspaces, CI/CD integration).
- Solid understanding of SRE/AIOps practices (SLOs, error budgets, runbooks, auto‑remediation).
- Experience building and maintaining CI/CD pipelines.
- OpenTelemetry instrumentation and Collector pipelines (metrics, traces, logs).
- Hands‑on experience with Prometheus and Grafana.
- Experience with the monitoring stack, including dashboards, alerting, and retention: Grafana LGTM (Loki, Grafana, Tempo, Mimir) and OpenSearch; AppDynamics is optional.
- Kafka (Apache Kafka only): brokers, Connect/Streams, Schema Registry; monitoring consumer lag, throughput, error rates, DLQs (with alerting & dashboards).
- Application platforms & deployments: Java (Spring Boot), Node.js (Express/Nest), React (RUM/synthetics, source maps).
- Deployment strategies: blue/green, canary, feature flags; trace‑context propagation.
- Experience with infrastructure/application performance testing (stress & load); baselines/benchmarks and regression detection integrated into CI/CD.
- Effective English communication skills.
Nice-to-have
- Experience supporting large‑scale, enterprise environments.
- Familiarity with multi‑cloud or hybrid cloud architectures.
THE BENEFITS AND PERKS
1. Generous compensation and benefit package
- Attractive salary and benefits
- 20 days of annual leave, 7 days of sick leave, etc.
- 13th-month salary and Annual Performance Bonus
- Premium healthcare for yourself and family members
- Monthly allowance for team activities
- Premium welcome kit and frequent appreciation gifts
- Extra benefits for long-term employees
2. Exciting career and development opportunities
- Large-scale products with modern technologies in the banking domain
- Clear roadmap for career advancement in both technical and leadership pathways
- Well-structured learning and development programs (technical and soft skills)
- Sponsored certificates in both IT and banking/finance
- Premium accounts on Udemy/A Cloud Guru/Coursera/LinkedIn, etc.
- English learning with native teachers
- Opportunities for travel and training in Australia
3. Professional and engaging working environment
- Hybrid working model and good work-life balance
- Well-equipped & modern Agile office with fully stocked pantry
- Special programs to improve your physical and mental health
- Annual company trip and events
- A solid, talented team behind you: great people who love what they do