NAB Innovation Centre Vietnam

Senior Site Reliability Engineer

  • Posted a day ago

Job Description

We're looking for a hands-on, forward‑thinking Senior Site Reliability Engineer to elevate the reliability, automation, and scalability of one of our most strategically important domains.

You'll combine strong engineering capability with servant leadership to guide the team, automate production processes, improve resilience, and drive operational excellence. You'll enjoy solving complex operational challenges with code and mentoring others on engineering best practice in a high‑stakes production environment.

What you'll do

Reliability & Resilience Engineering

  • Design and automate production operational processes, including deployments, monitoring, alerting, and self‑service capabilities.
  • Continuously refine operational practice, balancing ITIL process rigour with lean principles.
  • Improve system resilience, incident recovery, observability, and performance.
  • Deliver resilience and recovery testing, including chaos engineering and performance scenarios.
  • Balance development speed with reliability targets through well-defined SLOs and engineering standards.

Operational Excellence & Observability

  • Analyse metrics across OS, platform, and application layers to support tuning, fault diagnosis, audits, and capacity planning.
  • Oversee the SDLC for reliability-focused features, including code reviews, white-box testing, and maintaining test frameworks.

Change & Incident Management

  • Participate in automated change delivery, including resilience testing, verification, change control, and user communication.
  • Ensure operational readiness as workload and use cases scale, optimising both human and technical resources.

Leadership & Production Ownership

  • Act as Product Owner delegate and champion for production resilience and scale.
  • Provide data‑driven assessments and readiness reports to support program‑level go/no‑go decisions for major releases, migrations, and customer cutovers.
  • Provide technical leadership and mentorship to engineers earlier in their careers.
  • Facilitate blameless post‑mortems and drive engineering‑first problem resolution.

What you'll bring

Essential

  • Strong software engineering background (Java, DevOps, platform engineering, or automation).
  • Proficiency with build and automation tools such as Gradle, Jenkins, Ant, Python/Jython, Artifactory, Terraform, SonarQube.
  • Knowledge of event-driven architectures with experience in Apache Kafka or IBM MQ.
  • Strong Linux (*nix) and cloud hosting skills (AWS preferred).
  • Excellent communication skills, with an ability to collaborate across engineering and business stakeholders.

Why join us

  • Work on systems central to Australia's financial ecosystem.
  • Influence engineering strategy and reliability practices.
  • High-impact technical leadership role with strong career growth.
  • Culture built around collaboration, learning, and blameless practice.

A DIVERSE AND INCLUSIVE WORKPLACE WORKS BETTER FOR EVERYONE

We know that our people make us who we are. That's why we have built a culture of respect – where everyone feels valued and appreciated for being their true authentic selves at NAB. With our focus on inclusion and diversity, and in partnership with our Employee Resource Groups, NAB is a place where First Nations colleagues, colleagues of all genders, sexualities and ages, carers and colleagues with disability, and colleagues from all cultures, races and religions have the opportunity to thrive, connect and grow.

We are intent on providing an environment where you can work your way. Ask about our many flexible work options and please let us know if we can provide any adjustments throughout the recruitment process.

More Info

Job ID: 146435145
