Key Responsibilities:
Leadership & Team Management:
- Lead, mentor, and develop a team of infrastructure engineers, driving high performance and continuous improvement
- Conduct performance reviews, career development planning, and regular 1:1s
- Manage resource allocation, scheduling, and workload to ensure 24/7 operational stability
- Promote best practices in incident management, change control, and operational documentation
Infrastructure Operations:
- Own end-to-end infrastructure operations across on-premise and cloud environments (Azure, AliCloud)
- Ensure high availability, reliability, and security of core systems (compute, storage, network, virtualization, databases)
- Act as the technical escalation point for complex infrastructure issues
- Oversee incident management processes, ensuring timely resolution and continuous improvement
Stakeholder Management:
- Serve as the main technical point of contact for clients and internal stakeholders
- Communicate effectively in English on system performance, incidents, and operational updates
- Provide clear technical recommendations and reporting
Requirements:
Experience:
- 5+ years in IT infrastructure operations, including 2+ years in leadership roles
- Proven experience managing enterprise-scale infrastructure environments
Technical Skills:
- Strong knowledge of infrastructure domains (compute, storage, networking, virtualization)
- Hands-on experience with cloud platforms (Azure, AliCloud, or similar)
- Solid understanding of monitoring, backup/recovery, and capacity planning
- Familiarity with ITIL practices (Incident, Change, Problem Management)
Other:
- Strong English communication skills for client-facing environments
- Leadership mindset with strong ownership and problem-solving capability