Handshakes by DC Frontiers is an award-winning DataTech company that leverages data to empower safe, informed business decisions.
We are currently looking for a System Engineer to support the daily operational management of our cloud infrastructure and enterprise systems.
We operate within a defined cloud operating model that separates platform engineering (build), security governance (control), and reliability operations (run), ensuring clarity of accountability and high operational standards.
This role is ideal for engineers eager to develop strong operational discipline, troubleshooting capability, and production environment awareness.
Key Responsibilities
Operational Monitoring and Incident Response
- Continuously monitor infrastructure, application, and platform health dashboards to identify anomalies, performance degradation, or service disruptions.
- Respond to alerts and ensuring timely acknowledgement and appropriate triage of incidents.
- Escalate complex or high-impact issues while maintaining accurate documentation of findings and actions taken.
Patch and Maintenance Execution
- Execute routine operating system and application patching activities in accordance with defined maintenance schedules and security policies.
- Validate system stability post-maintenance and report any deviations or risks identified during execution.
Deployment Support
- Assist in infrastructure and application deployment and coordinating with Platform Engineering and development teams.
- Ensure deployments are completed accurately and timely.
System Health Checks and Preventative Maintenance
- Perform routine system checks, validate backup completion status, review capacity metrics, and ensure scheduled jobs and services are operating as expected.
- Proactively raise concerns when trends indicate potential performance or availability risks.
Operational Documentation
- Maintain and update runbooks, standard operating procedures, and operational records to ensure repeatable and auditable execution of tasks.
- Contribute to improving clarity and usability of operational documentation.
Vulnerability Remediation Support
- Execute assigned remediation tasks arising from vulnerability scans or security findings, ensuring alignment with patch and remediation timelines defined by the Security team.
Required Skills & Experiences
- Bachelor's degree in Computer Science or related field
- Min 1 year of working experience in Cloud Platforms
- Foundational knowledge of cloud platforms (AWS preferred), including compute, storage, and networking concepts
- Basic understanding of operating systems administration (Linux and/or Windows)
- Familiarity with monitoring or observability tools
- Understanding of incident management fundamentals
- Strong troubleshooting and analytical skills
- Ability to follow structured processes and escalate appropriately
- Proficiency in English language (written & verbal)
Nice-to-Have
- Exposure to infrastructure-as-code concepts
- Basic scripting knowledge (Python, Bash, PowerShell)
- Understanding of backup and disaster recovery concepts